Don't forget to check these 10 SQL projects with corresponding datasets that you could use to practice your SQL skills:
1. Analysis of Sales Data:
(https://www.kaggle.com/kyanyoga/sample-sales-data)
2. HR Analytics:
(https://www.kaggle.com/pavansubhasht/ibm-hr-analytics-attrition-dataset)
3. Social Media Analytics:
(https://www.kaggle.com/datasets/ramjasmaurya/top-1000-social-media-channels)
4. Financial Data Analysis:
(https://www.kaggle.com/datasets/nitindatta/finance-data)
5. Healthcare Data Analysis:
(https://www.kaggle.com/cdc/mortality)
6. Customer Relationship Management:
(https://www.kaggle.com/pankajjsh06/ibm-watson-marketing-customer-value-data)
7. Web Analytics:
(https://www.kaggle.com/zynicide/wine-reviews)
8. E-commerce Analysis:
(https://www.kaggle.com/olistbr/brazilian-ecommerce)
9. Supply Chain Management:
(https://www.kaggle.com/datasets/harshsingh2209/supply-chain-analysis)
10. Inventory Management:
(https://www.kaggle.com/datasets?search=inventory+management)
Share this channel with your friends 🤝🤩
Join for more -> https://whatsapp.com/channel/0029VaxbzNFCxoAmYgiGTL3Z
ENJOY LEARNING 👍👍
1. Analysis of Sales Data:
(https://www.kaggle.com/kyanyoga/sample-sales-data)
2. HR Analytics:
(https://www.kaggle.com/pavansubhasht/ibm-hr-analytics-attrition-dataset)
3. Social Media Analytics:
(https://www.kaggle.com/datasets/ramjasmaurya/top-1000-social-media-channels)
4. Financial Data Analysis:
(https://www.kaggle.com/datasets/nitindatta/finance-data)
5. Healthcare Data Analysis:
(https://www.kaggle.com/cdc/mortality)
6. Customer Relationship Management:
(https://www.kaggle.com/pankajjsh06/ibm-watson-marketing-customer-value-data)
7. Web Analytics:
(https://www.kaggle.com/zynicide/wine-reviews)
8. E-commerce Analysis:
(https://www.kaggle.com/olistbr/brazilian-ecommerce)
9. Supply Chain Management:
(https://www.kaggle.com/datasets/harshsingh2209/supply-chain-analysis)
10. Inventory Management:
(https://www.kaggle.com/datasets?search=inventory+management)
Share this channel with your friends 🤝🤩
Join for more -> https://whatsapp.com/channel/0029VaxbzNFCxoAmYgiGTL3Z
ENJOY LEARNING 👍👍
❤2
The best fine-tuning guide you'll find on arXiv this year.
Covers:
> NLP basics
> PEFT/LoRA/QLoRA techniques
> Mixture of Experts
> Seven-stage fine-tuning pipeline
Source: https://arxiv.org/pdf/2408.13296v1
Covers:
> NLP basics
> PEFT/LoRA/QLoRA techniques
> Mixture of Experts
> Seven-stage fine-tuning pipeline
Source: https://arxiv.org/pdf/2408.13296v1
❤3
Kandinsky 5.0 Video Lite and Kandinsky 5.0 Video Pro generative models on the global text-to-video landscape
🔘Pro is currently the #1 open-source model worldwide
🔘Lite (2B parameters) outperforms Sora v1.
🔘Only Google (Veo 3.1, Veo 3), OpenAI (Sora 2), Alibaba (Wan 2.5), and KlingAI (Kling 2.5, 2.6) outperform Pro — these are objectively the strongest video generation models in production today. We are on par with Luma AI (Ray 3) and MiniMax (Hailuo 2.3): the maximum ELO gap is 3 points, with a 95% CI of ±21.
Useful links
🔘Full leaderboard: LM Arena
🔘Kandinsky 5.0 details: technical report
🔘Open-source Kandinsky 5.0: GitHub and Hugging Face
🔘Pro is currently the #1 open-source model worldwide
🔘Lite (2B parameters) outperforms Sora v1.
🔘Only Google (Veo 3.1, Veo 3), OpenAI (Sora 2), Alibaba (Wan 2.5), and KlingAI (Kling 2.5, 2.6) outperform Pro — these are objectively the strongest video generation models in production today. We are on par with Luma AI (Ray 3) and MiniMax (Hailuo 2.3): the maximum ELO gap is 3 points, with a 95% CI of ±21.
Useful links
🔘Full leaderboard: LM Arena
🔘Kandinsky 5.0 details: technical report
🔘Open-source Kandinsky 5.0: GitHub and Hugging Face
❤3
📈 Data Visualisation Cheatsheet: 13 Must-Know Chart Types ✅
1️⃣ Gantt Chart
Tracks project schedules over time.
🔹 Advantage: Clarifies timelines & tasks
🔹 Use case: Project management & planning
2️⃣ Bubble Chart
Shows data with bubble size variations.
🔹 Advantage: Displays 3 data dimensions
🔹 Use case: Comparing social media engagement
3️⃣ Scatter Plots
Plots data points on two axes.
🔹 Advantage: Identifies correlations & clusters
🔹 Use case: Analyzing variable relationships
4️⃣ Histogram Chart
Visualizes data distribution in bins.
🔹 Advantage: Easy to see frequency
🔹 Use case: Understanding age distribution in surveys
5️⃣ Bar Chart
Uses rectangular bars to visualize data.
🔹 Advantage: Easy comparison across groups
🔹 Use case: Comparing sales across regions
6️⃣ Line Chart
Shows trends over time with lines.
🔹 Advantage: Clear display of data changes
🔹 Use case: Tracking stock market performance
7️⃣ Pie Chart
Represents data in circular segments.
🔹 Advantage: Simple proportion visualization
🔹 Use case: Displaying market share distribution
8️⃣ Maps
Geographic data representation on maps.
🔹 Advantage: Recognizes spatial patterns
🔹 Use case: Visualizing population density by area
9️⃣ Bullet Charts
Measures performance against a target.
🔹 Advantage: Compact alternative to gauges
🔹 Use case: Tracking sales vs quotas
🔟 Highlight Table
Colors tabular data based on values.
🔹 Advantage: Quickly identifies highs & lows
🔹 Use case: Heatmapping survey responses
1️⃣1️⃣ Tree Maps
Hierarchical data with nested rectangles.
🔹 Advantage: Efficient space usage
🔹 Use case: Displaying file system usage
1️⃣2️⃣ Box & Whisker Plot
Summarizes data distribution & outliers.
🔹 Advantage: Concise data spread representation
🔹 Use case: Comparing exam scores across classes
1️⃣3️⃣ Waterfall Charts / Walks
Visualizes sequential cumulative effect.
🔹 Advantage: Clarifies source of final value
🔹 Use case: Understanding profit & loss components
💡 Use the right chart to tell your data story clearly.
Power BI Resources: https://whatsapp.com/channel/0029Vai1xKf1dAvuk6s1v22c
Tap ♥️ for more!
1️⃣ Gantt Chart
Tracks project schedules over time.
🔹 Advantage: Clarifies timelines & tasks
🔹 Use case: Project management & planning
2️⃣ Bubble Chart
Shows data with bubble size variations.
🔹 Advantage: Displays 3 data dimensions
🔹 Use case: Comparing social media engagement
3️⃣ Scatter Plots
Plots data points on two axes.
🔹 Advantage: Identifies correlations & clusters
🔹 Use case: Analyzing variable relationships
4️⃣ Histogram Chart
Visualizes data distribution in bins.
🔹 Advantage: Easy to see frequency
🔹 Use case: Understanding age distribution in surveys
5️⃣ Bar Chart
Uses rectangular bars to visualize data.
🔹 Advantage: Easy comparison across groups
🔹 Use case: Comparing sales across regions
6️⃣ Line Chart
Shows trends over time with lines.
🔹 Advantage: Clear display of data changes
🔹 Use case: Tracking stock market performance
7️⃣ Pie Chart
Represents data in circular segments.
🔹 Advantage: Simple proportion visualization
🔹 Use case: Displaying market share distribution
8️⃣ Maps
Geographic data representation on maps.
🔹 Advantage: Recognizes spatial patterns
🔹 Use case: Visualizing population density by area
9️⃣ Bullet Charts
Measures performance against a target.
🔹 Advantage: Compact alternative to gauges
🔹 Use case: Tracking sales vs quotas
🔟 Highlight Table
Colors tabular data based on values.
🔹 Advantage: Quickly identifies highs & lows
🔹 Use case: Heatmapping survey responses
1️⃣1️⃣ Tree Maps
Hierarchical data with nested rectangles.
🔹 Advantage: Efficient space usage
🔹 Use case: Displaying file system usage
1️⃣2️⃣ Box & Whisker Plot
Summarizes data distribution & outliers.
🔹 Advantage: Concise data spread representation
🔹 Use case: Comparing exam scores across classes
1️⃣3️⃣ Waterfall Charts / Walks
Visualizes sequential cumulative effect.
🔹 Advantage: Clarifies source of final value
🔹 Use case: Understanding profit & loss components
💡 Use the right chart to tell your data story clearly.
Power BI Resources: https://whatsapp.com/channel/0029Vai1xKf1dAvuk6s1v22c
Tap ♥️ for more!
❤8
Data Analyst Roadmap 📊
📂 Python Basics
∟📂 Numpy & Pandas
∟📂 Data Cleaning
∟📂 Data Visualization (Matplotlib, Seaborn)
∟📂 SQL for Data Analysis
∟📂 Excel & Google Sheets
∟📂 Statistics for Analysis
∟📂 BI Tools (Power BI / Tableau)
∟📂 Real-World Projects
∟✅ Apply for Data Analyst Roles
❤️ React for More!
📂 Python Basics
∟📂 Numpy & Pandas
∟📂 Data Cleaning
∟📂 Data Visualization (Matplotlib, Seaborn)
∟📂 SQL for Data Analysis
∟📂 Excel & Google Sheets
∟📂 Statistics for Analysis
∟📂 BI Tools (Power BI / Tableau)
∟📂 Real-World Projects
∟✅ Apply for Data Analyst Roles
❤️ React for More!
❤4
Data Analyst Roadmap
Like if it helps ❤️
Like if it helps ❤️
❤5👏1
Important Topics to become a data scientist
[Advanced Level]
👇👇
1. Mathematics
Linear Algebra
Analytic Geometry
Matrix
Vector Calculus
Optimization
Regression
Dimensionality Reduction
Density Estimation
Classification
2. Probability
Introduction to Probability
1D Random Variable
The function of One Random Variable
Joint Probability Distribution
Discrete Distribution
Normal Distribution
3. Statistics
Introduction to Statistics
Data Denoscription
Random Samples
Sampling Distribution
Parameter Estimation
Hypotheses Testing
Regression
4. Programming
Python:
Python Basics
List
Set
Tuples
Dictionary
Function
NumPy
Pandas
Matplotlib/Seaborn
R Programming:
R Basics
Vector
List
Data Frame
Matrix
Array
Function
dplyr
ggplot2
Tidyr
Shiny
DataBase:
SQL
MongoDB
Data Structures
Web scraping
Linux
Git
5. Machine Learning
How Model Works
Basic Data Exploration
First ML Model
Model Validation
Underfitting & Overfitting
Random Forest
Handling Missing Values
Handling Categorical Variables
Pipelines
Cross-Validation(R)
XGBoost(Python|R)
Data Leakage
6. Deep Learning
Artificial Neural Network
Convolutional Neural Network
Recurrent Neural Network
TensorFlow
Keras
PyTorch
A Single Neuron
Deep Neural Network
Stochastic Gradient Descent
Overfitting and Underfitting
Dropout Batch Normalization
Binary Classification
7. Feature Engineering
Baseline Model
Categorical Encodings
Feature Generation
Feature Selection
8. Natural Language Processing
Text Classification
Word Vectors
9. Data Visualization Tools
BI (Business Intelligence):
Tableau
Power BI
Qlik View
Qlik Sense
10. Deployment
Microsoft Azure
Heroku
Google Cloud Platform
Flask
Django
Join @datasciencefun to learning important data science and machine learning concepts
ENJOY LEARNING 👍👍
[Advanced Level]
👇👇
1. Mathematics
Linear Algebra
Analytic Geometry
Matrix
Vector Calculus
Optimization
Regression
Dimensionality Reduction
Density Estimation
Classification
2. Probability
Introduction to Probability
1D Random Variable
The function of One Random Variable
Joint Probability Distribution
Discrete Distribution
Normal Distribution
3. Statistics
Introduction to Statistics
Data Denoscription
Random Samples
Sampling Distribution
Parameter Estimation
Hypotheses Testing
Regression
4. Programming
Python:
Python Basics
List
Set
Tuples
Dictionary
Function
NumPy
Pandas
Matplotlib/Seaborn
R Programming:
R Basics
Vector
List
Data Frame
Matrix
Array
Function
dplyr
ggplot2
Tidyr
Shiny
DataBase:
SQL
MongoDB
Data Structures
Web scraping
Linux
Git
5. Machine Learning
How Model Works
Basic Data Exploration
First ML Model
Model Validation
Underfitting & Overfitting
Random Forest
Handling Missing Values
Handling Categorical Variables
Pipelines
Cross-Validation(R)
XGBoost(Python|R)
Data Leakage
6. Deep Learning
Artificial Neural Network
Convolutional Neural Network
Recurrent Neural Network
TensorFlow
Keras
PyTorch
A Single Neuron
Deep Neural Network
Stochastic Gradient Descent
Overfitting and Underfitting
Dropout Batch Normalization
Binary Classification
7. Feature Engineering
Baseline Model
Categorical Encodings
Feature Generation
Feature Selection
8. Natural Language Processing
Text Classification
Word Vectors
9. Data Visualization Tools
BI (Business Intelligence):
Tableau
Power BI
Qlik View
Qlik Sense
10. Deployment
Microsoft Azure
Heroku
Google Cloud Platform
Flask
Django
Join @datasciencefun to learning important data science and machine learning concepts
ENJOY LEARNING 👍👍
❤1👍1
📈 Want to Excel at Data Analytics? Master These Essential Skills! ☑️
Core Concepts:
• Statistics & Probability – Understand distributions, hypothesis testing
• Excel – Pivot tables, formulas, dashboards
Programming:
• Python – NumPy, Pandas, Matplotlib, Seaborn
• R – Data analysis & visualization
• SQL – Joins, filtering, aggregation
Data Cleaning & Wrangling:
• Handle missing values, duplicates
• Normalize and transform data
Visualization:
• Power BI, Tableau – Dashboards
• Plotly, Seaborn – Python visualizations
• Data Storytelling – Present insights clearly
Advanced Analytics:
• Regression, Classification, Clustering
• Time Series Forecasting
• A/B Testing & Hypothesis Testing
ETL & Automation:
• Web Scraping – BeautifulSoup, Scrapy
• APIs – Fetch and process real-world data
• Build ETL Pipelines
Tools & Deployment:
• Jupyter Notebook / Colab
• Git & GitHub
• Cloud Platforms – AWS, GCP, Azure
• Google BigQuery, Snowflake
Hope it helps :)
Core Concepts:
• Statistics & Probability – Understand distributions, hypothesis testing
• Excel – Pivot tables, formulas, dashboards
Programming:
• Python – NumPy, Pandas, Matplotlib, Seaborn
• R – Data analysis & visualization
• SQL – Joins, filtering, aggregation
Data Cleaning & Wrangling:
• Handle missing values, duplicates
• Normalize and transform data
Visualization:
• Power BI, Tableau – Dashboards
• Plotly, Seaborn – Python visualizations
• Data Storytelling – Present insights clearly
Advanced Analytics:
• Regression, Classification, Clustering
• Time Series Forecasting
• A/B Testing & Hypothesis Testing
ETL & Automation:
• Web Scraping – BeautifulSoup, Scrapy
• APIs – Fetch and process real-world data
• Build ETL Pipelines
Tools & Deployment:
• Jupyter Notebook / Colab
• Git & GitHub
• Cloud Platforms – AWS, GCP, Azure
• Google BigQuery, Snowflake
Hope it helps :)
❤3