⭐⭐⭐ Advance Level Data science Projects ⭐⭐⭐
1) Identify your Digits Dataset : https://www.kaggle.com/c/digit-recognizer/data
2) Recommendation Engine : https://cseweb.ucsd.edu/~jmcauley/datasets.html
3) Visual QA : https://visualqa.org/download.html
4) Vox Celebrity : http://www.robots.ox.ac.uk/~vgg/data/voxceleb/
5) Breast cancer classification : https://www.kaggle.com/martinab/breast-cancer-classification-wisconsin-dataset
6) Traffic signals : http://benchmark.ini.rub.de/?section=gtsrb&subsection=dataset
7) Image caption generator : https://academictorrents.com/details/9dea07ba660a722ae1008c4c8afdd303b6f6e53b
1) Identify your Digits Dataset : https://www.kaggle.com/c/digit-recognizer/data
2) Recommendation Engine : https://cseweb.ucsd.edu/~jmcauley/datasets.html
3) Visual QA : https://visualqa.org/download.html
4) Vox Celebrity : http://www.robots.ox.ac.uk/~vgg/data/voxceleb/
5) Breast cancer classification : https://www.kaggle.com/martinab/breast-cancer-classification-wisconsin-dataset
6) Traffic signals : http://benchmark.ini.rub.de/?section=gtsrb&subsection=dataset
7) Image caption generator : https://academictorrents.com/details/9dea07ba660a722ae1008c4c8afdd303b6f6e53b
👍7
Practice projects to consider:
1. Implement a basic search engine: Read a set of documents and build an index of keywords. Then, implement a search function that returns a list of documents that match the query.
2. Build a recommendation system: Read a set of user-item interactions and build a recommendation system that suggests items to users based on their past behavior.
3. Create a data analysis tool: Read a large dataset and implement a tool that performs various analyses, such as calculating summary statistics, visualizing distributions, and identifying patterns and correlations.
4. Implement a graph algorithm: Study a graph algorithm such as Dijkstra's shortest path algorithm, and implement it in Python. Then, test it on real-world graphs to see how it performs.
1. Implement a basic search engine: Read a set of documents and build an index of keywords. Then, implement a search function that returns a list of documents that match the query.
2. Build a recommendation system: Read a set of user-item interactions and build a recommendation system that suggests items to users based on their past behavior.
3. Create a data analysis tool: Read a large dataset and implement a tool that performs various analyses, such as calculating summary statistics, visualizing distributions, and identifying patterns and correlations.
4. Implement a graph algorithm: Study a graph algorithm such as Dijkstra's shortest path algorithm, and implement it in Python. Then, test it on real-world graphs to see how it performs.
👍6
Data Analytics Projects for Beginners 👇
Web Scraping
https://github.com/shreyaswankhede/IMDb-Web-Scraping-and-Sentiment-Analysis
Product Price Scraping and Analysis
https://github.com/CodesdaLu/Web-Scrapping
News Scraping
https://github.com/rohit-yadav/scraping-news-articles
Real Time Stock Price Scraping with Python
https://youtu.be/rONhdonaWUo?si=A3oDEVbLIAP78cCz
Zomato Analysis
https://youtu.be/fFi_TBw27is?si=E0iLd3J06YHfQkRk
IPL Analysis
https://github.com/Yashmenaria1/IPL-Data-Exploration
https://www.youtube.com/watch?v=ur-v0dv0Qtw
https://www.youtube.com/watch?v=ur-v0dv0Qtw
Football Data Analysis
https://youtu.be/yat7soj__4w?si=h5CLIvVFzzKm8IEP
Market Basket Analysis
https://youtu.be/Ne8Sbp2hJIk?si=ThEuvdOnRrpcVjOg
Customer Churn Prediction
https://github.com/Pradnya1208/Telecom-Customer-Churn-prediction
Employee’s Performance for HR Analytics
https://www.kaggle.com/code/rajatraj0502/employee-s-performance-for-hr-analytics
Food Price Prediction
https://github.com/VectorInstitute/foodprice-forecasting
Web Scraping
https://github.com/shreyaswankhede/IMDb-Web-Scraping-and-Sentiment-Analysis
Product Price Scraping and Analysis
https://github.com/CodesdaLu/Web-Scrapping
News Scraping
https://github.com/rohit-yadav/scraping-news-articles
Real Time Stock Price Scraping with Python
https://youtu.be/rONhdonaWUo?si=A3oDEVbLIAP78cCz
Zomato Analysis
https://youtu.be/fFi_TBw27is?si=E0iLd3J06YHfQkRk
IPL Analysis
https://github.com/Yashmenaria1/IPL-Data-Exploration
https://www.youtube.com/watch?v=ur-v0dv0Qtw
https://www.youtube.com/watch?v=ur-v0dv0Qtw
Football Data Analysis
https://youtu.be/yat7soj__4w?si=h5CLIvVFzzKm8IEP
Market Basket Analysis
https://youtu.be/Ne8Sbp2hJIk?si=ThEuvdOnRrpcVjOg
Customer Churn Prediction
https://github.com/Pradnya1208/Telecom-Customer-Churn-prediction
Employee’s Performance for HR Analytics
https://www.kaggle.com/code/rajatraj0502/employee-s-performance-for-hr-analytics
Food Price Prediction
https://github.com/VectorInstitute/foodprice-forecasting
👍8❤1
If you want to get a job as a machine learning engineer, don’t start by diving into the hottest libraries like PyTorch,TensorFlow, Langchain, etc.
Yes, you might hear a lot about them or some other trending technology of the year...but guess what!
Technologies evolve rapidly, especially in the age of AI, but core concepts are always seen as more valuable than expertise in any particular tool. Stop trying to perform a brain surgery without knowing anything about human anatomy.
Instead, here are basic skills that will get you further than mastering any framework:
𝐌𝐚𝐭𝐡𝐞𝐦𝐚𝐭𝐢𝐜𝐬 𝐚𝐧𝐝 𝐒𝐭𝐚𝐭𝐢𝐬𝐭𝐢𝐜𝐬 - My first exposure to probability and statistics was in college, and it felt abstract at the time, but these concepts are the backbone of ML.
You can start here: Khan Academy Statistics and Probability - https://www.khanacademy.org/math/statistics-probability
𝐋𝐢𝐧𝐞𝐚𝐫 𝐀𝐥𝐠𝐞𝐛𝐫𝐚 𝐚𝐧𝐝 𝐂𝐚𝐥𝐜𝐮𝐥𝐮𝐬 - Concepts like matrices, vectors, eigenvalues, and derivatives are fundamental to understanding how ml algorithms work. These are used in everything from simple regression to deep learning.
𝐏𝐫𝐨𝐠𝐫𝐚𝐦𝐦𝐢𝐧𝐠 - Should you learn Python, Rust, R, Julia, JavaScript, etc.? The best advice is to pick the language that is most frequently used for the type of work you want to do. I started with Python due to its simplicity and extensive library support, and it remains my go-to language for machine learning tasks.
You can start here: Automate the Boring Stuff with Python - https://automatetheboringstuff.com/
𝐀𝐥𝐠𝐨𝐫𝐢𝐭𝐡𝐦 𝐔𝐧𝐝𝐞𝐫𝐬𝐭𝐚𝐧𝐝𝐢𝐧𝐠 - Understand the fundamental algorithms before jumping to deep learning. This includes linear regression, decision trees, SVMs, and clustering algorithms.
𝐃𝐞𝐩𝐥𝐨𝐲𝐦𝐞𝐧𝐭 𝐚𝐧𝐝 𝐏𝐫𝐨𝐝𝐮𝐜𝐭𝐢𝐨𝐧:
Knowing how to take a model from development to production is invaluable. This includes understanding APIs, model optimization, and monitoring. Tools like Docker and Flask are often used in this process.
𝐂𝐥𝐨𝐮𝐝 𝐂𝐨𝐦𝐩𝐮𝐭𝐢𝐧𝐠 𝐚𝐧𝐝 𝐁𝐢𝐠 𝐃𝐚𝐭𝐚:
Familiarity with cloud platforms (AWS, Google Cloud, Azure) and big data tools (Spark) is increasingly important as datasets grow larger. These skills help you manage and process large-scale data efficiently.
You can start here: Google Cloud Machine Learning - https://cloud.google.com/learn/training/machinelearning-ai
I love frameworks and libraries, and they can make anyone's job easier.
But the more solid your foundation, the easier it will be to pick up any new technologies and actually validate whether they solve your problems.
Best Data Science & Machine Learning Resources: https://topmate.io/coding/914624
All the best 👍👍
Yes, you might hear a lot about them or some other trending technology of the year...but guess what!
Technologies evolve rapidly, especially in the age of AI, but core concepts are always seen as more valuable than expertise in any particular tool. Stop trying to perform a brain surgery without knowing anything about human anatomy.
Instead, here are basic skills that will get you further than mastering any framework:
𝐌𝐚𝐭𝐡𝐞𝐦𝐚𝐭𝐢𝐜𝐬 𝐚𝐧𝐝 𝐒𝐭𝐚𝐭𝐢𝐬𝐭𝐢𝐜𝐬 - My first exposure to probability and statistics was in college, and it felt abstract at the time, but these concepts are the backbone of ML.
You can start here: Khan Academy Statistics and Probability - https://www.khanacademy.org/math/statistics-probability
𝐋𝐢𝐧𝐞𝐚𝐫 𝐀𝐥𝐠𝐞𝐛𝐫𝐚 𝐚𝐧𝐝 𝐂𝐚𝐥𝐜𝐮𝐥𝐮𝐬 - Concepts like matrices, vectors, eigenvalues, and derivatives are fundamental to understanding how ml algorithms work. These are used in everything from simple regression to deep learning.
𝐏𝐫𝐨𝐠𝐫𝐚𝐦𝐦𝐢𝐧𝐠 - Should you learn Python, Rust, R, Julia, JavaScript, etc.? The best advice is to pick the language that is most frequently used for the type of work you want to do. I started with Python due to its simplicity and extensive library support, and it remains my go-to language for machine learning tasks.
You can start here: Automate the Boring Stuff with Python - https://automatetheboringstuff.com/
𝐀𝐥𝐠𝐨𝐫𝐢𝐭𝐡𝐦 𝐔𝐧𝐝𝐞𝐫𝐬𝐭𝐚𝐧𝐝𝐢𝐧𝐠 - Understand the fundamental algorithms before jumping to deep learning. This includes linear regression, decision trees, SVMs, and clustering algorithms.
𝐃𝐞𝐩𝐥𝐨𝐲𝐦𝐞𝐧𝐭 𝐚𝐧𝐝 𝐏𝐫𝐨𝐝𝐮𝐜𝐭𝐢𝐨𝐧:
Knowing how to take a model from development to production is invaluable. This includes understanding APIs, model optimization, and monitoring. Tools like Docker and Flask are often used in this process.
𝐂𝐥𝐨𝐮𝐝 𝐂𝐨𝐦𝐩𝐮𝐭𝐢𝐧𝐠 𝐚𝐧𝐝 𝐁𝐢𝐠 𝐃𝐚𝐭𝐚:
Familiarity with cloud platforms (AWS, Google Cloud, Azure) and big data tools (Spark) is increasingly important as datasets grow larger. These skills help you manage and process large-scale data efficiently.
You can start here: Google Cloud Machine Learning - https://cloud.google.com/learn/training/machinelearning-ai
I love frameworks and libraries, and they can make anyone's job easier.
But the more solid your foundation, the easier it will be to pick up any new technologies and actually validate whether they solve your problems.
Best Data Science & Machine Learning Resources: https://topmate.io/coding/914624
All the best 👍👍
👍5❤1
Top 10 Programming Languages to learn in 2025 (With Free Resources to learn) :-
1. Python
- learnpython.org
- t.me/pythonfreebootcamp
2. Java
- learnjavaonline.org
- t.me/free4unow_backup/550
3. C#
- learncs.org
- w3schools.com
4. JavaScript
- learnjavanoscript.online
- t.me/javanoscript_courses
5. Rust
- rust-lang.org
- exercism.org
6. Go Programming
- go.dev
- learn-golang.org
7. Kotlin
- kotlinlang.org
- w3schools.com/KOTLIN
8. TypeScript
- Typenoscriptlang.org
- learntypenoscript.dev
9. SQL
- datasimplifier.com
- t.me/sqlanalyst
10. R Programming
- w3schools.com/r/
- r-coder.com
ENJOY LEARNING 👍👍
1. Python
- learnpython.org
- t.me/pythonfreebootcamp
2. Java
- learnjavaonline.org
- t.me/free4unow_backup/550
3. C#
- learncs.org
- w3schools.com
4. JavaScript
- learnjavanoscript.online
- t.me/javanoscript_courses
5. Rust
- rust-lang.org
- exercism.org
6. Go Programming
- go.dev
- learn-golang.org
7. Kotlin
- kotlinlang.org
- w3schools.com/KOTLIN
8. TypeScript
- Typenoscriptlang.org
- learntypenoscript.dev
9. SQL
- datasimplifier.com
- t.me/sqlanalyst
10. R Programming
- w3schools.com/r/
- r-coder.com
ENJOY LEARNING 👍👍
👍4❤1
Randomized experiments are the gold standard for measuring impact. Here’s how to measure impact with randomized trials. 👇
𝟏. 𝐃𝐞𝐬𝐢𝐠𝐧 𝐄𝐱𝐩𝐞𝐫𝐢𝐦𝐞𝐧𝐭
Planning the structure and methodology of the experiment, including defining the hypothesis, selecting metrics, and conducting a power analysis to determine sample size.
⤷ Ensures the experiment is well-structured and statistically sound, minimizing bias and maximizing reliability.
𝟐. 𝐈𝐦𝐩𝐥𝐞𝐦𝐞𝐧𝐭 𝐕𝐚𝐫𝐢𝐚𝐧𝐭𝐬
Creating different versions of the intervention by developing and deploying the control (A) and treatment (B) versions.
⤷ Allows for a clear comparison between the current state and the proposed change.
𝟑. 𝐂𝐨𝐧𝐝𝐮𝐜𝐭 𝐓𝐞𝐬𝐭
Choosing the right statistical test and calculating test statistics, such as confidence intervals, p-values, and effect sizes.
⤷ Ensures the results are statistically valid and interpretable.
𝟒. 𝐀𝐧𝐚𝐥𝐲𝐳𝐞 𝐑𝐞𝐬𝐮𝐥𝐭𝐬
Evaluating the data collected from the experiment, interpreting confidence intervals, p-values, and effect sizes to determine statistical significance and practical impact.
⤷ Helps determine whether the observed changes are meaningful and should be implemented.
𝟓. 𝐀𝐝𝐝𝐢𝐭𝐢𝐨𝐧𝐚𝐥 𝐅𝐚𝐜𝐭𝐨𝐫𝐬
⤷ Network Effects: User interactions affecting experiment outcomes.
⤷ P-Hacking: Manipulating data for significant results.
⤷ Novelty Effects: Temporary boost from new features.
Hope this helps you 😊
𝟏. 𝐃𝐞𝐬𝐢𝐠𝐧 𝐄𝐱𝐩𝐞𝐫𝐢𝐦𝐞𝐧𝐭
Planning the structure and methodology of the experiment, including defining the hypothesis, selecting metrics, and conducting a power analysis to determine sample size.
⤷ Ensures the experiment is well-structured and statistically sound, minimizing bias and maximizing reliability.
𝟐. 𝐈𝐦𝐩𝐥𝐞𝐦𝐞𝐧𝐭 𝐕𝐚𝐫𝐢𝐚𝐧𝐭𝐬
Creating different versions of the intervention by developing and deploying the control (A) and treatment (B) versions.
⤷ Allows for a clear comparison between the current state and the proposed change.
𝟑. 𝐂𝐨𝐧𝐝𝐮𝐜𝐭 𝐓𝐞𝐬𝐭
Choosing the right statistical test and calculating test statistics, such as confidence intervals, p-values, and effect sizes.
⤷ Ensures the results are statistically valid and interpretable.
𝟒. 𝐀𝐧𝐚𝐥𝐲𝐳𝐞 𝐑𝐞𝐬𝐮𝐥𝐭𝐬
Evaluating the data collected from the experiment, interpreting confidence intervals, p-values, and effect sizes to determine statistical significance and practical impact.
⤷ Helps determine whether the observed changes are meaningful and should be implemented.
𝟓. 𝐀𝐝𝐝𝐢𝐭𝐢𝐨𝐧𝐚𝐥 𝐅𝐚𝐜𝐭𝐨𝐫𝐬
⤷ Network Effects: User interactions affecting experiment outcomes.
⤷ P-Hacking: Manipulating data for significant results.
⤷ Novelty Effects: Temporary boost from new features.
Hope this helps you 😊
👍1
This is a quick and easy guide to the four main categories: Supervised, Unsupervised, Semi-Supervised, and Reinforcement Learning.
1. Supervised Learning
In supervised learning, the model learns from examples that already have the answers (labeled data). The goal is for the model to predict the correct result when given new data.
Some common supervised learning algorithms include:
➡️ Linear Regression – For predicting continuous values, like house prices.
➡️ Logistic Regression – For predicting categories, like spam or not spam.
➡️ Decision Trees – For making decisions in a step-by-step way.
➡️ K-Nearest Neighbors (KNN) – For finding similar data points.
➡️ Random Forests – A collection of decision trees for better accuracy.
➡️ Neural Networks – The foundation of deep learning, mimicking the human brain.
2. Unsupervised Learning
With unsupervised learning, the model explores patterns in data that doesn’t have any labels. It finds hidden structures or groupings.
Some popular unsupervised learning algorithms include:
➡️ K-Means Clustering – For grouping data into clusters.
➡️ Hierarchical Clustering – For building a tree of clusters.
➡️ Principal Component Analysis (PCA) – For reducing data to its most important parts.
➡️ Autoencoders – For finding simpler representations of data.
3. Semi-Supervised Learning
This is a mix of supervised and unsupervised learning. It uses a small amount of labeled data with a large amount of unlabeled data to improve learning.
Common semi-supervised learning algorithms include:
➡️ Label Propagation – For spreading labels through connected data points.
➡️ Semi-Supervised SVM – For combining labeled and unlabeled data.
➡️ Graph-Based Methods – For using graph structures to improve learning.
4. Reinforcement Learning
In reinforcement learning, the model learns by trial and error. It interacts with its environment, receives feedback (rewards or penalties), and learns how to act to maximize rewards.
Popular reinforcement learning algorithms include:
➡️ Q-Learning – For learning the best actions over time.
➡️ Deep Q-Networks (DQN) – Combining Q-learning with deep learning.
➡️ Policy Gradient Methods – For learning policies directly.
➡️ Proximal Policy Optimization (PPO) – For stable and effective learning.
1. Supervised Learning
In supervised learning, the model learns from examples that already have the answers (labeled data). The goal is for the model to predict the correct result when given new data.
Some common supervised learning algorithms include:
➡️ Linear Regression – For predicting continuous values, like house prices.
➡️ Logistic Regression – For predicting categories, like spam or not spam.
➡️ Decision Trees – For making decisions in a step-by-step way.
➡️ K-Nearest Neighbors (KNN) – For finding similar data points.
➡️ Random Forests – A collection of decision trees for better accuracy.
➡️ Neural Networks – The foundation of deep learning, mimicking the human brain.
2. Unsupervised Learning
With unsupervised learning, the model explores patterns in data that doesn’t have any labels. It finds hidden structures or groupings.
Some popular unsupervised learning algorithms include:
➡️ K-Means Clustering – For grouping data into clusters.
➡️ Hierarchical Clustering – For building a tree of clusters.
➡️ Principal Component Analysis (PCA) – For reducing data to its most important parts.
➡️ Autoencoders – For finding simpler representations of data.
3. Semi-Supervised Learning
This is a mix of supervised and unsupervised learning. It uses a small amount of labeled data with a large amount of unlabeled data to improve learning.
Common semi-supervised learning algorithms include:
➡️ Label Propagation – For spreading labels through connected data points.
➡️ Semi-Supervised SVM – For combining labeled and unlabeled data.
➡️ Graph-Based Methods – For using graph structures to improve learning.
4. Reinforcement Learning
In reinforcement learning, the model learns by trial and error. It interacts with its environment, receives feedback (rewards or penalties), and learns how to act to maximize rewards.
Popular reinforcement learning algorithms include:
➡️ Q-Learning – For learning the best actions over time.
➡️ Deep Q-Networks (DQN) – Combining Q-learning with deep learning.
➡️ Policy Gradient Methods – For learning policies directly.
➡️ Proximal Policy Optimization (PPO) – For stable and effective learning.
👍5❤4✍1
🔟 Project Ideas for a data analyst
Customer Segmentation: Analyze customer data to segment them based on their behaviors, preferences, or demographics, helping businesses tailor their marketing strategies.
Churn Prediction: Build a model to predict customer churn, identifying factors that contribute to churn and proposing strategies to retain customers.
Sales Forecasting: Use historical sales data to create a predictive model that forecasts future sales, aiding inventory management and resource planning.
Market Basket Analysis: Analyze
transaction data to identify associations between products often purchased together, assisting retailers in optimizing product placement and cross-selling.
Sentiment Analysis: Analyze social media or customer reviews to gauge public sentiment about a product or service, providing valuable insights for brand reputation management.
Healthcare Analytics: Examine medical records to identify trends, patterns, or correlations in patient data, aiding in disease prediction, treatment optimization, and resource allocation.
Financial Fraud Detection: Develop algorithms to detect anomalous transactions and patterns in financial data, helping prevent fraud and secure transactions.
A/B Testing Analysis: Evaluate the results of A/B tests to determine the effectiveness of different strategies or changes on websites, apps, or marketing campaigns.
Energy Consumption Analysis: Analyze energy usage data to identify patterns and inefficiencies, suggesting strategies for optimizing energy consumption in buildings or industries.
Real Estate Market Analysis: Study housing market data to identify trends in property prices, rental rates, and demand, assisting buyers, sellers, and investors in making informed decisions.
Remember to choose a project that aligns with your interests and the domain you're passionate about.
Data Analyst Roadmap
👇👇
https://news.1rj.ru/str/sqlspecialist/379
ENJOY LEARNING 👍👍
Customer Segmentation: Analyze customer data to segment them based on their behaviors, preferences, or demographics, helping businesses tailor their marketing strategies.
Churn Prediction: Build a model to predict customer churn, identifying factors that contribute to churn and proposing strategies to retain customers.
Sales Forecasting: Use historical sales data to create a predictive model that forecasts future sales, aiding inventory management and resource planning.
Market Basket Analysis: Analyze
transaction data to identify associations between products often purchased together, assisting retailers in optimizing product placement and cross-selling.
Sentiment Analysis: Analyze social media or customer reviews to gauge public sentiment about a product or service, providing valuable insights for brand reputation management.
Healthcare Analytics: Examine medical records to identify trends, patterns, or correlations in patient data, aiding in disease prediction, treatment optimization, and resource allocation.
Financial Fraud Detection: Develop algorithms to detect anomalous transactions and patterns in financial data, helping prevent fraud and secure transactions.
A/B Testing Analysis: Evaluate the results of A/B tests to determine the effectiveness of different strategies or changes on websites, apps, or marketing campaigns.
Energy Consumption Analysis: Analyze energy usage data to identify patterns and inefficiencies, suggesting strategies for optimizing energy consumption in buildings or industries.
Real Estate Market Analysis: Study housing market data to identify trends in property prices, rental rates, and demand, assisting buyers, sellers, and investors in making informed decisions.
Remember to choose a project that aligns with your interests and the domain you're passionate about.
Data Analyst Roadmap
👇👇
https://news.1rj.ru/str/sqlspecialist/379
ENJOY LEARNING 👍👍
👍7❤1👏1
Hi Guys,
Here are some of the telegram channels which may help you in data analytics journey 👇👇
SQL: https://news.1rj.ru/str/sqlanalyst
Power BI & Tableau: https://news.1rj.ru/str/PowerBI_analyst
Excel: https://news.1rj.ru/str/excel_analyst
Python: https://news.1rj.ru/str/dsabooks
Jobs: https://news.1rj.ru/str/jobs_SQL
Data Science: https://news.1rj.ru/str/datasciencefree
Artificial intelligence: https://news.1rj.ru/str/machinelearning_deeplearning
Data Engineering: https://news.1rj.ru/str/sql_engineer
Data Analysts: https://news.1rj.ru/str/sqlspecialist
Hope it helps :)
Here are some of the telegram channels which may help you in data analytics journey 👇👇
SQL: https://news.1rj.ru/str/sqlanalyst
Power BI & Tableau: https://news.1rj.ru/str/PowerBI_analyst
Excel: https://news.1rj.ru/str/excel_analyst
Python: https://news.1rj.ru/str/dsabooks
Jobs: https://news.1rj.ru/str/jobs_SQL
Data Science: https://news.1rj.ru/str/datasciencefree
Artificial intelligence: https://news.1rj.ru/str/machinelearning_deeplearning
Data Engineering: https://news.1rj.ru/str/sql_engineer
Data Analysts: https://news.1rj.ru/str/sqlspecialist
Hope it helps :)
❤2👍2❤🔥1
You don't need to buy a GPU for machine learning work!
There are other alternatives. Here are some:
1. Google Colab
2. Kaggle
3. Deepnote
4. AWS SageMaker
5. GCP Notebooks
6. Azure Notebooks
7. Cocalc
8. Binder
9. Saturncloud
10. Datablore
11. IBM Notebooks
12. Ola kutrim
Spend your time focusing on your problem.💪💪
There are other alternatives. Here are some:
1. Google Colab
2. Kaggle
3. Deepnote
4. AWS SageMaker
5. GCP Notebooks
6. Azure Notebooks
7. Cocalc
8. Binder
9. Saturncloud
10. Datablore
11. IBM Notebooks
12. Ola kutrim
Spend your time focusing on your problem.💪💪
👍5🙏4🔥2❤1
95% of Machine Learning solutions in the real world are for tabular data.
Not LLMs, not transformers, not agents, not fancy stuff.
Learning to do feature engineering and build tree-based models will open a ton of opportunities.
Not LLMs, not transformers, not agents, not fancy stuff.
Learning to do feature engineering and build tree-based models will open a ton of opportunities.
❤4👍2🔥1
“The Best Public Datasets for Machine Learning and Data Science” by Stacy Stanford
https://datasimplifier.com/best-data-analyst-projects-for-freshers/
https://toolbox.google.com/datasetsearch
https://www.kaggle.com/datasets
http://mlr.cs.umass.edu/ml/
https://www.visualdata.io/
https://guides.library.cmu.edu/machine-learning/datasets
https://www.data.gov/
https://nces.ed.gov/
https://www.ukdataservice.ac.uk/
https://datausa.io/
https://www.cs.toronto.edu/~delve/data/boston/bostonDetail.html
https://www.kaggle.com/xiuchengwang/python-dataset-download
https://www.quandl.com/
https://data.worldbank.org/
https://www.imf.org/en/Data
https://markets.ft.com/data/
https://trends.google.com/trends/?q=google&ctab=0&geo=all&date=all&sort=0
https://www.aeaweb.org/resources/data/us-macro-regional
http://xviewdataset.org/#dataset
http://labelme.csail.mit.edu/Release3.0/browserTools/php/dataset.php
http://image-net.org/
http://cocodataset.org/
http://visualgenome.org/
https://ai.googleblog.com/2016/09/introducing-open-images-dataset.html?m=1
http://vis-www.cs.umass.edu/lfw/
http://vision.stanford.edu/aditya86/ImageNetDogs/
http://web.mit.edu/torralba/www/indoor.html
http://www.cs.jhu.edu/~mdredze/datasets/sentiment/
http://ai.stanford.edu/~amaas/data/sentiment/
http://nlp.stanford.edu/sentiment/code.html
http://help.sentiment140.com/for-students/
https://www.kaggle.com/crowdflower/twitter-airline-sentiment
https://hotpotqa.github.io/
https://www.cs.cmu.edu/~./enron/
https://snap.stanford.edu/data/web-Amazon.html
https://aws.amazon.com/datasets/google-books-ngrams/
http://u.cs.biu.ac.il/~koppel/BlogCorpus.htm
https://code.google.com/archive/p/wiki-links/downloads
http://www.dt.fee.unicamp.br/~tiago/smsspamcollection/
https://www.yelp.com/dataset
https://news.1rj.ru/str/DataPortfolio/2
https://archive.ics.uci.edu/ml/datasets/Spambase
https://bdd-data.berkeley.edu/
http://apolloscape.auto/
https://archive.org/details/comma-dataset
https://www.cityscapes-dataset.com/
http://aplicaciones.cimat.mx/Personal/jbhayet/ccsad-dataset
http://www.vision.ee.ethz.ch/~timofter/traffic_signs/
http://cvrr.ucsd.edu/LISA/datasets.html
https://hci.iwr.uni-heidelberg.de/node/6132
http://www.lara.prd.fr/benchmarks/trafficlightsrecognition
http://computing.wpi.edu/dataset.html
https://mimic.physionet.org/
✅ Best Telegram channels to get free coding & data science resources
https://news.1rj.ru/str/addlist/4q2PYC0pH_VjZDk5
✅ Free Courses with Certificate:
https://news.1rj.ru/str/free4unow_backup
https://datasimplifier.com/best-data-analyst-projects-for-freshers/
https://toolbox.google.com/datasetsearch
https://www.kaggle.com/datasets
http://mlr.cs.umass.edu/ml/
https://www.visualdata.io/
https://guides.library.cmu.edu/machine-learning/datasets
https://www.data.gov/
https://nces.ed.gov/
https://www.ukdataservice.ac.uk/
https://datausa.io/
https://www.cs.toronto.edu/~delve/data/boston/bostonDetail.html
https://www.kaggle.com/xiuchengwang/python-dataset-download
https://www.quandl.com/
https://data.worldbank.org/
https://www.imf.org/en/Data
https://markets.ft.com/data/
https://trends.google.com/trends/?q=google&ctab=0&geo=all&date=all&sort=0
https://www.aeaweb.org/resources/data/us-macro-regional
http://xviewdataset.org/#dataset
http://labelme.csail.mit.edu/Release3.0/browserTools/php/dataset.php
http://image-net.org/
http://cocodataset.org/
http://visualgenome.org/
https://ai.googleblog.com/2016/09/introducing-open-images-dataset.html?m=1
http://vis-www.cs.umass.edu/lfw/
http://vision.stanford.edu/aditya86/ImageNetDogs/
http://web.mit.edu/torralba/www/indoor.html
http://www.cs.jhu.edu/~mdredze/datasets/sentiment/
http://ai.stanford.edu/~amaas/data/sentiment/
http://nlp.stanford.edu/sentiment/code.html
http://help.sentiment140.com/for-students/
https://www.kaggle.com/crowdflower/twitter-airline-sentiment
https://hotpotqa.github.io/
https://www.cs.cmu.edu/~./enron/
https://snap.stanford.edu/data/web-Amazon.html
https://aws.amazon.com/datasets/google-books-ngrams/
http://u.cs.biu.ac.il/~koppel/BlogCorpus.htm
https://code.google.com/archive/p/wiki-links/downloads
http://www.dt.fee.unicamp.br/~tiago/smsspamcollection/
https://www.yelp.com/dataset
https://news.1rj.ru/str/DataPortfolio/2
https://archive.ics.uci.edu/ml/datasets/Spambase
https://bdd-data.berkeley.edu/
http://apolloscape.auto/
https://archive.org/details/comma-dataset
https://www.cityscapes-dataset.com/
http://aplicaciones.cimat.mx/Personal/jbhayet/ccsad-dataset
http://www.vision.ee.ethz.ch/~timofter/traffic_signs/
http://cvrr.ucsd.edu/LISA/datasets.html
https://hci.iwr.uni-heidelberg.de/node/6132
http://www.lara.prd.fr/benchmarks/trafficlightsrecognition
http://computing.wpi.edu/dataset.html
https://mimic.physionet.org/
✅ Best Telegram channels to get free coding & data science resources
https://news.1rj.ru/str/addlist/4q2PYC0pH_VjZDk5
✅ Free Courses with Certificate:
https://news.1rj.ru/str/free4unow_backup
👍3