Data Science Portfolio - Kaggle Datasets & AI Projects | Artificial Intelligence – Telegram
Free Datasets For Data Science Projects & Portfolio

Buy ads: https://telega.io/c/DataPortfolio

For Promotions/ads: @coderfun @love_data
cryptos historical data.zip
26.5 MB
Dataset Name: top 1000 cryptos historical data ( Daily updates )
Instagram fake spammer genuine accounts.zip
6.8 KB
Dataset Name: Instagram fake spammer genuine accounts
    
👍73
Don't forget to check these 10 SQL projects with corresponding datasets that you could use to practice your SQL skills:

1. Analysis of Sales Data:

(https://www.kaggle.com/kyanyoga/sample-sales-data)

2. HR Analytics:

(https://www.kaggle.com/pavansubhasht/ibm-hr-analytics-attrition-dataset)

3. Social Media Analytics:

(https://www.kaggle.com/datasets/ramjasmaurya/top-1000-social-media-channels)

4. Financial Data Analysis:

(https://www.kaggle.com/datasets/nitindatta/finance-data)

5. Healthcare Data Analysis:

(https://www.kaggle.com/cdc/mortality)

6. Customer Relationship Management:

(https://www.kaggle.com/pankajjsh06/ibm-watson-marketing-customer-value-data)

7. Web Analytics:

(https://www.kaggle.com/zynicide/wine-reviews)

8. E-commerce Analysis:

(https://www.kaggle.com/olistbr/brazilian-ecommerce)

9. Supply Chain Management:

(https://www.kaggle.com/datasets/harshsingh2209/supply-chain-analysis)

10. Inventory Management:

(https://www.kaggle.com/datasets?search=inventory+management)
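
A practical first step with any of these datasets is getting the CSV into a database you can query. Here is a minimal sketch using Python's built-in sqlite3 module. The table name, column names, and sample rows are invented stand-ins for a downloaded file, not the layout of any dataset linked above.

```python
import csv
import io
import sqlite3

# Hypothetical stand-in for a downloaded Kaggle CSV; the column names and
# values are invented for illustration only.
SAMPLE_CSV = """order_id,product_line,quantity,unit_price
1,Classic Cars,2,100.0
2,Motorcycles,1,80.0
3,Classic Cars,3,100.0
4,Ships,5,20.0
"""


def load_sales(conn, csv_text):
    """Create a sales table and fill it from CSV text. Returns the row count."""
    conn.execute(
        "CREATE TABLE sales ("
        "order_id INTEGER PRIMARY KEY, product_line TEXT, "
        "quantity INTEGER, unit_price REAL)"
    )
    rows = [
        (int(r["order_id"]), r["product_line"],
         int(r["quantity"]), float(r["unit_price"]))
        for r in csv.DictReader(io.StringIO(csv_text))
    ]
    # Parameterized inserts keep the data separate from the SQL text.
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?, ?)", rows)
    return len(rows)


def revenue_by_product_line(conn):
    """Total revenue per product line, highest first."""
    query = (
        "SELECT product_line, SUM(quantity * unit_price) AS revenue "
        "FROM sales GROUP BY product_line ORDER BY revenue DESC"
    )
    return conn.execute(query).fetchall()


conn = sqlite3.connect(":memory:")
loaded = load_sales(conn, SAMPLE_CSV)
for product_line, revenue in revenue_by_product_line(conn):
    print(product_line, revenue)
```

With a real file, you would read from open("your_file.csv", newline="") instead of the inline string and adjust the column list to match.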

Share this channel with your friends 🤝🤩

Join for more -> https://news.1rj.ru/str/addlist/ID95piZJZa0wYzk5

ENJOY LEARNING 👍👍
👍83
The key to starting your data analysis career:

It's not your education
It's not your experience

It's how you apply these principles:

1. Learn the job through "doing"
2. Build a portfolio
3. Make yourself known

No one starts as an expert, but everyone can become one.

If you're looking for a career in data analysis, start by:

⟶ Watching videos
⟶ Reading expert advice
⟶ Doing internships
⟶ Building a portfolio
⟶ Learning from seniors

You'll be amazed at how fast you'll learn and how quickly you'll become an expert.

So, start today and let your data analysis career begin.
👍84
Here is a list of a few projects (found on Kaggle). They cover Python basics, advanced statistics, supervised learning (regression and classification problems), and data science.

Please also check the discussions and notebook submissions for different approaches and solutions after you have tried the problems yourself.

1. Basic python and statistics

Pima Indians :- https://www.kaggle.com/uciml/pima-indians-diabetes-database
Cardio Good Fitness :- https://www.kaggle.com/saurav9786/cardiogoodfitness
Automobile :- https://www.kaggle.com/toramky/automobile-dataset

2. Advanced Statistics

Game of Thrones:-https://www.kaggle.com/mylesoneill/game-of-thrones
World University Ranking:-https://www.kaggle.com/mylesoneill/world-university-rankings
IMDB Movie Dataset:- https://www.kaggle.com/carolzhangdc/imdb-5000-movie-dataset

3. Supervised Learning

a) Regression Problems

How much did it rain :- https://www.kaggle.com/c/how-much-did-it-rain-ii/overview
Inventory Demand:- https://www.kaggle.com/c/grupo-bimbo-inventory-demand
Property Inspection prediction:- https://www.kaggle.com/c/liberty-mutual-group-property-inspection-prediction
Restaurant Revenue prediction:- https://www.kaggle.com/c/restaurant-revenue-prediction/data
TMDB Box office Prediction:- https://www.kaggle.com/c/tmdb-box-office-prediction/overview

b) Classification problems

Employee Access challenge :- https://www.kaggle.com/c/amazon-employee-access-challenge/overview
Titanic :- https://www.kaggle.com/c/titanic
San Francisco crime:- https://www.kaggle.com/c/sf-crime
Customer satisfaction:- https://www.kaggle.com/c/santander-customer-satisfaction
Trip type classification:- https://www.kaggle.com/c/walmart-recruiting-trip-type-classification
Categorize cuisine:- https://www.kaggle.com/c/whats-cooking

4. Some helpful Data science projects for beginners

https://www.kaggle.com/c/house-prices-advanced-regression-techniques

https://www.kaggle.com/c/digit-recognizer

https://www.kaggle.com/c/titanic

5. Intermediate Level Data science Projects

Black Friday Data : https://www.kaggle.com/sdolezel/black-friday

Human Activity Recognition Data : https://www.kaggle.com/uciml/human-activity-recognition-with-smartphones

Trip History Data : https://www.kaggle.com/pronto/cycle-share-dataset

Million Song Data : https://www.kaggle.com/c/msdchallenge

Census Income Data : https://www.kaggle.com/c/census-income/data

Movie Lens Data : https://www.kaggle.com/grouplens/movielens-20m-dataset

Twitter Classification Data : https://www.kaggle.com/c/twitter-sentiment-analysis2

Share with credits: https://news.1rj.ru/str/sqlproject

ENJOY LEARNING 👍👍
👍114
SQL Case Studies for Interview:

Join for more: https://news.1rj.ru/str/sqlanalyst

1. Danny’s Diner:
Restaurant analytics to understand customers' ordering patterns.
Link: https://8weeksqlchallenge.com/case-study-1/

2. Pizza Runner
Pizza shop analytics to optimize the efficiency of the operation
Link: https://8weeksqlchallenge.com/case-study-2/

3. Foodie-Fi
Subscription-based food content platform
Link: https://lnkd.in/gzB39qAT

4. Data Bank: That’s money
Analytics based on customer activities with the digital bank
Link: https://lnkd.in/gH8pKPyv

5. Data Mart: Fresh is Best
Analytics for an online supermarket
Link: https://lnkd.in/gC5bkcDf

6. Clique Bait: Attention capturing
Analytics on the seafood industry
Link: https://lnkd.in/ggP4JiYG

7. Balanced Tree: Clothing Company
Analytics on the sales performance of a clothing store
Link: https://8weeksqlchallenge.com/case-study-7

8. Fresh segments: Extract maximum value
Analytics on online advertising
Link: https://8weeksqlchallenge.com/case-study-8
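
Most of these case studies come down to joining a fact table to a lookup table and aggregating. Here is a rough sketch in the spirit of the first one, run through Python's sqlite3 module. The sales and menu tables, their columns, and every row are assumptions made up for illustration; check the case study page for the real schema and data.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Assumed two-table layout: a menu lookup and a sales fact table.
# All rows below are invented sample data.
conn.executescript(
    """
    CREATE TABLE menu (
        product_id INTEGER PRIMARY KEY,
        product_name TEXT,
        price INTEGER
    );
    CREATE TABLE sales (
        customer_id TEXT,
        order_date TEXT,
        product_id INTEGER
    );

    INSERT INTO menu VALUES (1, 'sushi', 10), (2, 'curry', 15), (3, 'ramen', 12);
    INSERT INTO sales VALUES
        ('A', '2021-01-01', 1),
        ('A', '2021-01-01', 2),
        ('B', '2021-01-02', 2),
        ('B', '2021-01-04', 3),
        ('C', '2021-01-01', 3);
    """
)


def total_spent_per_customer(conn):
    """Join each sale to its menu price, then sum and count per customer."""
    query = """
        SELECT s.customer_id,
               SUM(m.price) AS total_spent,
               COUNT(*)     AS items_ordered
        FROM sales AS s
        JOIN menu AS m ON m.product_id = s.product_id
        GROUP BY s.customer_id
        ORDER BY s.customer_id
    """
    return conn.execute(query).fetchall()


for customer_id, total_spent, items_ordered in total_spent_per_customer(conn):
    print(customer_id, total_spent, items_ordered)
```

In an interview, it usually helps to state the join key and the grouping column out loud before writing the query.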
👍54
Creating a data science portfolio is a great way to showcase your skills and experience to potential employers. Here are some steps to help you create a strong data science portfolio:

1. Choose relevant projects: Select a few data science projects that demonstrate your skills and interests. These projects can be from your previous work experience, personal projects, or online competitions.

2. Clean and organize your code: Make sure your code is well-documented, organized, and easy to understand. Use comments to explain your thought process and the steps you took in your analysis.

3. Include a variety of projects: Try to include a mix of projects that showcase different aspects of data science, such as data cleaning, exploratory data analysis, machine learning, and data visualization.

4. Create visualizations: Data visualizations can help make your portfolio more engaging and easier to understand. Use tools like Matplotlib, Seaborn, or Tableau to create visually appealing charts and graphs.

5. Write project summaries: For each project, provide a brief summary of the problem you were trying to solve, the dataset you used, the methods you applied, and the results you obtained. Include any insights or recommendations that came out of your analysis.

6. Showcase your technical skills: Highlight the programming languages, libraries, and tools you used in each project. Mention any specific techniques or algorithms you implemented.

7. Link to your code and data: Provide links to your code repositories (e.g., GitHub) and any datasets you used in your projects. This allows potential employers to review your work in more detail.

8. Keep it updated: Regularly update your portfolio with new projects and skills as you gain more experience in data science. This will show that you are actively engaged in the field and continuously improving your skills.

So, start today and let your data analysis career begin.
👍74
Sites to Find Datasets

Below are sites where I've found free, public datasets.

Datahub - This site covers a wide range of topics from climate change to entertainment, but it mainly focuses on economic and business data.
Dataset Search - You're able to use Google to search for datasets. It's great if you have a particular topic in mind.
Kaggle - It has a variety of free datasets provided by users, covering everything from arts & entertainment to social science.
Data Gov - Public data from the US government, covering everything from crime to healthcare.
Maven Analytics Data Playground - Datasets hand-picked by Maven's instructors. They range from fun ones, like analyzing the Harry Potter movie scripts, to business-focused ones, like analyzing the sales of a pizza place.
Awesome Public Datasets - A list of topic focused public data sources that are high quality. These are collected from blogs, answers, and user responses.
Datacamp Datasets - These datasets are from a variety of fields from real estate to retail. All of the datasets have the data and packages needed.
NASA Data - Has open-data provided to the public from NASA. The dataset pages only hold the metadata and the actual data may be on another NASA site. There will be links to the data in these other locations.
Dataportfolio - Telegram Channel with Free Datasets
Google BigQuery - It's free to sign up and you can practice with plenty of free datasets.
👍154
Step-by-Step Data Analysis Projects with Python Code


Below are popular data analysis projects from users. They will:

- Help you gain skills in working with real data
- Introduce you to Python libraries for data analysis
- Inspire you for your own data analysis projects

Netflix Data Analysis

Video Game Sales Analysis

Is There a Trend of Increasing Geek Girls?

Let's Discover More About the Olympic Games!

Marketing Analysis

Animal Shelter Data Analysis

Amazon Data Analysis

Billionaire Data Analysis

Credit Card Data Analysis

Pokemon Data Analysis

Spotify Data Analysis. What Does It Take to Hit the Charts
👍103🔥2
Learning Python for data science can be a rewarding experience. Here are some steps you can follow to get started:

1. Learn the Basics of Python: Start by learning the basics of Python programming language such as syntax, data types, functions, loops, and conditional statements. There are many online resources available for free to learn Python.

2. Understand Data Structures and Libraries: Familiarize yourself with data structures like lists, dictionaries, tuples, and sets. Also, learn about popular Python libraries used in data science such as NumPy, Pandas, Matplotlib, and Scikit-learn.

3. Practice with Projects: Start working on small data science projects to apply your knowledge. You can find datasets online to practice your skills and build your portfolio.

4. Take Online Courses: Enroll in online courses specifically tailored for learning Python for data science. Websites like Coursera, Udemy, and DataCamp offer courses on Python programming for data science.

5. Join Data Science Communities: Join online communities and forums like Stack Overflow, Reddit, or Kaggle to connect with other data science enthusiasts and get help with any questions you may have.

6. Read Books: There are many great books available on Python for data science that can help you deepen your understanding of the subject. Some popular books include "Python for Data Analysis" by Wes McKinney and "Data Science from Scratch" by Joel Grus.

7. Practice Regularly: Practice is key to mastering any skill. Make sure to practice regularly and work on real-world data science problems to improve your skills.
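
To make step 2 concrete, here is a small sketch of how lists, tuples, sets, and dictionaries each play a role in even a tiny analysis. It uses only the standard library, and the order records are made up for illustration.

```python
from collections import Counter

# Made-up records as (city, product, amount). Tuples suit fixed-shape rows,
# and the list keeps them in order.
orders = [
    ("Pune", "laptop", 900.0),
    ("Delhi", "phone", 400.0),
    ("Pune", "phone", 450.0),
    ("Mumbai", "laptop", 950.0),
    ("Delhi", "phone", 380.0),
]

# Set: unique values with fast membership tests.
cities = {city for city, _, _ in orders}

# Dict: accumulate a running total per key.
revenue_by_city = {}
for city, _, amount in orders:
    revenue_by_city[city] = revenue_by_city.get(city, 0.0) + amount

# Counter (a dict subclass): count how often each product appears.
product_counts = Counter(product for _, product, _ in orders)

# List: sort the city names by their revenue, highest first.
top_cities = sorted(revenue_by_city, key=revenue_by_city.get, reverse=True)

print(sorted(cities))
print(revenue_by_city)
print(product_counts.most_common(1))
print(top_cities)
```

Libraries such as Pandas wrap these same ideas in faster, more convenient structures, so time spent on the basics carries over directly.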

Remember that learning Python for data science is a continuous process, so be patient and persistent in your efforts. Good luck!
6👍2
Please go through these top 10 SQL projects with datasets that you can practice on and add to your resume

📌1. Social Media Analytics:
(https://www.kaggle.com/amanajmera1/framingham-heart-study-dataset)

🚀2. Web Analytics:
(https://www.kaggle.com/zynicide/wine-reviews)

📌3. HR Analytics:
(https://www.kaggle.com/pavansubhasht/ibm-hr-analytics-attrition-dataset)

🚀4. Healthcare Data Analysis:
(https://www.kaggle.com/cdc/mortality)

📌5. E-commerce Analysis:
(https://www.kaggle.com/olistbr/brazilian-ecommerce)

🚀6. Inventory Management:
(https://www.kaggle.com/datasets?search=inventory+management)

📌7. Customer Relationship Management:
(https://www.kaggle.com/pankajjsh06/ibm-watson-marketing-customer-value-data)

🚀8. Financial Data Analysis:
(https://www.kaggle.com/awaiskalia/banking-database)

📌9. Supply Chain Management:
(https://www.kaggle.com/shashwatwork/procurement-analytics)

🚀10. Analysis of Sales Data:
(https://www.kaggle.com/kyanyoga/sample-sales-data)

A small suggestion from my side for non-tech students: pick datasets on subjects you genuinely like. You will be more excited to practice than if you were doing it only for the sake of your resume, and you will learn SQL with more enthusiasm. Since it's a programming language, try to make it more exciting for yourself.

Join for more: https://news.1rj.ru/str/DataPortfolio

Hope this piece of information helps you
👍9
Kaggle Datasets are often too perfect for real-world scenarios.

I'm about to share a method for real-life data analysis.

You see …

… most of the time, a data analyst cleans and transforms data.

So … let’s practice that.

How?

Well … you can use ChatGPT.

Just write this prompt:

Create a downloadable CSV dataset of 10,000 rows of financial credit card transactions with 10 columns of customer data so I can perform some data analysis to segment customers.

Now…

Download the dataset and start your analysis.

You'll see that, most of the time…

… numbers don’t match.

There are no patterns.

Data is incorrect and doesn’t make sense.

And that’s good.

Now you know what a data analyst deals with.

Your job is to make sense of that dataset.

To create a story that justifies the numbers.

This is how you can mimic real-life work using A.I.
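
Before looking for any story in a dataset like that, it helps to separate usable rows from broken ones and record why each row was rejected. Here is a minimal sketch with the standard csv module. The column names, the sample rows, and the validity rules (for example, treating non-positive amounts as errors rather than refunds) are all assumptions chosen for illustration.

```python
import csv
import io

# Invented messy sample: a blank amount, a negative amount, a repeated row,
# a missing customer, and a non-numeric amount.
RAW_CSV = """transaction_id,customer_id,amount,date
1,C001,25.50,2024-01-03
2,C002,,2024-01-04
3,C001,-10.00,2024-01-05
1,C001,25.50,2024-01-03
4,,40.00,2024-01-06
5,C003,abc,2024-01-07
6,C002,15.00,2024-01-08
"""


def clean_transactions(csv_text):
    """Split rows into (clean, rejected); rejected keeps a reason per row."""
    clean, rejected = [], []
    seen_ids = set()
    for row in csv.DictReader(io.StringIO(csv_text)):
        try:
            amount = float(row["amount"])
        except ValueError:
            amount = None  # blank or non-numeric text

        reason = None
        if row["transaction_id"] in seen_ids:
            reason = "duplicate id"
        elif not row["customer_id"]:
            reason = "missing customer"
        elif amount is None:
            reason = "unparseable amount"
        elif amount <= 0:
            reason = "non-positive amount"  # assumed rule, see lead-in

        if reason:
            rejected.append((row["transaction_id"], reason))
        else:
            seen_ids.add(row["transaction_id"])
            clean.append({**row, "amount": amount})
    return clean, rejected


clean, rejected = clean_transactions(RAW_CSV)
print(len(clean), "clean rows")
for transaction_id, reason in rejected:
    print(transaction_id, reason)
```

Keeping the rejection reasons gives you a data-quality summary to report alongside the analysis, which is often what a stakeholder asks for first.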
👍137
Are you a data science beginner?

Here are 5 beginner-friendly data science project ideas

Loan Approval Prediction

Predict whether a loan will be approved based on customer demographic and financial data. This requires data preprocessing, feature engineering, and binary classification techniques.

Credit Card Fraud Detection

Detect fraudulent credit card transactions with a dataset that contains transactions made by credit cards. This is a good project for learning about imbalanced datasets and anomaly detection methods.

Netflix Movies and TV Shows Analysis

Analyze Netflix's movies and TV shows to discover trends in ratings, popularity, and genre distributions. Visualization tools and exploratory data analysis are key components here.

Sentiment Analysis of Tweets

Analyze the sentiment of tweets to determine whether they are positive, negative, or neutral. This project involves natural language processing and working with text data.

Weather Data Analysis

Analyze historical weather data from the National Oceanic and Atmospheric Administration (NOAA) to look for seasonal trends, weather anomalies, or climate change indicators. This project involves time series analysis and data visualization.
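
For the fraud detection idea, the main lesson about imbalanced data can be seen with a few lines of plain Python: a model that never flags anything still scores high accuracy, so precision and recall are the numbers to watch. The labels and predictions below are made up for illustration and do not come from any real dataset or model.

```python
def confusion_counts(y_true, y_pred):
    """Return (tp, fp, fn, tn) for binary labels where 1 means fraud."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return tp, fp, fn, tn


def metrics(y_true, y_pred):
    """Accuracy, precision, and recall; empty denominators give 0.0."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    return {
        "accuracy": (tp + tn) / len(y_true),
        "precision": tp / (tp + fp) if (tp + fp) else 0.0,
        "recall": tp / (tp + fn) if (tp + fn) else 0.0,
    }


# 1,000 invented transactions, only 10 of them fraudulent.
y_true = [1] * 10 + [0] * 990

# Baseline that predicts "legitimate" for everything.
always_legit = [0] * 1000

# Hypothetical model: catches 7 of the 10 frauds and raises 5 false alarms.
model_pred = [1] * 7 + [0] * 3 + [1] * 5 + [0] * 985

baseline = metrics(y_true, always_legit)
model = metrics(y_true, model_pred)
print("baseline:", baseline)
print("model:   ", model)
```

The two accuracy figures are nearly identical, while recall separates a useless baseline from a model that actually finds fraud.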

Join for more: https://news.1rj.ru/str/sqlproject

ENJOY LEARNING 👍👍
👍10
Today, I’m sharing three essential SQL projects to boost your resume

Energy Consumption Analysis:
Managed data from smart meters to analyze patterns and optimize efficiency. 🌱

Logistics and Supply Chain Management:
Designed a database to optimize delivery routes and forecast inventory. 🚚

Healthcare Management System:
Built a database for patient records, optimizing scheduling and performance analysis. 🏥

📊 According to the World Economic Forum, data analysis and database management are top skills for 2024.
👍162
FREE DATASETS FOR BUILDING YOUR PORTFOLIO

1. Supermarket Sales - https://lnkd.in/e86UpCMv
2. Credit Card Fraud Detection - https://lnkd.in/eFTsZDCW
3. FIFA 22 complete player dataset - https://lnkd.in/eDScdUUM
4. Walmart Store Sales Forecasting - https://lnkd.in/eVT6h-CT
5. Netflix Movies and TV Shows - https://lnkd.in/eZ3cduwK
6. LinkedIn Data Analyst jobs listings - https://lnkd.in/ezqxcmrE
7. Top 50 Fast-Food Chains in USA - https://lnkd.in/esBjf5u4
8. Amazon and Best Buy Electronics - https://lnkd.in/e4fBZvJ3
9. Forecasting Book Sales - https://lnkd.in/eXHN2XsQ
10. Real / Fake Job Posting Prediction - https://lnkd.in/e5SDDW9G
👍105
The first function you should learn in each data tool:

SQL: DELETE

Tableau: Pie chart with 10+ categories

Power BI: importing from Microsoft Paint (where the real visualization is done)

Excel: inserting pictures

Python: how to defend yourself against snakes

It’s important to focus on the functions you’ll use every day.
👍18🔥3