Which operator is used to match a pattern in SQL?
Anonymous Quiz
12%
A. IN
71%
B. LIKE
12%
C. BETWEEN
5%
D. IS
❤7
✅ Data Analyst Mock Interview Questions with Answers 📊🎯
1️⃣ Q: Explain the difference between a primary key and a foreign key.
A:
• Primary Key: Uniquely identifies each record in a table; cannot be null.
• Foreign Key: A field in one table that refers to the primary key of another table; establishes a relationship between the tables.
2️⃣ Q: What is the difference between WHERE and HAVING clauses in SQL?
A:
• WHERE: Filters rows before grouping.
• HAVING: Filters groups after aggregation (used with GROUP BY).
3️⃣ Q: How do you handle missing values in a dataset?
A: Common techniques include:
• Imputation: Replacing missing values with mean, median, mode, or a constant.
• Removal: Removing rows or columns with too many missing values.
• Using algorithms that handle missing data: Some machine learning algorithms can handle missing values natively.
4️⃣ Q: What is the difference between a line chart and a bar chart, and when would you use each?
A:
• Line Chart: Shows trends over time or continuous values.
• Bar Chart: Compares discrete categories or values.
• Use a line chart to show sales trends over months; use a bar chart to compare sales across different product categories.
5️⃣ Q: Explain what a p-value is and its significance.
A: The p-value is the probability of obtaining results as extreme as, or more extreme than, the observed results, assuming the null hypothesis is true. A small p-value (typically ≤ 0.05) indicates strong evidence against the null hypothesis.
6️⃣ Q: How would you deal with outliers in a dataset?
A:
• Identify Outliers: Using box plots, scatter plots, or statistical methods (e.g., Z-score).
• Treatment:
• Remove Outliers: If they are due to errors or anomalies.
• Transform Data: Using techniques like log transformation.
• Keep Outliers: If they represent genuine data points and provide valuable insights.
7️⃣ Q: What are the different types of joins in SQL?
A:
• INNER JOIN: Returns rows only when there is a match in both tables.
• LEFT JOIN (or LEFT OUTER JOIN): Returns all rows from the left table, and the matching rows from the right table. If there is no match, the right side will contain NULL values.
• RIGHT JOIN (or RIGHT OUTER JOIN): Returns all rows from the right table, and the matching rows from the left table. If there is no match, the left side will contain NULL values.
• FULL OUTER JOIN: Returns all rows from both tables, filling in NULLs when there is no match.
8️⃣ Q: How would you approach a data analysis project from start to finish?
A:
• Define the Problem: Understand the business question you're trying to answer.
• Collect Data: Gather relevant data from various sources.
• Clean and Preprocess Data: Handle missing values, outliers, and inconsistencies.
• Explore and Analyze Data: Use statistical methods and visualizations to identify patterns.
• Draw Conclusions and Make Recommendations: Summarize your findings and provide actionable insights.
• Communicate Results: Present your analysis to stakeholders.
👍 Tap ❤️ for more!
1️⃣ Q: Explain the difference between a primary key and a foreign key.
A:
• Primary Key: Uniquely identifies each record in a table; cannot be null.
• Foreign Key: A field in one table that refers to the primary key of another table; establishes a relationship between the tables.
2️⃣ Q: What is the difference between WHERE and HAVING clauses in SQL?
A:
• WHERE: Filters rows before grouping.
• HAVING: Filters groups after aggregation (used with GROUP BY).
3️⃣ Q: How do you handle missing values in a dataset?
A: Common techniques include:
• Imputation: Replacing missing values with mean, median, mode, or a constant.
• Removal: Removing rows or columns with too many missing values.
• Using algorithms that handle missing data: Some machine learning algorithms can handle missing values natively.
4️⃣ Q: What is the difference between a line chart and a bar chart, and when would you use each?
A:
• Line Chart: Shows trends over time or continuous values.
• Bar Chart: Compares discrete categories or values.
• Use a line chart to show sales trends over months; use a bar chart to compare sales across different product categories.
5️⃣ Q: Explain what a p-value is and its significance.
A: The p-value is the probability of obtaining results as extreme as, or more extreme than, the observed results, assuming the null hypothesis is true. A small p-value (typically ≤ 0.05) indicates strong evidence against the null hypothesis.
6️⃣ Q: How would you deal with outliers in a dataset?
A:
• Identify Outliers: Using box plots, scatter plots, or statistical methods (e.g., Z-score).
• Treatment:
• Remove Outliers: If they are due to errors or anomalies.
• Transform Data: Using techniques like log transformation.
• Keep Outliers: If they represent genuine data points and provide valuable insights.
7️⃣ Q: What are the different types of joins in SQL?
A:
• INNER JOIN: Returns rows only when there is a match in both tables.
• LEFT JOIN (or LEFT OUTER JOIN): Returns all rows from the left table, and the matching rows from the right table. If there is no match, the right side will contain NULL values.
• RIGHT JOIN (or RIGHT OUTER JOIN): Returns all rows from the right table, and the matching rows from the left table. If there is no match, the left side will contain NULL values.
• FULL OUTER JOIN: Returns all rows from both tables, filling in NULLs when there is no match.
8️⃣ Q: How would you approach a data analysis project from start to finish?
A:
• Define the Problem: Understand the business question you're trying to answer.
• Collect Data: Gather relevant data from various sources.
• Clean and Preprocess Data: Handle missing values, outliers, and inconsistencies.
• Explore and Analyze Data: Use statistical methods and visualizations to identify patterns.
• Draw Conclusions and Make Recommendations: Summarize your findings and provide actionable insights.
• Communicate Results: Present your analysis to stakeholders.
👍 Tap ❤️ for more!
❤21
✅ Step-by-Step Approach to Learn Data Analytics 📈🧠
➊ Excel Fundamentals:
✔ Master formulas, pivot tables, data validation, charts, and graphs.
➋ SQL Basics:
✔ Learn to query databases, use SELECT, FROM, WHERE, JOIN, GROUP BY, and aggregate functions.
➌ Data Visualization:
✔ Get proficient with tools like Tableau or Power BI to create insightful dashboards.
➍ Statistical Concepts:
✔ Understand denoscriptive statistics (mean, median, mode), distributions, and hypothesis testing.
➎ Data Cleaning & Preprocessing:
✔ Learn how to handle missing data, outliers, and data inconsistencies.
➏ Exploratory Data Analysis (EDA):
✔ Explore datasets, identify patterns, and formulate hypotheses.
➐ Python for Data Analysis (Optional but Recommended):
✔ Learn Pandas and NumPy for data manipulation and analysis.
➑ Real-World Projects:
✔ Analyze datasets from Kaggle, UCI Machine Learning Repository, or your own collection.
➒ Business Acumen:
✔ Understand key business metrics and how data insights impact business decisions.
➓ Build a Portfolio:
✔ Showcase your projects on GitHub, Tableau Public, or a personal website. Highlight the impact of your analysis.
👍 Tap ❤️ for more!
➊ Excel Fundamentals:
✔ Master formulas, pivot tables, data validation, charts, and graphs.
➋ SQL Basics:
✔ Learn to query databases, use SELECT, FROM, WHERE, JOIN, GROUP BY, and aggregate functions.
➌ Data Visualization:
✔ Get proficient with tools like Tableau or Power BI to create insightful dashboards.
➍ Statistical Concepts:
✔ Understand denoscriptive statistics (mean, median, mode), distributions, and hypothesis testing.
➎ Data Cleaning & Preprocessing:
✔ Learn how to handle missing data, outliers, and data inconsistencies.
➏ Exploratory Data Analysis (EDA):
✔ Explore datasets, identify patterns, and formulate hypotheses.
➐ Python for Data Analysis (Optional but Recommended):
✔ Learn Pandas and NumPy for data manipulation and analysis.
➑ Real-World Projects:
✔ Analyze datasets from Kaggle, UCI Machine Learning Repository, or your own collection.
➒ Business Acumen:
✔ Understand key business metrics and how data insights impact business decisions.
➓ Build a Portfolio:
✔ Showcase your projects on GitHub, Tableau Public, or a personal website. Highlight the impact of your analysis.
👍 Tap ❤️ for more!
❤20👍1🔥1
✅ How to Get a Data Analyst Job as a Fresher in 2025 📊💼
🔹 What’s the Market Like in 2025?
• High demand in BFSI, healthcare, retail & tech
• Companies expect Excel, SQL, BI tools & storytelling skills
• Python & data visualization give a strong edge
• Remote jobs are fewer, but freelance & internship opportunities are growing
🔹 Skills You MUST Have:
1️⃣ Excel – Pivot tables, formulas, dashboards
2️⃣ SQL – Joins, subqueries, CTEs, window functions
3️⃣ Power BI / Tableau – For interactive dashboards
4️⃣ Python – Data cleaning & analysis (Pandas, Matplotlib)
5️⃣ Statistics – Mean, median, correlation, hypothesis testing
6️⃣ Business Understanding – KPIs, revenue, churn etc.
🔹 Build a Strong Profile:
✔️ Do real-world projects (sales, HR, e-commerce data)
✔️ Publish dashboards on Tableau Public / Power BI
✔️ Share work on GitHub & LinkedIn
✔️ Earn certifications (Google Data Analytics, Power BI, SQL)
✔️ Practice mock interviews & case studies
🔹 Practice Platforms:
• Kaggle
• StrataScratch
• DataLemur
🔹 Fresher-Friendly Job Titles:
• Junior Data Analyst
• Business Analyst
• MIS Executive
• Reporting Analyst
🔹 Companies Hiring Freshers in 2025:
• TCS
• Infosys
• Wipro
• Cognizant
• Fractal Analytics
• EY, KPMG
• Startups & EdTech companies
📝 Tip: If a job says "1–2 yrs experience", apply anyway if your skills & projects match!
👍 Tap ❤️ if you found this helpful!
🔹 What’s the Market Like in 2025?
• High demand in BFSI, healthcare, retail & tech
• Companies expect Excel, SQL, BI tools & storytelling skills
• Python & data visualization give a strong edge
• Remote jobs are fewer, but freelance & internship opportunities are growing
🔹 Skills You MUST Have:
1️⃣ Excel – Pivot tables, formulas, dashboards
2️⃣ SQL – Joins, subqueries, CTEs, window functions
3️⃣ Power BI / Tableau – For interactive dashboards
4️⃣ Python – Data cleaning & analysis (Pandas, Matplotlib)
5️⃣ Statistics – Mean, median, correlation, hypothesis testing
6️⃣ Business Understanding – KPIs, revenue, churn etc.
🔹 Build a Strong Profile:
✔️ Do real-world projects (sales, HR, e-commerce data)
✔️ Publish dashboards on Tableau Public / Power BI
✔️ Share work on GitHub & LinkedIn
✔️ Earn certifications (Google Data Analytics, Power BI, SQL)
✔️ Practice mock interviews & case studies
🔹 Practice Platforms:
• Kaggle
• StrataScratch
• DataLemur
🔹 Fresher-Friendly Job Titles:
• Junior Data Analyst
• Business Analyst
• MIS Executive
• Reporting Analyst
🔹 Companies Hiring Freshers in 2025:
• TCS
• Infosys
• Wipro
• Cognizant
• Fractal Analytics
• EY, KPMG
• Startups & EdTech companies
📝 Tip: If a job says "1–2 yrs experience", apply anyway if your skills & projects match!
👍 Tap ❤️ if you found this helpful!
❤43👍2🥰1
✅ SQL Constraints 📊🛡️
Constraints are the rules that keep your database clean & accurate.
🔹 1. PRIMARY KEY
➤ Uniquely identifies each row in a table
➤ Cannot be NULL or duplicated
➤ Links to a primary key in another table
➤ Ensures data consistency across tables
➤ Ensures all values in a column are different
➤ Column cannot have NULL (empty) values
➤ Limits the values that can be entered
➤ Automatically sets a default value
✔️ No duplicates
✔️ No missing data
✔️ Valid and consistent values
✔️ Reliable database performance
SQL Roadmap: https://whatsapp.com/channel/0029VanC5rODzgT6TiTGoa1v/1394
👍 Tap ❤️ for more!
Constraints are the rules that keep your database clean & accurate.
🔹 1. PRIMARY KEY
➤ Uniquely identifies each row in a table
➤ Cannot be NULL or duplicated
CREATE TABLE users (🔹 2. FOREIGN KEY
user_id INT PRIMARY KEY,
name VARCHAR(50)
);
➤ Links to a primary key in another table
➤ Ensures data consistency across tables
CREATE TABLE orders (🔹 3. UNIQUE
order_id INT PRIMARY KEY,
user_id INT,
FOREIGN KEY (user_id) REFERENCES users(user_id)
);
➤ Ensures all values in a column are different
CREATE TABLE employees (🔹 4. NOT NULL
id INT PRIMARY KEY,
email VARCHAR(100) UNIQUE
);
➤ Column cannot have NULL (empty) values
CREATE TABLE products (🔹 5. CHECK
id INT PRIMARY KEY,
name VARCHAR(100) NOT NULL
);
➤ Limits the values that can be entered
CREATE TABLE students (🔹 6. DEFAULT
id INT PRIMARY KEY,
age INT CHECK (age >= 18)
);
➤ Automatically sets a default value
CREATE TABLE orders (🎯 Why Constraints Matter:
id INT PRIMARY KEY,
status VARCHAR(20) DEFAULT 'Pending'
);
✔️ No duplicates
✔️ No missing data
✔️ Valid and consistent values
✔️ Reliable database performance
SQL Roadmap: https://whatsapp.com/channel/0029VanC5rODzgT6TiTGoa1v/1394
👍 Tap ❤️ for more!
❤15👏2
🔹 Top 10 SQL Functions/Commands Commonly Used in Data Analysis 📊
1️⃣ SELECT
– Used to retrieve specific columns from a table.
2️⃣ WHERE
– Filters rows based on a condition.
3️⃣ GROUP BY
– Groups rows that have the same values into summary rows.
4️⃣ ORDER BY
– Sorts the result by one or more columns.
5️⃣ JOIN
– Combines rows from two or more tables based on a related column.
6️⃣ COUNT() / SUM() / AVG() / MIN() / MAX()
– Common aggregate functions for metrics and summaries.
7️⃣ HAVING
– Filters after a GROUP BY (unlike WHERE, which filters before).
8️⃣ LIMIT
– Restricts number of rows returned.
9️⃣ CASE
– Implements conditional logic in queries.
🔟 DATE functions (NOW(), DATE_PART(), DATEDIFF(), etc.)
– Handle and extract info from dates.
Join our WhatsApp channel: https://whatsapp.com/channel/0029VbAbnvPLSmbeFYNdNA29
👍 Tap ❤️ for more!
1️⃣ SELECT
– Used to retrieve specific columns from a table.
SELECT name, age FROM users;
2️⃣ WHERE
– Filters rows based on a condition.
SELECT * FROM sales WHERE region = 'North';
3️⃣ GROUP BY
– Groups rows that have the same values into summary rows.
SELECT region, SUM(sales) FROM sales GROUP BY region;
4️⃣ ORDER BY
– Sorts the result by one or more columns.
SELECT * FROM customers ORDER BY created_at DESC;
5️⃣ JOIN
– Combines rows from two or more tables based on a related column.
SELECT a.name, b.salary
FROM employees a
JOIN salaries b ON a.id = b.emp_id;
6️⃣ COUNT() / SUM() / AVG() / MIN() / MAX()
– Common aggregate functions for metrics and summaries.
SELECT COUNT(*) FROM orders WHERE status = 'completed';
7️⃣ HAVING
– Filters after a GROUP BY (unlike WHERE, which filters before).
SELECT department, COUNT(*) FROM employees GROUP BY department HAVING COUNT(*) > 10;
8️⃣ LIMIT
– Restricts number of rows returned.
SELECT * FROM products LIMIT 5;
9️⃣ CASE
– Implements conditional logic in queries.
SELECT name,
CASE
WHEN score >= 90 THEN 'A'
WHEN score >= 75 THEN 'B'
ELSE 'C'
END AS grade
FROM students;
🔟 DATE functions (NOW(), DATE_PART(), DATEDIFF(), etc.)
– Handle and extract info from dates.
SELECT DATE_PART('year', order_date) FROM orders;Join our WhatsApp channel: https://whatsapp.com/channel/0029VbAbnvPLSmbeFYNdNA29
👍 Tap ❤️ for more!
❤13👏4👍3🔥1
✅ 7 Habits That Make You a Better Data Analyst 📊🧠
1️⃣ Explore Real Datasets Regularly
– Use Kaggle, Data.gov, or Google Dataset Search
– Focus on different domains: sales, HR, marketing, etc.
2️⃣ Master the Art of Asking Questions
– Start with: What do we want to know?
– Then: What data do we need to answer it?
3️⃣ Use SQL & Excel Daily
– Practice joins, window functions, pivot tables, formulas
– Aim to solve 1 real-world query per day
4️⃣ Visualize Everything
– Use Power BI, Tableau, or Matplotlib
– Keep charts simple, clear, and insight-driven
5️⃣ Storytelling > Just Reporting
– Always add “So what?” to your analysis
– Help stakeholders take action, not just read numbers
6️⃣ Document Your Work
– Use Notion, Google Docs, or GitHub
– Write what you did, how, and why—it’ll save time later
7️⃣ Review & Reflect Weekly
– What did you learn? What confused you?
– Track mistakes + insights in a learning journal
💡 Pro Tip: Join data communities (Reddit, LinkedIn, Slack groups) to grow faster.
👍 Tap for more
1️⃣ Explore Real Datasets Regularly
– Use Kaggle, Data.gov, or Google Dataset Search
– Focus on different domains: sales, HR, marketing, etc.
2️⃣ Master the Art of Asking Questions
– Start with: What do we want to know?
– Then: What data do we need to answer it?
3️⃣ Use SQL & Excel Daily
– Practice joins, window functions, pivot tables, formulas
– Aim to solve 1 real-world query per day
4️⃣ Visualize Everything
– Use Power BI, Tableau, or Matplotlib
– Keep charts simple, clear, and insight-driven
5️⃣ Storytelling > Just Reporting
– Always add “So what?” to your analysis
– Help stakeholders take action, not just read numbers
6️⃣ Document Your Work
– Use Notion, Google Docs, or GitHub
– Write what you did, how, and why—it’ll save time later
7️⃣ Review & Reflect Weekly
– What did you learn? What confused you?
– Track mistakes + insights in a learning journal
💡 Pro Tip: Join data communities (Reddit, LinkedIn, Slack groups) to grow faster.
👍 Tap for more
❤26👍4👏2🥰1😁1
Which SQL command is used to add new records into a table?*
Anonymous Quiz
26%
a) UPDATE
2%
b) DELETE
70%
c) INSERT
2%
d) SELECT
❤10
What does a correlated subquery mean?
Anonymous Quiz
5%
a) A subquery executed only once
78%
b) A subquery that depends on the outer query for its values
10%
c) A subquery that returns multiple rows
7%
d) A subquery with UNION operation
❤4🤩1
Which of the following is used to combine the results of two SELECT statements and removes duplicates?
Anonymous Quiz
71%
UNION
29%
UNION ALL
❤5🥰1
Which SQL function would you use to find the number of days between two dates?
Anonymous Quiz
2%
a) NOW()
84%
b) DATEDIFF()
5%
c) SUBSTRING()
9%
d) COUNT()
❤5
What does the following SQL command do?
ALTER TABLE employees ADD COLUMN salary INT;
ALTER TABLE employees ADD COLUMN salary INT;
Anonymous Quiz
2%
a) Deletes the salary column
88%
b) Adds a new column named salary of type integer
9%
c) Changes salary column to integer
1%
d) Drops the table employees
❤3🥰1
Which constraint ensures that a column cannot have NULL values?
Anonymous Quiz
29%
UNIQUE
71%
NOT NULL
❤5🥰1
Which of the following statements about Views is TRUE?
Anonymous Quiz
9%
a) Views store data physically
8%
b) Views cannot be updated
75%
c) Views are virtual tables created by a query
8%
d) Views automatically index the data
❤8🔥2
⏰ Quick Reminder!
🚀 Agent.ai Challenge is LIVE!
💰 Win up to $50,000 — no code needed!
👥 Open to all. Limited time!
👇 Register now →
https://shorturl.at/lSfTv
Double Tap ❤️ for more AI Resources
🚀 Agent.ai Challenge is LIVE!
💰 Win up to $50,000 — no code needed!
👥 Open to all. Limited time!
👇 Register now →
https://shorturl.at/lSfTv
Double Tap ❤️ for more AI Resources
❤6👏2👍1
📊 Data Analyst Interview Cheat Sheet (2025 Edition)
✅ 1. SQL Essentials
Key Concepts:
• SELECT, WHERE, GROUP BY, HAVING
• JOINs (INNER, LEFT, RIGHT, FULL)
• Window Functions (ROW_NUMBER, RANK, LEAD/LAG)
• Subqueries & CTEs
• Aggregations & Filtering
Practice Queries:
• Top 3 customers by revenue
• Monthly active users
• Running total or moving average
• Products never sold
✅ 2. Excel/Spreadsheet Skills
Key Concepts:
• VLOOKUP, XLOOKUP, INDEX-MATCH
• IF, AND, OR logic
• Pivot Tables & Charts
• Conditional Formatting
• Data Cleaning Functions (TRIM, CLEAN, TEXTSPLIT)
✅ 3. Data Visualization
Tools: Tableau, Power BI, Excel
Key Charts:
• Line chart – Trend
• Bar chart – Comparison
• Pie chart – Distribution
• Scatter plot – Correlation
• Heatmaps
Best Practices:
• Keep visuals simple & clear
• Use color intentionally
• Add noscripts, labels, tooltips
✅ 4. Statistics & Analytics Concepts
Key Concepts:
• Mean, Median, Mode
• Standard Deviation, Variance
• Correlation vs Causation
• Hypothesis Testing (p-value, t-test)
• A/B Testing basics
• Confidence Intervals
✅ 5. Python for Data Analysis
Key Libraries:
• Pandas – data manipulation
• NumPy – numerical ops
• Matplotlib/Seaborn – visualization
• SQLAlchemy – database access
Common Tasks:
• Read CSV/excel files
• GroupBy and aggregations
• Handling missing data
• Merge/join datasets
• Create charts
✅ 6. Business Acumen & Communication
Key Skills:
• Ask the right questions
• Translate data into insights
• Storytelling with data
• Build dashboards with KPIs
• Communicate with non-tech stakeholders
✅ 7. Tools to Know
• Excel / Google Sheets
• SQL (MySQL, PostgreSQL, etc.)
• Tableau / Power BI
• Python / R
• Jupyter / VS Code
👍 Tap ❤️ for more!
✅ 1. SQL Essentials
Key Concepts:
• SELECT, WHERE, GROUP BY, HAVING
• JOINs (INNER, LEFT, RIGHT, FULL)
• Window Functions (ROW_NUMBER, RANK, LEAD/LAG)
• Subqueries & CTEs
• Aggregations & Filtering
Practice Queries:
• Top 3 customers by revenue
• Monthly active users
• Running total or moving average
• Products never sold
✅ 2. Excel/Spreadsheet Skills
Key Concepts:
• VLOOKUP, XLOOKUP, INDEX-MATCH
• IF, AND, OR logic
• Pivot Tables & Charts
• Conditional Formatting
• Data Cleaning Functions (TRIM, CLEAN, TEXTSPLIT)
✅ 3. Data Visualization
Tools: Tableau, Power BI, Excel
Key Charts:
• Line chart – Trend
• Bar chart – Comparison
• Pie chart – Distribution
• Scatter plot – Correlation
• Heatmaps
Best Practices:
• Keep visuals simple & clear
• Use color intentionally
• Add noscripts, labels, tooltips
✅ 4. Statistics & Analytics Concepts
Key Concepts:
• Mean, Median, Mode
• Standard Deviation, Variance
• Correlation vs Causation
• Hypothesis Testing (p-value, t-test)
• A/B Testing basics
• Confidence Intervals
✅ 5. Python for Data Analysis
Key Libraries:
• Pandas – data manipulation
• NumPy – numerical ops
• Matplotlib/Seaborn – visualization
• SQLAlchemy – database access
Common Tasks:
• Read CSV/excel files
• GroupBy and aggregations
• Handling missing data
• Merge/join datasets
• Create charts
✅ 6. Business Acumen & Communication
Key Skills:
• Ask the right questions
• Translate data into insights
• Storytelling with data
• Build dashboards with KPIs
• Communicate with non-tech stakeholders
✅ 7. Tools to Know
• Excel / Google Sheets
• SQL (MySQL, PostgreSQL, etc.)
• Tableau / Power BI
• Python / R
• Jupyter / VS Code
👍 Tap ❤️ for more!
❤19👍5🥰2👏2
✅ 20 Data Analyst Interview Questions
1. What is data analysis
The process of inspecting, cleaning, transforming, and modeling data to discover useful information and support decision-making.
2. What tools do data analysts commonly use
Excel, SQL, Python, R, Tableau, Power BI, SAS, and Google Sheets. Each tool serves different purposes like querying, visualization, or statistical analysis.
3. What is the difference between data analyst and data scientist
• Data Analyst: Focuses on interpreting existing data and generating reports
• Data Scientist: Builds predictive models and algorithms using advanced techniques
4. How do you handle missing data
• Remove rows
• Impute values (mean, median, mode)
• Use algorithms that handle missing data
• Flag missing values for analysis
5. What is the difference between INNER JOIN and LEFT JOIN in SQL
• INNER JOIN: Returns only matching rows
• LEFT JOIN: Returns all rows from the left table and matching rows from the right
6. What is normalization in databases
Organizing data to reduce redundancy and improve integrity. Common forms: 1NF, 2NF, 3NF.
7. How do you ensure data quality
• Validate data sources
• Check for duplicates and missing values
• Use consistency checks
• Automate data cleaning pipelines
8. What is the difference between structured and unstructured data
• Structured: Organized in rows and columns (e.g., SQL tables)
• Unstructured: No fixed format (e.g., images, emails, social media)
9. What is exploratory data analysis (EDA)
Initial investigation of data using visualizations and statistics to uncover patterns, anomalies, and relationships.
10. How do you visualize data effectively
Choose the right chart type (bar, line, pie, scatter), use clear labels, avoid clutter, and highlight key insights.
11. What is the difference between COUNT, COUNT(*) and COUNT(column) in SQL
• COUNT(*): Counts all rows
• COUNT(column): Counts non-null values in that column
12. What is a pivot table
A tool in Excel or BI platforms that summarizes data by grouping and aggregating values dynamically.
13. How do you calculate correlation between two variables
Use Pearson correlation coefficient in Python (df.corr()), R, or Excel. Values range from -1 to +1.
14. What is the difference between a dashboard and a report
• Dashboard: Interactive, real-time visual summary
• Report: Static or scheduled document with detailed analysis
15. What is the purpose of GROUP BY in SQL
Used to aggregate data across rows that share a common value in one or more columns.
16. What is the difference between WHERE and HAVING in SQL
• WHERE: Filters rows before aggregation
• HAVING: Filters groups after aggregation
17. How do you handle outliers in data
• Remove or cap them
• Use robust statistical methods
• Transform data (e.g., log scale)
18. What is the difference between mean, median, and mode
• Mean: Average
• Median: Middle value
• Mode: Most frequent value
19. What is time series analysis
Analyzing data points collected over time to identify trends, seasonality, and make forecasts.
20. How do you communicate insights to non-technical stakeholders
Use simple language, visualizations, storytelling, and focus on business impact rather than technical jargon.
👍 React for more Interview Resources
1. What is data analysis
The process of inspecting, cleaning, transforming, and modeling data to discover useful information and support decision-making.
2. What tools do data analysts commonly use
Excel, SQL, Python, R, Tableau, Power BI, SAS, and Google Sheets. Each tool serves different purposes like querying, visualization, or statistical analysis.
3. What is the difference between data analyst and data scientist
• Data Analyst: Focuses on interpreting existing data and generating reports
• Data Scientist: Builds predictive models and algorithms using advanced techniques
4. How do you handle missing data
• Remove rows
• Impute values (mean, median, mode)
• Use algorithms that handle missing data
• Flag missing values for analysis
5. What is the difference between INNER JOIN and LEFT JOIN in SQL
• INNER JOIN: Returns only matching rows
• LEFT JOIN: Returns all rows from the left table and matching rows from the right
6. What is normalization in databases
Organizing data to reduce redundancy and improve integrity. Common forms: 1NF, 2NF, 3NF.
7. How do you ensure data quality
• Validate data sources
• Check for duplicates and missing values
• Use consistency checks
• Automate data cleaning pipelines
8. What is the difference between structured and unstructured data
• Structured: Organized in rows and columns (e.g., SQL tables)
• Unstructured: No fixed format (e.g., images, emails, social media)
9. What is exploratory data analysis (EDA)
Initial investigation of data using visualizations and statistics to uncover patterns, anomalies, and relationships.
10. How do you visualize data effectively
Choose the right chart type (bar, line, pie, scatter), use clear labels, avoid clutter, and highlight key insights.
11. What is the difference between COUNT, COUNT(*) and COUNT(column) in SQL
• COUNT(*): Counts all rows
• COUNT(column): Counts non-null values in that column
12. What is a pivot table
A tool in Excel or BI platforms that summarizes data by grouping and aggregating values dynamically.
13. How do you calculate correlation between two variables
Use Pearson correlation coefficient in Python (df.corr()), R, or Excel. Values range from -1 to +1.
14. What is the difference between a dashboard and a report
• Dashboard: Interactive, real-time visual summary
• Report: Static or scheduled document with detailed analysis
15. What is the purpose of GROUP BY in SQL
Used to aggregate data across rows that share a common value in one or more columns.
16. What is the difference between WHERE and HAVING in SQL
• WHERE: Filters rows before aggregation
• HAVING: Filters groups after aggregation
17. How do you handle outliers in data
• Remove or cap them
• Use robust statistical methods
• Transform data (e.g., log scale)
18. What is the difference between mean, median, and mode
• Mean: Average
• Median: Middle value
• Mode: Most frequent value
19. What is time series analysis
Analyzing data points collected over time to identify trends, seasonality, and make forecasts.
20. How do you communicate insights to non-technical stakeholders
Use simple language, visualizations, storytelling, and focus on business impact rather than technical jargon.
👍 React for more Interview Resources
❤19👍4
Top Excel Formulas Every Data Analyst Should Know
SUM():
Purpose: Adds up a range of numbers.
Example: =SUM(A1:A10)
AVERAGE():
Purpose: Calculates the average of a range of numbers.
Example: =AVERAGE(B1:B10)
COUNT():
Purpose: Counts the number of cells containing numbers.
Example: =COUNT(C1:C10)
IF():
Purpose: Returns one value if a condition is true, and another if false.
Example: =IF(A1 > 10, "Yes", "No")
VLOOKUP():
Purpose: Searches for a value in the first column and returns a value in the same row from another column.
Example: =VLOOKUP(D1, A1:B10, 2, FALSE)
HLOOKUP():
Purpose: Searches for a value in the first row and returns a value in the same column from another row.
Example: =HLOOKUP("Sales", A1:F5, 3, FALSE)
INDEX():
Purpose: Returns the value of a cell based on row and column numbers.
Example: =INDEX(A1:C10, 2, 3)
MATCH():
Purpose: Searches for a value and returns its position in a range.
Example: =MATCH("Product B", A1:A10, 0)
CONCATENATE() or CONCAT():
Purpose: Joins multiple text strings into one.
Example: =CONCATENATE(A1, " ", B1)
TEXT():
Purpose: Formats numbers or dates as text.
Example: =TEXT(A1, "dd/mm/yyyy")
Excel Resources: t.me/excel_data
I have curated Data Analytics Resources 👇👇
https://whatsapp.com/channel/0029VaGgzAk72WTmQFERKh02
SUM():
Purpose: Adds up a range of numbers.
Example: =SUM(A1:A10)
AVERAGE():
Purpose: Calculates the average of a range of numbers.
Example: =AVERAGE(B1:B10)
COUNT():
Purpose: Counts the number of cells containing numbers.
Example: =COUNT(C1:C10)
IF():
Purpose: Returns one value if a condition is true, and another if false.
Example: =IF(A1 > 10, "Yes", "No")
VLOOKUP():
Purpose: Searches for a value in the first column and returns a value in the same row from another column.
Example: =VLOOKUP(D1, A1:B10, 2, FALSE)
HLOOKUP():
Purpose: Searches for a value in the first row and returns a value in the same column from another row.
Example: =HLOOKUP("Sales", A1:F5, 3, FALSE)
INDEX():
Purpose: Returns the value of a cell based on row and column numbers.
Example: =INDEX(A1:C10, 2, 3)
MATCH():
Purpose: Searches for a value and returns its position in a range.
Example: =MATCH("Product B", A1:A10, 0)
CONCATENATE() or CONCAT():
Purpose: Joins multiple text strings into one.
Example: =CONCATENATE(A1, " ", B1)
TEXT():
Purpose: Formats numbers or dates as text.
Example: =TEXT(A1, "dd/mm/yyyy")
Excel Resources: t.me/excel_data
I have curated Data Analytics Resources 👇👇
https://whatsapp.com/channel/0029VaGgzAk72WTmQFERKh02
❤15👍2👏1
❌ SQL alone won’t make you a Data Analyst
❌ SQL won’t guarantee you a 20 LPA job
❌ SQL cannot be mastered in one weekend
❌ SQL is not just “SELECT * FROM table”
❌ SQL isn’t only for technical people
❌ SQL is not outdated or getting replaced
But here’s what SQL *can* do:
✔️ SQL helps you handle millions of rows with ease
✔️ SQL empowers you to extract real insights from raw data
✔️ SQL makes you independent of Excel limitations
✔️ SQL lets you ask deep, complex business questions
✔️ SQL is the foundation of most data tools (Power BI, Tableau, Python, etc.)
✔️ SQL is a must-have skill for data professionals
✔️ SQL is trusted by companies across the globe
Right mindset = Right learning path
React ❤️ if you agree
❌ SQL won’t guarantee you a 20 LPA job
❌ SQL cannot be mastered in one weekend
❌ SQL is not just “SELECT * FROM table”
❌ SQL isn’t only for technical people
❌ SQL is not outdated or getting replaced
But here’s what SQL *can* do:
✔️ SQL helps you handle millions of rows with ease
✔️ SQL empowers you to extract real insights from raw data
✔️ SQL makes you independent of Excel limitations
✔️ SQL lets you ask deep, complex business questions
✔️ SQL is the foundation of most data tools (Power BI, Tableau, Python, etc.)
✔️ SQL is a must-have skill for data professionals
✔️ SQL is trusted by companies across the globe
Right mindset = Right learning path
React ❤️ if you agree
❤41👍8
✅ 📊 Top 10 Data Analyst Interview Questions
1️⃣ What is Data Wrangling?
Answer: It's the process of cleaning, structuring, and enriching raw data into a desired format for analysis. It includes handling nulls, removing duplicates, and standardizing formats.
2️⃣ How is Excel used in Data Analysis?
Answer: Excel is used for quick data cleaning, pivot tables, basic stats, visualizations, and what-if analysis.
3️⃣ What are the different types of data?
Answer:
- Structured: Organized in rows/columns (e.g. databases)
- Unstructured: No format (e.g. text, images)
- Semi-structured: Tags or markers (e.g. JSON, XML)
4️⃣ Define Normalization. Why is it important?
Answer: It's the process of organizing data to reduce redundancy. It ensures consistency and optimizes storage.
5️⃣ What is the difference between WHERE and HAVING in SQL?
Answer:
- WHERE: Filters rows before aggregation
- HAVING: Filters groups after aggregation
6️⃣ What is the use of GROUP BY in SQL?
Answer: It groups rows with the same values in specified columns, often used with aggregate functions like COUNT(), SUM(), AVG().
7️⃣ What is an Outlier? How do you detect it?
Answer: An outlier is a data point that differs significantly from others. Detection methods: IQR, Z-score, boxplots.
8️⃣ How do you prioritize tasks when handling multiple projects?
Answer: By assessing deadlines, impact, complexity, and using tools like Trello, Notion, or Excel trackers.
9️⃣ What are Data Dashboards?
Answer: Visual interfaces that display key metrics and KPIs in real-time, used for quick business decision-making.
🔟 What’s the difference between OLAP and OLTP?
Answer:
- OLAP (Analytical): Used for complex queries & reporting
- OLTP (Transactional): Used for real-time data processing (e.g. banking systems)
💡 Pro Tip: Be ready to explain your thought process with real-life projects or case studies during interviews!
👍 React ❤️ if this helped!
1️⃣ What is Data Wrangling?
Answer: It's the process of cleaning, structuring, and enriching raw data into a desired format for analysis. It includes handling nulls, removing duplicates, and standardizing formats.
2️⃣ How is Excel used in Data Analysis?
Answer: Excel is used for quick data cleaning, pivot tables, basic stats, visualizations, and what-if analysis.
3️⃣ What are the different types of data?
Answer:
- Structured: Organized in rows/columns (e.g. databases)
- Unstructured: No format (e.g. text, images)
- Semi-structured: Tags or markers (e.g. JSON, XML)
4️⃣ Define Normalization. Why is it important?
Answer: It's the process of organizing data to reduce redundancy. It ensures consistency and optimizes storage.
5️⃣ What is the difference between WHERE and HAVING in SQL?
Answer:
- WHERE: Filters rows before aggregation
- HAVING: Filters groups after aggregation
6️⃣ What is the use of GROUP BY in SQL?
Answer: It groups rows with the same values in specified columns, often used with aggregate functions like COUNT(), SUM(), AVG().
7️⃣ What is an Outlier? How do you detect it?
Answer: An outlier is a data point that differs significantly from others. Detection methods: IQR, Z-score, boxplots.
8️⃣ How do you prioritize tasks when handling multiple projects?
Answer: By assessing deadlines, impact, complexity, and using tools like Trello, Notion, or Excel trackers.
9️⃣ What are Data Dashboards?
Answer: Visual interfaces that display key metrics and KPIs in real-time, used for quick business decision-making.
🔟 What’s the difference between OLAP and OLTP?
Answer:
- OLAP (Analytical): Used for complex queries & reporting
- OLTP (Transactional): Used for real-time data processing (e.g. banking systems)
💡 Pro Tip: Be ready to explain your thought process with real-life projects or case studies during interviews!
👍 React ❤️ if this helped!
❤23