Data Analytics – Telegram
Data Analytics
108K subscribers
126 photos
2 files
791 links
Perfect channel to learn Data Analytics

Learn SQL, Python, Alteryx, Tableau, Power BI and many more

For Promotions: @coderfun @love_data
Download Telegram
Data Analytics
Data analyst interview Part-2 6. What is the difference between a WHERE and HAVING clause in SQL? Answer: WHERE is used to filter records before aggregation (used with SELECT, UPDATE, DELETE). HAVING is used to filter records after aggregation (used with…
Data Analyst Interview Series Part-3

11. What is the difference between UNION and UNION ALL in SQL?

Answer:
UNION combines results from two queries and removes duplicates.
UNION ALL combines results but keeps duplicates for better performance.

Example:

SELECT CustomerID FROM Orders_A
UNION
SELECT CustomerID FROM Orders_B; -- Removes duplicates

SELECT CustomerID FROM Orders_A
UNION ALL
SELECT CustomerID FROM Orders_B; -- Keeps duplicates


12. What are common DAX functions in Power BI?

Answer:

DAX (Data Analysis Expressions) is used in Power BI for calculations.

Common DAX functions include:

SUM() – Adds up values in a column.
AVERAGE() – Finds the mean value.
COUNT() – Counts the number of rows.
CALCULATE() – Modifies a measure based on conditions.
FILTER() – Returns a subset of data.

Example:

TotalSales = SUM(Sales[Amount]) FilteredSales = CALCULATE(SUM(Sales[Amount]), Sales[Region] = "North") 


13. How do you use VLOOKUP in Excel?

Answer:

VLOOKUP searches for a value in the first column of a range and returns a value from another column.

Syntax:

=VLOOKUP(lookup_value, table_array, col_index_num, [range_lookup])

Example:
To find the price of a product in a table:

=VLOOKUP("ProductA", A2:C10, 2, FALSE)

14. What is the difference between a Heatmap and a Scatter Plot?

Answer:

Heatmap: Uses color intensity to represent values across a matrix. Used for correlation analysis.

Scatter Plot: Shows relationships between two continuous variables using dots. Used for trend analysis.

Example:
A Heatmap can show sales performance by region and product category.
A Scatter Plot can show sales vs. profit for different stores.

15. How do you read a CSV file into Pandas in Python?

Answer:
You can read a CSV file using pandas.read_csv():

import pandas as pd df = pd.read_csv("data.csv") print(df.head()) # Displays first 5 rows 


To handle missing values:

df = pd.read_csv("data.csv", na_values=["NA", "Missing"]) 


To select specific columns:

df = pd.read_csv("data.csv", usecols=["Name", "Sales"])


I have curated best 80+ top-notch Data Analytics Resources 👇👇
https://news.1rj.ru/str/DataSimplifier

Like this post for if you want me to continue the interview series 👍♥️

Share with credits: https://news.1rj.ru/str/sqlspecialist

Hope it helps :)
👍87
Top companies currently hiring data analysts

Based on the current job market in 2025, here are the top companies hiring data analysts:

## Top Tech Companies

- Meta: Investing heavily in AI with significant GPU investments
- Amazon: Offers diverse data analyst roles with complex responsibilities
- Google (Alphabet): Leverages massive data ecosystems
- JP Morgan Chase & Co.: Strong focus on data-driven banking transformation

## Specialized Data Analytics Firms

- Tiger Analytics: Specializes in AI/ML solutions
- SG Analytics: Provides data-driven insights
- Monte Carlo Data: Focuses on data observability
- CB Insights: Excels in market intelligence

## Emerging Opportunities

Companies like Samsara, ScienceSoft, and Forage are also actively recruiting data analysts, offering competitive salaries ranging from $85,000 to $207,000 annually.

I have curated best 80+ top-notch Data Analytics Resources 👇👇
https://news.1rj.ru/str/DataSimplifier

Like this post for if you want me to continue the interview series 👍♥️

Share with credits: https://news.1rj.ru/str/sqlspecialist

Hope it helps :)
👍199
Hi guys,

Many people charge too much to teach Excel, Power BI, SQL, Python & Tableau but my mission is to break down barriers. I have shared complete learning series to start your data analytics journey from scratch.

For those of you who are new to this channel, here are some quick links to navigate this channel easily.

Data Analyst Learning Plan 👇
https://news.1rj.ru/str/sqlspecialist/752

Python Learning Plan 👇
https://news.1rj.ru/str/sqlspecialist/749

Power BI Learning Plan 👇
https://news.1rj.ru/str/sqlspecialist/745

SQL Learning Plan 👇
https://news.1rj.ru/str/sqlspecialist/738

SQL Learning Series 👇
https://news.1rj.ru/str/sqlspecialist/567

Excel Learning Series 👇
https://news.1rj.ru/str/sqlspecialist/664

Power BI Learning Series 👇
https://news.1rj.ru/str/sqlspecialist/768

Python Learning Series 👇
https://news.1rj.ru/str/sqlspecialist/615

Tableau Essential Topics 👇
https://news.1rj.ru/str/sqlspecialist/667

Best Data Analytics Resources 👇
https://heylink.me/DataAnalytics

You can find more resources on Medium & Linkedin

Like for more ❤️

Thanks to all who support our channel and share it with friends & loved ones. You guys are really amazing.

Hope it helps :)
👍2512🔥4
Data Analytics
Day 25: Backup and Restore Strategies in SQL (Good to know concept) 1. Why Are Backups Important? Backups protect your database from accidental deletions, hardware failures, or cyberattacks. A good backup strategy ensures minimal downtime and data recovery…
Day 26: Pivoting and Unpivoting Data in SQL

1. What is Pivoting in SQL?

Pivoting converts row-based data into columns to create a structured report. It's commonly used in reporting and summarization.

2. How to Pivot Data?

Example: You have a sales table with columns Month, Product, and Sales.

If you want to convert product names into columns and show total sales per month, use:

SELECT Month, SUM(CASE WHEN Product = 'Shoes' THEN Sales ELSE 0 END) AS Shoes, SUM(CASE WHEN Product = 'Shirts' THEN Sales ELSE 0 END) AS Shirts FROM Sales GROUP BY Month; 


3. What is Unpivoting in SQL?

Unpivoting converts columns back into rows, which is useful for normalizing data.

Example: If you have sales data stored in separate columns (Shoes, Shirts), but you need a column named Product instead, use:

SELECT Month, Product, Sales FROM SalesTable UNPIVOT (Sales FOR Product IN (Shoes, Shirts)) AS unpvt; 


4. When to Use Pivot and Unpivot?

Pivot when you need structured reports with categories as columns.

Unpivot when working with dynamic columns and data normalization.

Action Plan for Today:

1️⃣ Write a PIVOT query for summarizing data.

2️⃣ Use UNPIVOT to transform columns into rows.

3️⃣ Experiment with SUM, COUNT, and AVG while pivoting data.

Here you can find SQL Interview Resources👇
https://news.1rj.ru/str/sqlanalyst

Like this post if you want me to continue this SQL series 👍♥️

Share with credits: https://news.1rj.ru/str/sqlspecialist

Hope it helps :)
👍1813🥰1
Data Analytics
Day 26: Pivoting and Unpivoting Data in SQL 1. What is Pivoting in SQL? Pivoting converts row-based data into columns to create a structured report. It's commonly used in reporting and summarization. 2. How to Pivot Data? Example: You have a sales table…
Day 27: Writing Stored Procedures and Functions in SQL

1. What Are Stored Procedures?

A Stored Procedure is a reusable block of SQL code that executes multiple SQL statements in a single call. It improves performance, security, and maintainability.

2. Creating a Stored Procedure

📌 Basic Syntax

CREATE PROCEDURE GetEmployeeDetails AS BEGIN SELECT * FROM Employees; END; 

📌 Executing the Procedure

EXEC GetEmployeeDetails;


📌 Stored Procedure with Parameters

CREATE PROCEDURE GetEmployeeByID (@EmpID INT) AS BEGIN SELECT * FROM Employees WHERE EmployeeID = @EmpID; END; 


📌 Executing with a Parameter

EXEC GetEmployeeByID 101; 


3. What Are SQL Functions?

Functions return a single value and are used inside queries. Unlike stored procedures, functions cannot modify the database.

📌 Creating a Function

CREATE FUNCTION GetTotalSalary() RETURNS INT AS BEGIN DECLARE @TotalSalary INT; SELECT @TotalSalary = SUM(Salary) FROM Employees; RETURN @TotalSalary; END; 


📌 Calling the Function

SELECT dbo.GetTotalSalary();


4. Differences: Stored Procedures vs Functions

Stored Procedures: Perform multiple actions, support transactions, and can modify data.
Functions: Return a value, are used inside queries, and cannot change database state.

Action Plan for Today:

1️⃣ Create a Stored Procedure that retrieves filtered data.

2️⃣ Write a Function that calculates an aggregate value.

3️⃣ Compare performance differences between functions and stored procedures.

Here you can find SQL Interview Resources👇
https://news.1rj.ru/str/sqlanalyst

Like this post if you want me to continue this SQL series 👍♥️

Share with credits: https://news.1rj.ru/str/sqlspecialist

Hope it helps :)
9👍3
Data Analytics
Data Analyst Interview Series Part-3 11. What is the difference between UNION and UNION ALL in SQL? Answer: UNION combines results from two queries and removes duplicates. UNION ALL combines results but keeps duplicates for better performance. Example:…
Data Analyst Interview Part-4

16. What is the difference between OLTP and OLAP?

Answer:
OLTP (Online Transaction Processing): Handles real-time, transactional data (e.g., banking systems, e-commerce). Focuses on fast inserts, updates, and deletes.

OLAP (Online Analytical Processing): Used for complex queries, reporting, and business intelligence (e.g., data warehouses, dashboards). Optimized for data retrieval.

Example:
An OLTP system records a customer's purchase in an online store.
An OLAP system analyzes total sales trends for different products over time.

17. What is the difference between RANK(), DENSE_RANK(), and ROW_NUMBER() in SQL?

Answer:
These functions assign ranks to rows in a result set:

RANK(): Assigns a rank but skips numbers if there are duplicates.

DENSE_RANK(): Assigns a rank without skipping numbers.

ROW_NUMBER(): Assigns a unique row number to each row, even if values are the same.

Example:

SELECT Employee, Salary, RANK() OVER (ORDER BY Salary DESC) AS Rank, DENSE_RANK() OVER (ORDER BY Salary DESC) AS DenseRank, ROW_NUMBER() OVER (ORDER BY Salary DESC) AS RowNum FROM Employees; 


18. What are Measures and Dimensions in Tableau?

Answer:
Measures: Numeric values that can be aggregated (e.g., Sales, Profit, Quantity).
Dimensions: Categorical fields that define data granularity (e.g., Product, Region, Date).

Example:
"Sales" is a Measure (sum of sales).
"Customer Name" is a Dimension (used to group data).

19. How do you remove outliers from a dataset in Python?

Answer:
Outliers can be removed using statistical methods:

Using IQR (Interquartile Range) Method:

import pandas as pd import numpy as np Q1 = df["Sales"].quantile(0.25) Q3 = df["Sales"].quantile(0.75) IQR = Q3 - Q1 df_cleaned = df[(df["Sales"] >= Q1 - 1.5*IQR) & (df["Sales"] <= Q3 + 1.5*IQR)] 


Using Z-Score Method:

from scipy import stats df_cleaned = df[(np.abs(stats.zscore(df["Sales"])) < 3)] 


20. What is the difference between INNER JOIN and LEFT JOIN?

Answer:
INNER JOIN returns only matching records from both tables.

LEFT JOIN returns all records from the left table and matching records from the right table (fills NULL if no match).

Example:

-- INNER JOIN: Returns only matching Customers with Orders SELECT Customers.Name, Orders.OrderID FROM Customers INNER JOIN Orders ON Customers.CustomerID = Orders.CustomerID; -- LEFT JOIN: Returns all Customers, even if they have no Orders SELECT Customers.Name, Orders.OrderID FROM Customers LEFT JOIN Orders ON Customers.CustomerID = Orders.CustomerID; 


I have curated best 80+ top-notch Data Analytics Resources 👇👇
https://whatsapp.com/channel/0029VaGgzAk72WTmQFERKh02

Like this post for if you want me to continue the interview series 👍♥️

Share with credits: https://news.1rj.ru/str/sqlspecialist

Hope it helps :)
11👍7
Data Analytics
Day 27: Writing Stored Procedures and Functions in SQL 1. What Are Stored Procedures? A Stored Procedure is a reusable block of SQL code that executes multiple SQL statements in a single call. It improves performance, security, and maintainability. 2. Creating…
Day 28: Integrating SQL with Other Tools (Python, Power BI, Tableau) & SQL in Big Data

1. Using SQL with Python

Python is widely used to interact with databases via libraries like sqlite3, SQLAlchemy, and pandas.

📌 Example: Connecting to a Database in Python


import sqlite3 # Connect to the database conn = sqlite3.connect('my_database.db') cursor = conn.cursor() # Execute a query cursor.execute("SELECT * FROM Employees") rows = cursor.fetchall() # Print results for row in rows: print(row) conn.close()


2. Using SQL with Power BI
Power BI allows direct SQL connections for data visualization.

📌 Steps to Connect SQL to Power BI:

1️⃣ Open Power BI Desktop.
2️⃣ Click on Get Data → Select SQL Server.
3️⃣ Enter Server Name & Database Name.
4️⃣ Choose DirectQuery or Import Mode.
5️⃣ Load and create visualizations using SQL queries.

3. Using SQL with Tableau

Tableau connects with SQL databases to create interactive dashboards.

📌 Steps to Connect SQL to Tableau:

1️⃣ Open Tableau → Click Connect to Data.
2️⃣ Choose Microsoft SQL Server, MySQL, or PostgreSQL.
3️⃣ Enter Database Credentials.
4️⃣ Use SQL queries to fetch data and build charts & graphs.

4. SQL in Big Data (Introduction to NoSQL)

SQL is not always suitable for big data processing. NoSQL databases like MongoDB, Cassandra, and Hadoop are used for scalable, unstructured data.

📌 SQL vs NoSQL:

SQL: Structured data, strict schema, ACID compliance (e.g., MySQL, PostgreSQL).

NoSQL: Flexible schema, distributed storage, better for big data (e.g., MongoDB, Cassandra).

Action Plan for Today:

1️⃣ Try running a SQL query in Python.
2️⃣ Connect a SQL database to Power BI/Tableau.
3️⃣ Research the difference between SQL and NoSQL for big data.

Here you can find SQL Interview Resources👇
https://news.1rj.ru/str/sqlanalyst

Like this post if you want me to continue this SQL series 👍♥️

Share with credits: https://news.1rj.ru/str/sqlspecialist

Hope it helps :)
10👍8👏3
Data analyst interview questions Part-5:

21. What are Window Functions in SQL?

Window functions perform calculations across a set of rows related to the current row without collapsing the dataset like GROUP BY.

Common window functions:
RANK() – Assigns a rank with gaps for ties.
DENSE_RANK() – Assigns a rank without gaps.
ROW_NUMBER() – Assigns a unique row number.
LEAD() / LAG() – Access next/previous row values.

Example:

SELECT Employee, Department, Salary, RANK() OVER (PARTITION BY Department ORDER BY Salary DESC) AS Rank FROM Employees; 


22. How do you create a calculated column in Power BI?

A calculated column is a new column created using DAX formulas.

Example:

Profit Margin = Sales[Profit] / Sales[Revenue]

Steps in Power BI:

Open Power BI Desktop.
Go to the Modeling tab → Click New Column.
Enter the DAX formula and press Enter.

23. How do you find duplicate values in Excel?

Methods to identify duplicates:

Conditional Formatting:

Select data → Click Home > Conditional Formatting > Highlight Duplicates.

Using COUNTIF formula: =IF(COUNTIF(A:A, A2) > 1, "Duplicate", "Unique")

Using Power Query:

Load data into Power Query → Use Group By to count duplicates.

24. What is the difference between a Line Chart and an Area Chart?

Line Chart: Shows trends over time using a continuous line.
Area Chart: Similar to a Line Chart but fills the area below the line with color, emphasizing volume.

Example:
A Line Chart shows monthly stock prices over time.
An Area Chart shows cumulative sales trends over time.

25. How do you merge two DataFrames in Pandas?

You can use merge() for SQL-like joins:

import pandas as pd df1 = pd.DataFrame({"ID": [1, 2, 3], "Name": ["Alice", "Bob", "Charlie"]}) df2 = pd.DataFrame({"ID": [1, 2, 4], "Salary": [50000, 60000, 70000]}) # INNER JOIN df_inner = df1.merge(df2, on="ID", how="inner") # LEFT JOIN df_left = df1.merge(df2, on="ID", how="left") print(df_inner) 


Common merge types:
how="inner" → Returns only matching rows.
how="left" → Keeps all rows from the left DataFrame.
how="right" → Keeps all rows from the right DataFrame.
how="outer" → Returns all rows, filling missing values with NaN.

Like this post for if you want me to continue the interview series 👍♥️

Share with credits: https://news.1rj.ru/str/sqlspecialist

Hope it helps :)
👍215
Data Analytics
Day 28: Integrating SQL with Other Tools (Python, Power BI, Tableau) & SQL in Big Data 1. Using SQL with Python Python is widely used to interact with databases via libraries like sqlite3, SQLAlchemy, and pandas. 📌 Example: Connecting to a Database in Python…
Day 29: Query Performance Tuning – Optimize SQL Queries

1. Why Optimize SQL Queries?

Efficient queries reduce execution time, improve database performance, and minimize resource usage.

2. Key Techniques for Query Optimization

📌 1. Use Indexes to Speed Up Searches

Indexes improve query performance by reducing the number of scanned rows.

Example: Creating an Index

CREATE INDEX idx_employee_name ON Employees(Name); 


Check Existing Indexes

SELECT * FROM sys.indexes WHERE object_id = OBJECT_ID('Employees'); 


📌 2. Avoid SELECT * (Specify Columns Instead)

Fetching all columns increases memory usage and slows down queries.

Bad Query:
SELECT * FROM Employees;

Optimized Query:
SELECT Name, Salary FROM Employees;

📌 3. Use WHERE Instead of HAVING for Filtering

WHERE filters before grouping, while HAVING filters after aggregation, making WHERE more efficient.

Bad Query:

SELECT Department, COUNT(*) FROM Employees GROUP BY Department HAVING COUNT(*) > 5; 


Optimized Query:

SELECT Department, COUNT(*) FROM Employees WHERE Department IS NOT NULL GROUP BY Department; 


📌 4. Use EXISTS Instead of IN for Large Datasets

EXISTS stops searching after the first match, whereas IN scans the entire list.

Bad Query:

SELECT * FROM Employees WHERE DepartmentID IN (SELECT DepartmentID FROM Departments); 


Optimized Query:

SELECT * FROM Employees WHERE EXISTS (SELECT 1 FROM Departments WHERE Departments.DepartmentID = Employees.DepartmentID);


📌 5. Optimize JOINS by Selecting Required Columns

Avoid unnecessary columns and filters in JOIN queries.

Bad Query:

SELECT * FROM Employees e JOIN Departments d ON e.DepartmentID = d.DepartmentID; 


Optimized Query:

SELECT e.Name, d.DepartmentName FROM Employees e JOIN Departments d ON e.DepartmentID = d.DepartmentID; 


Action Plan for Today:

1️⃣ Create an index for a frequently searched column.
2️⃣ Rewrite a query to avoid SELECT *.
3️⃣ Experiment with EXISTS vs IN for filtering data.

Here you can find SQL Interview Resources👇
https://news.1rj.ru/str/sqlanalyst

Like this post if you want me to continue this SQL series 👍♥️

Share with credits: https://news.1rj.ru/str/sqlspecialist

Hope it helps :)
👍166🎉2
There’s one thing in common that Data Analysts did to land their first job

They never gave up

When things get tough and burnout starts to creep

- Take a small break (but get back into it)

- Don’t use the same applying strategies (switch it up)

- Understand you’re playing the long game

Don’t waste months of learning just to give up at the finish line

I have curated free Data Analytics Resources 👇👇
https://news.1rj.ru/str/DataSimplifier

Hope this helps you 😊
6👍4
You don't need to know everything about every data tool. Focus on what will help land you your job.

For Excel:
- IFS (all variations)
- XLOOKUP
- IMPORTRANGE (in GSheets)
- Pivot Tables
- Dynamic functions like TODAY()

For SQL:
- Sum
- Group By
- Window Functions
- CTEs
- Joins

For Tableau:
- Calculated Columns
- Sets
- Groups
- Formatting

For Power BI:
- Power Query for data transformation
- DAX (Data Analysis Expressions) for creating custom calculations
- Relationships between tables
- Creating interactive and dynamic dashboards
- Utilizing slicers and filters effectively

You can now find Data Analytics Resources on WhatsApp 👇👇
https://whatsapp.com/channel/0029VaGgzAk72WTmQFERKh02

Hope this helps you 😊
15👍7👏4🔥2🥰2
If you have time to learn...!

You have time to clean...!

Start from Scratch that !!!!

You have time to become a Data Analyst...!!


➜ learn Excel
➜ learn SQL
➜ learn either Power BI or Tableau
➜ learn what the heck ATS is and how to get around it
➜ learn to be ready for any interview question
➜ to build projects for a portfolio
➜ to put invest the time for your future
➜ to fail and pick yourself back up


And you don't need to do it all at once!

You can now find Data Analytics Resources on WhatsApp as well
👇👇
https://whatsapp.com/channel/0029VaGgzAk72WTmQFERKh02

Hope this helps you 😊
11👍9
Data Analytics
Day 29: Query Performance Tuning – Optimize SQL Queries 1. Why Optimize SQL Queries? Efficient queries reduce execution time, improve database performance, and minimize resource usage. 2. Key Techniques for Query Optimization 📌 1. Use Indexes to Speed…
Day 30: Final Review & SQL Projects

1. Recap of Key Topics

Week 1 (SQL Basics): SELECT, WHERE, ORDER BY, Aggregations, GROUP BY
Week 2 (Intermediate SQL): JOINS, Subqueries, String & Date Functions, UNION
Week 3 (Advanced SQL): CTEs, Window Functions, Transactions, Indexing
Week 4 (Database Management): Constraints, Performance Tuning, Stored Procedures

2. SQL Mini Projects (Hands-On Practice)

📌 Project 1: Employee Database Analysis


👉 Skills Used: Joins, Aggregations, Window Functions
🔹 Find the top 5 highest-paid employees in each department.
🔹 Calculate the average salary per department using GROUP BY.

Example Query:

SELECT DepartmentID, Name, Salary, 
RANK() OVER(PARTITION BY DepartmentID ORDER BY Salary DESC) AS SalaryRank
FROM Employees;


📌 Project 2: E-Commerce Sales Insights
👉 Skills Used: Joins, Date Functions, Subqueries
🔹 Find the total revenue generated in the last 6 months.
🔹 Identify the top-selling products.

Example Query:

SELECT ProductID, SUM(TotalAmount) AS TotalSales 
FROM Orders
WHERE OrderDate >= DATEADD(MONTH, -6, GETDATE())
GROUP BY ProductID
ORDER BY TotalSales DESC;


📌 Project 3: Customer Retention Analysis
👉 Skills Used: CTEs, Window Functions, Recursive Queries
🔹 Identify customers who made repeat purchases.
🔹 Find the time gap between first and last purchase.

Example Query:

WITH CustomerOrders AS (
SELECT CustomerID, OrderDate,
RANK() OVER (PARTITION BY CustomerID ORDER BY OrderDate ASC) AS FirstOrder
FROM Orders
)
SELECT CustomerID, MIN(OrderDate) AS FirstPurchase, MAX(OrderDate) AS LastPurchase
FROM CustomerOrders
GROUP BY CustomerID;


3. What’s Next?

🚀 Continue Improving: Solve problems on LeetCode, StrataScratch, SQLZoo & 365datascience
📈 Build Projects: Create a portfolio with real-world datasets
📚 Learn Advanced Topics: Explore Data Warehousing, BigQuery, NoSQL

🎉 Congratulations on Completing the 30-Day SQL Challenge! 🎉

If you found this useful, like this post and share it with your friends!

Here you can find SQL Interview Resources👇
https://news.1rj.ru/str/sqlanalyst

Share with credits: https://news.1rj.ru/str/sqlspecialist

Hope it helps :)
16👍12
Data Analyst Roadmap:

- Tier 1: Learn Excel & SQL
- Tier 2: Data Cleaning & Exploratory Data Analysis (EDA)
- Tier 3: Data Visualization & Business Intelligence (BI) Tools
- Tier 4: Statistical Analysis & Machine Learning Basics

Then build projects that include:

- Data Collection
- Data Cleaning
- Data Analysis
- Data Visualization

And if you want to make your portfolio stand out more:

- Solve real business problems
- Provide clear, impactful insights
- Create a presentation
- Record a video presentation
- Target specific industries
- Reach out to companies

Hope this helps you 😊
👍118
Data Analytics
Data analyst interview questions Part-5: 21. What are Window Functions in SQL? Window functions perform calculations across a set of rows related to the current row without collapsing the dataset like GROUP BY. Common window functions: RANK() – Assigns…
Data Analyst Interview Series Part-6

What is the difference between COUNT(), COUNT(*), and COUNT(DISTINCT) in SQL?

COUNT(column_name): Counts non-null values in a specific column.
COUNT(*): Counts all rows, including NULL values.
COUNT(DISTINCT column_name): Counts unique non-null values in a column.

Example:

SELECT COUNT(salary) FROM employees; -- Counts non-null salaries SELECT COUNT(*) FROM employees; -- Counts all rows SELECT COUNT(DISTINCT department) FROM employees; -- Counts unique departments 


What are the different types of filters in Power BI?

Power BI provides several types of filters:

Visual-level filters: Apply to a single visual.
Page-level filters: Apply to all visuals on a report page.
Report-level filters: Apply to the entire report.
Drillthrough filters: Allow focusing on specific details by navigating to another report page.
Top N filters: Show only the top N values based on a measure.
Example: Using a Top N filter to show the top 5 performing products in sales.

How do you use the VLOOKUP function in Excel?

VLOOKUP searches for a value in the first column of a range and returns a corresponding value from another column.

Syntax:
=VLOOKUP(lookup_value, table_array, col_index_num, [range_lookup])

Example: To find an employee’s department based on their ID:
=VLOOKUP(101, A2:C10, 2, FALSE)

101 → Value to search for
A2:C10 → Table range
2 → Column number to return data from
FALSE → Exact match

What is the difference between a Bar Chart and a Column Chart?

Bar Chart: Uses horizontal bars, suitable for comparing categories.
Column Chart: Uses vertical bars, good for showing trends over time.
Example:
A Bar Chart is useful for comparing sales across regions.
A Column Chart is useful for showing monthly revenue growth.

How do you handle missing data in Pandas?

Pandas provides multiple ways to handle missing data:

Remove missing values: df.dropna()

Fill missing values with a default value: df.fillna(0)

Fill missing values with the column mean: df['Salary'].fillna(df['Salary'].mean(), inplace=True)

Forward fill (copy previous value): df.fillna(method='ffill')

Backward fill (copy next value): df.fillna(method='bfill')

These methods ensure data quality while preventing errors in analysis.

Like this post for if you want me to continue the interview series 👍♥️

Share with credits: https://news.1rj.ru/str/sqlspecialist

Hope it helps :)
👍157
As a data analytics enthusiast, the end goal is not just to learn SQL, Power BI, Python, Excel, etc. but to get a job as a Data Analyst👨💻

Back then, when I was trying to switch my career into data analytics, I used to keep aside 1:00-1:30 hours of my day aside so that I can utilize those hours to search for job openings related to Data analytics and Business Intelligence.

Before going to bed, I used to utilize the first 30 minutes by going through various job portals such as naukri, LinkedIn, etc to find relevant openings and next 1 hour by collecting the keywords from the job denoscription to curate the resume accordingly and searching for profile of people who can refer me for the role.

📍 I will advise every aspiring data analyst to have a dedicated timing for searching and applying for the jobs.

📍To get into data analytics, applying for jobs is as important as learning and upskilling.

If you are not applying for the jobs, you are simply delaying your success to get into data analytics👨💻📊

Hope this helps you 😊
👍208👎1
Data Analytics
Data Analyst Interview Series Part-6 What is the difference between COUNT(), COUNT(*), and COUNT(DISTINCT) in SQL? COUNT(column_name): Counts non-null values in a specific column. COUNT(*): Counts all rows, including NULL values. COUNT(DISTINCT column_name):…
Data Analyst Interview Part-7

What is the difference between INNER JOIN, LEFT JOIN, RIGHT JOIN, and FULL JOIN in SQL?

INNER JOIN
: Returns only matching rows from both tables.
LEFT JOIN: Returns all rows from the left table and matching rows from the right table. If no match, NULL is returned.
RIGHT JOIN: Returns all rows from the right table and matching rows from the left table. If no match, NULL is returned.
FULL JOIN: Returns all rows from both tables, filling unmatched values with NULL.

Example:

SELECT employees.name, departments.department_name FROM employees INNER JOIN departments ON employees.department_id = departments.id; 


What are Measures and Calculated Columns in Power BI?

Measures: Used for dynamic calculations based on user interactions. They are created using DAX and are not stored in tables.
Calculated Columns: Are static columns created using DAX, stored in a table, and calculated when the data is loaded.

Example of a Measure (Total Sales):
Total Sales = SUM(Sales[Amount])

Example of a Calculated Column (Profit Margin):
Profit Margin = Sales[Profit] / Sales[Revenue]


How do you remove duplicate values in Excel?

To remove duplicates, use the Remove Duplicates feature:
Select the data range.
Click Data > Remove Duplicates.
Choose the columns to check for duplicates.
Click OK.

Alternatively, use a formula to highlight duplicates:
=COUNTIF(A:A, A2) > 1
For Power Query:

Load data into Power Query.
Select columns.
Click Remove Duplicates.


What is a Heatmap in Data Visualization?

A heatmap is a graphical representation where values are represented by colors. It is used to visualize density, intensity, or correlation between variables.

Common use cases:
Website click heatmaps to analyze user behavior.

Correlation heatmaps in data science to show relationships between variables.

In Python, create a heatmap using Seaborn:

import seaborn as sns import matplotlib.pyplot as plt sns.heatmap(df.corr(), annot=True, cmap="coolwarm") plt.show() 


What is the difference between APPLY and MAP functions in Pandas?

map(): Used for element-wise transformations on a single Pandas Series.
apply(): Used for applying functions to a Series or an entire DataFrame.

Example using map():
df['Salary'] = df['Salary'].map(lambda x: x * 1.1) # Increases salary by 10%

Example using apply():
df['New Salary'] = df['Salary'].apply(lambda x: x * 1.1)

For DataFrames, apply() can work on rows or columns:
df.apply(lambda x: x.max() - x.min(), axis=1) # Row-wise difference

Like this post for if you want me to continue the interview series 👍♥️

Share with credits: https://news.1rj.ru/str/sqlspecialist

Hope it helps :)
7👍7
Data Analyst Learning Plan in 2025

|-- Week 1: Introduction to Data Analysis
|   |-- Data Analysis Fundamentals
|   |   |-- What is Data Analysis?
|   |   |-- Types of Data Analysis
|   |   |-- Data Analysis Workflow
|   |-- Tools and Environment Setup
|   |   |-- Overview of Tools (Excel, SQL)
|   |   |-- Installing Necessary Software
|   |   |-- Setting Up Your Workspace
|   |-- First Data Analysis Project
|   |   |-- Data Collection
|   |   |-- Data Cleaning
|   |   |-- Basic Data Exploration
|
|-- Week 2: Data Collection and Cleaning
|   |-- Data Collection Methods
|   |   |-- Primary vs. Secondary Data
|   |   |-- Web Scraping
|   |   |-- APIs
|   |-- Data Cleaning Techniques
|   |   |-- Handling Missing Values
|   |   |-- Data Transformation
|   |   |-- Data Normalization
|   |-- Data Quality
|   |   |-- Ensuring Data Accuracy
|   |   |-- Data Integrity
|   |   |-- Data Validation
|
|-- Week 3: Data Exploration and Visualization
|   |-- Exploratory Data Analysis (EDA)
|   |   |-- Denoscriptive Statistics
|   |   |-- Data Distribution
|   |   |-- Correlation Analysis
|   |-- Data Visualization Basics
|   |   |-- Choosing the Right Chart Type
|   |   |-- Creating Basic Charts
|   |   |-- Customizing Visuals
|   |-- Advanced Data Visualization
|   |   |-- Interactive Dashboards
|   |   |-- Storytelling with Data
|   |   |-- Data Presentation Techniques
|
|-- Week 4: Statistical Analysis
|   |-- Introduction to Statistics
|   |   |-- Denoscriptive vs. Inferential Statistics
|   |   |-- Probability Theory
|   |-- Hypothesis Testing
|   |   |-- Null and Alternative Hypotheses
|   |   |-- t-tests, Chi-square tests
|   |   |-- p-values and Significance Levels
|   |-- Regression Analysis
|   |   |-- Simple Linear Regression
|   |   |-- Multiple Linear Regression
|   |   |-- Logistic Regression
|
|-- Week 5: SQL for Data Analysis
|   |-- SQL Basics
|   |   |-- SQL Syntax
|   |   |-- Select, Insert, Update, Delete
|   |-- Advanced SQL
|   |   |-- Joins and Subqueries
|   |   |-- Window Functions
|   |   |-- Stored Procedures
|   |-- SQL for Data Analysis
|   |   |-- Data Aggregation
|   |   |-- Data Transformation
|   |   |-- SQL for Reporting
|
|-- Week 6-8: Python for Data Analysis
|   |-- Python Basics
|   |   |-- Python Syntax
|   |   |-- Data Types and Structures
|   |   |-- Functions and Loops
|   |-- Data Analysis with Python
|   |   |-- NumPy for Numerical Data
|   |   |-- Pandas for Data Manipulation
|   |   |-- Matplotlib and Seaborn for Visualization
|   |-- Advanced Data Analysis in Python
|   |   |-- Time Series Analysis
|   |   |-- Machine Learning Basics
|   |   |-- Data Pipelines
|
|-- Week 9-11: Real-world Applications and Projects
|   |-- Capstone Project
|   |   |-- Project Planning
|   |   |-- Data Collection and Preparation
|   |   |-- Building and Optimizing Models
|   |   |-- Creating and Publishing Reports
|   |-- Case Studies
|   |   |-- Business Use Cases
|   |   |-- Industry-specific Solutions
|   |-- Integration with Other Tools
|   |   |-- Data Analysis with Excel
|   |   |-- Data Analysis with R
|   |   |-- Data Analysis with Tableau/Power BI
|
|-- Week 12: Post-Project Learning
|   |-- Data Analysis for Business Intelligence
|   |   |-- KPI Dashboards
|   |   |-- Financial Reporting
|   |   |-- Sales and Marketing Analytics
|   |-- Advanced Data Analysis Topics
|   |   |-- Big Data Technologies
|   |   |-- Cloud Data Warehousing
|   |-- Continuing Education
|   |   |-- Advanced Data Analysis Techniques
|   |   |-- Community and Forums
|   |   |-- Keeping Up with Updates
|
|-- Resources and Community
|   |-- Online Courses (edX, Udemy)
|   |-- Data Analysis Blogs
|   |-- Data Analysis Communities

Follow the Data Analysts - SQL, Tableau, Excel, Power BI & Python channel on WhatsApp: https://whatsapp.com/channel/0029VaGgzAk72WTmQFERKh02

Like this post for more content like this 👍♥️

Share with credits: https://news.1rj.ru/str/sqlspecialist

Hope it helps :)
20👍7🔥4👏4
Tableau Learning Plan in 2025

|-- Week 1: Introduction to Tableau
|   |-- Tableau Basics
|   |   |-- What is Tableau?
|   |   |-- Tableau Products Overview (Desktop, Public, Online, Server)
|   |   |-- Installing Tableau Desktop
|   |-- Setting up Tableau Environment
|   |   |-- Connecting to Data Sources
|   |   |-- Overview of the Tableau Interface
|   |   |-- Basic Operations (Open, Save, Close)
|   |-- First Tableau Dashboard
|   |   |-- Creating a Simple Dashboard
|   |   |-- Basic Charts and Visualizations
|   |   |-- Adding Filters and Actions
|
|-- Week 2: Data Preparation and Transformation
|   |-- Data Connections
|   |   |-- Connecting to Various Data Sources (Excel, SQL, Web Data)
|   |   |-- Data Extracts vs. Live Connections
|   |-- Data Cleaning and Shaping
|   |   |-- Data Interpreter
|   |   |-- Pivot and Unpivot Data
|   |   |-- Handling Null Values
|   |-- Data Blending and Joins
|   |   |-- Data Blending
|   |   |-- Joins and Relationships
|   |   |-- Union Data
|
|-- Week 3: Intermediate Tableau
|   |-- Advanced Calculations
|   |   |-- Calculated Fields
|   |   |-- Table Calculations
|   |   |-- Level of Detail (LOD) Expressions
|   |-- Advanced Visualizations
|   |   |-- Dual-Axis Charts
|   |   |-- Heat Maps and Highlight Tables
|   |   |-- Custom Geocoding
|   |-- Dashboard Interactivity
|   |   |-- Filters and Parameters
|   |   |-- Dashboard Actions
|   |   |-- Using Stories for Narrative
|
|-- Week 4: Data Visualization Best Practices
|   |-- Design Principles
|   |   |-- Choosing the Right Chart Type
|   |   |-- Color Theory
|   |   |-- Layout and Formatting
|   |-- Advanced Mapping
|   |   |-- Creating and Customizing Maps
|   |   |-- Using Map Layers
|   |   |-- Geographic Data Visualization
|   |-- Performance Optimization
|   |   |-- Optimizing Data Sources
|   |   |-- Reducing Load Times
|   |   |-- Extracts and Aggregations
|
|-- Week 5: Tableau for Business Intelligence
|   |-- Business Dashboards
|   |   |-- KPI Dashboards
|   |   |-- Sales and Revenue Dashboards
|   |   |-- Financial Dashboards
|   |-- Storytelling with Data
|   |   |-- Creating Data Stories
|   |   |-- Using Annotations
|   |   |-- Interactive Dashboards
|   |-- Sharing and Collaboration
|   |   |-- Publishing to Tableau Server/Public
|   |   |-- Tableau Online Collaboration
|   |   |-- Embedding Dashboards in Websites
|
|-- Week 6-8: Advanced Tableau Techniques
|   |-- Tableau Prep
|   |   |-- Data Preparation Workflows
|   |   |-- Cleaning and Shaping Data with Tableau Prep
|   |   |-- Combining Data from Multiple Sources
|   |-- Tableau and Scripting
|   |   |-- Using R and Python in Tableau
|   |   |-- Advanced Analytics with Scripting
|   |-- Advanced Analytics
|   |   |-- Forecasting
|   |   |-- Clustering
|   |   |-- Trend Lines
|   |-- Tableau Extensions
|   |   |-- Installing and Using Extensions
|   |   |-- Popular Extensions Overview
|
|-- Week 9-11: Real-world Applications and Projects
|   |-- Capstone Project
|   |   |-- Project Planning
|   |   |-- Data Collection and Preparation
|   |   |-- Building and Optimizing Dashboards
|   |   |-- Creating and Publishing Reports
|   |-- Case Studies
|   |   |-- Business Use Cases
|   |   |-- Industry-specific Solutions
|   |-- Integration with Other Tools
|   |   |-- Tableau and SQL
|   |   |-- Tableau and Excel
|   |   |-- Tableau and Power BI
|
|-- Week 12: Post-Project Learning
|   |-- Tableau Administration
|   |   |-- Managing Tableau Server
|   |   |-- User Roles and Permissions
|   |   |-- Monitoring and Auditing
|   |-- Advanced Tableau Topics
|   |   |-- New Tableau Features
|   |   |-- Latest Tableau Techniques
|   |   |-- Community and Forums
|   |   |-- Keeping Up with Updates
|
|-- Resources and Community
|   |-- Online Courses (Tableau Official)
|   |-- Tableau Blogs and Podcasts
|   |-- Tableau Communities

You can refer these Tableau Interview Resources to learn more: https://whatsapp.com/channel/0029VasYW1V5kg6z4EHOHG1t

Like this post for more resources ♥️

Share with credits: https://news.1rj.ru/str/sqlspecialist

Hope it helps :)
👍134
It takes time to learn Excel.

It takes time to master SQL.

It takes time to understand Power BI.

It takes time to analyze complex datasets.

It takes time to create impactful dashboards.

It takes time to work on real-world data projects.

It takes time to build a strong LinkedIn profile.

It takes time to prepare for technical and behavioral interviews.

Here’s one tip from someone who’s been through it all:

Be Patient. Good things take time ☺️

Keep building your skills and showcasing your value. Your time will come!
35👍17🔥2🥰2👎1
Data Analytics
Data Analyst Interview Part-7 What is the difference between INNER JOIN, LEFT JOIN, RIGHT JOIN, and FULL JOIN in SQL? INNER JOIN: Returns only matching rows from both tables. LEFT JOIN: Returns all rows from the left table and matching rows from the right…
Data Analyst Interview Part-8

How do you perform data cleaning in Python?

Data cleaning in Python involves several steps:

Handling missing data:
Drop missing values: df.dropna()
Fill missing values: df.fillna(value)

Removing duplicates:
df.drop_duplicates()

Converting data types:
df['column'] = df['column'].astype(int)

Handling outliers:
Use filtering or statistical methods to identify and remove outliers.

Standardizing or normalizing data:
Use libraries like scikit-learn for scaling:

from sklearn.preprocessing import StandardScaler scaler = StandardScaler() df[['column']] = scaler.fit_transform(df[['column']]) 


What is the use of GROUP BY in SQL?

GROUP BY is used to group rows that have the same values into summary rows, often with aggregate functions like COUNT, SUM, AVG, etc.

Example:

SELECT department, AVG(salary) FROM employees GROUP BY department; 

This will calculate the average salary for each department.

What is the significance of normalization in SQL?

Normalization is the process of organizing data in a way that reduces redundancy and dependency by dividing large tables into smaller ones and using relationships (foreign keys).

1st Normal Form (1NF): Ensures atomicity (no multi-valued fields).

2nd Normal Form (2NF): Ensures that all non-key attributes are fully dependent on the primary key.

3rd Normal Form (3NF): Ensures that no transitive dependencies exist (non-key attributes do not depend on other non-key attributes).

How do you handle time series data in Python?

Handling time series data in Python involves several steps:

Converting to DateTime format: df['date'] = pd.to_datetime(df['date'])

Resampling: To aggregate data at different frequencies:
df.set_index('date').resample('M').sum()

Decomposition: Split the time series into trend, seasonality, and residuals:

from statsmodels.tsa.seasonal
import seasonal_decompose decomposition = seasonal_decompose(df['value'], model='additive', period=12) decomposition.plot()


Plotting: Use libraries like Matplotlib and Seaborn to visualize trends over time.


What are the advantages of using Power BI over Excel?

Data Handling: Power BI can handle much larger datasets (millions of rows) compared to Excel.

Data Modeling: Power BI allows creating complex data models and relationships between tables, which is harder to manage in Excel.

Interactive Visualizations: Power BI offers interactive dashboards with drill-down capabilities.

Advanced Features: Power BI supports advanced analytics, DAX for custom calculations, and integration with other tools like Azure and SharePoint.

Scheduled Refresh: Power BI allows automatic data refresh from connected sources, while in Excel, this needs to be done manually.

Like this post for if you want me to continue the interview series 👍♥️

Share with credits: https://news.1rj.ru/str/sqlspecialist

Hope it helps :)
👍117