Data Science & Machine Learning – Telegram
Data Science & Machine Learning
73.6K subscribers
796 photos
2 videos
68 files
695 links
Join this channel to learn data science, artificial intelligence and machine learning with funny quizzes, interesting projects and amazing resources for free

For collaborations: @love_data
Download Telegram
Tableau Cheat Sheet

This Tableau cheatsheet is designed to be your quick reference guide for data visualization and analysis using Tableau. Whether you’re a beginner learning the basics or an experienced user looking for a handy resource, this cheatsheet covers essential topics.

1. Connecting to Data
   - Use *Connect* pane to connect to various data sources (Excel, SQL Server, Text files, etc.).

2. Data Preparation
   - Data Interpreter: Clean data automatically using the Data Interpreter.
   - Join Data: Combine data from multiple tables using joins (Inner, Left, Right, Outer).
   - Union Data: Stack data from multiple tables with the same structure.

3. Creating Views
   - Drag & Drop: Drag fields from the Data pane onto Rows, Columns, or Marks to create visualizations.
   - Show Me: Use the *Show Me* panel to select different visualization types.

4. Types of Visualizations
   - Bar Chart: Compare values across categories.
   - Line Chart: Display trends over time.
   - Pie Chart: Show proportions of a whole (use sparingly).
   - Map: Visualize geographic data.
   - Scatter Plot: Show relationships between two variables.

5. Filters
   - Dimension Filters: Filter data based on categorical values.
   - Measure Filters: Filter data based on numerical values.
   - Context Filters: Set a context for other filters to improve performance.

6. Calculated Fields
   - Create calculated fields to derive new data:
     - Example: Sales Growth = SUM([Sales]) - SUM([Previous Sales])

7. Parameters
   - Use parameters to allow user input and control measures dynamically.

8. Formatting
   - Format fonts, colors, borders, and lines using the Format pane for better visual appeal.

9. Dashboards
   - Combine multiple sheets into a dashboard using the *Dashboard* tab.
   - Use dashboard actions (filter, highlight, URL) to create interactivity.

10. Story Points
    - Create a story to guide users through insights with narrative and visualizations.

11. Publishing & Sharing
    - Publish dashboards to Tableau Server or Tableau Online for sharing and collaboration.

12. Export Options
    - Export to PDF or image for offline use.

13. Keyboard Shortcuts
    - Show/Hide Sidebar: Ctrl+Alt+T
    - Duplicate Sheet: Ctrl + D
    - Undo: Ctrl + Z
    - Redo: Ctrl + Y

14. Performance Optimization
    - Use extracts instead of live connections for faster performance.
    - Optimize calculations and filters to improve dashboard loading times.

Best Resources to learn Tableau: https://whatsapp.com/channel/0029VasYW1V5kg6z4EHOHG1t

Hope you'll like it

Share with credits: https://news.1rj.ru/str/sqlspecialist

Hope it helps :)
SQL interview questions with answers 😄👇

1. Question: What is SQL?

   Answer: SQL (Structured Query Language) is a programming language designed for managing and manipulating relational databases. It is used to query, insert, update, and delete data in databases.

2. Question: Differentiate between SQL and MySQL.

   Answer: SQL is a language for managing relational databases, while MySQL is an open-source relational database management system (RDBMS) that uses SQL as its language.

3. Question: Explain the difference between INNER JOIN and LEFT JOIN.

   Answer: INNER JOIN returns rows when there is a match in both tables, while LEFT JOIN returns all rows from the left table and the matched rows from the right table, filling in with NULLs for non-matching rows.

4. Question: How do you remove duplicate records from a table?

   Answer: Use the DISTINCT keyword in a SELECT statement to retrieve unique records. For example: SELECT DISTINCT column1, column2 FROM table;

5. Question: What is a subquery in SQL?

   Answer: A subquery is a query nested inside another query. It can be used to retrieve data that will be used in the main query as a condition to further restrict the data to be retrieved.

6. Question: Explain the purpose of the GROUP BY clause.

   Answer: The GROUP BY clause is used to group rows that have the same values in specified columns into summary rows, like when using aggregate functions such as COUNT, SUM, AVG, etc.

7. Question: How can you add a new record to a table?

   Answer: Use the INSERT INTO statement. For example: INSERT INTO table_name (column1, column2) VALUES (value1, value2);

8. Question: What is the purpose of the HAVING clause?

   Answer: The HAVING clause is used in combination with the GROUP BY clause to filter the results of aggregate functions based on a specified condition.

9. Question: Explain the concept of normalization in databases.

   Answer: Normalization is the process of organizing data in a database to reduce redundancy and improve data integrity. It involves breaking down tables into smaller, related tables.

10. Question: How do you update data in a table in SQL?

    Answer: Use the UPDATE statement to modify existing records in a table. For example: UPDATE table_name SET column1 = value1 WHERE condition;
7👏2
30 Days Python Roadmap for Data Analysts 👆
8
Data Analyst Roadmap

Like if it helps ❤️
12👍1
Core data science concepts you should know:

🔢 1. Statistics & Probability

Denoscriptive statistics: Mean, median, mode, standard deviation, variance

Inferential statistics: Hypothesis testing, confidence intervals, p-values, t-tests, ANOVA

Probability distributions: Normal, Binomial, Poisson, Uniform

Bayes' Theorem

Central Limit Theorem


📊 2. Data Wrangling & Cleaning

Handling missing values

Outlier detection and treatment

Data transformation (scaling, encoding, normalization)

Feature engineering

Dealing with imbalanced data


📈 3. Exploratory Data Analysis (EDA)

Univariate, bivariate, and multivariate analysis

Correlation and covariance

Data visualization tools: Matplotlib, Seaborn, Plotly

Insights generation through visual storytelling


🤖 4. Machine Learning Fundamentals

Supervised Learning: Linear regression, logistic regression, decision trees, SVM, k-NN

Unsupervised Learning: K-means, hierarchical clustering, PCA

Model evaluation: Accuracy, precision, recall, F1-score, ROC-AUC

Cross-validation and overfitting/underfitting

Bias-variance tradeoff


🧠 5. Deep Learning (Basics)

Neural networks: Perceptron, MLP

Activation functions (ReLU, Sigmoid, Tanh)

Backpropagation

Gradient descent and learning rate

CNNs and RNNs (intro level)


🗃️ 6. Data Structures & Algorithms (DSA)

Arrays, lists, dictionaries, sets

Sorting and searching algorithms

Time and space complexity (Big-O notation)

Common problems: string manipulation, matrix operations, recursion


💾 7. SQL & Databases

SELECT, WHERE, GROUP BY, HAVING

JOINS (inner, left, right, full)

Subqueries and CTEs

Window functions

Indexing and normalization


📦 8. Tools & Libraries

Python: pandas, NumPy, scikit-learn, TensorFlow, PyTorch

R: dplyr, ggplot2, caret

Jupyter Notebooks for experimentation

Git and GitHub for version control


🧪 9. A/B Testing & Experimentation

Control vs. treatment group

Hypothesis formulation

Significance level, p-value interpretation

Power analysis


🌐 10. Business Acumen & Storytelling

Translating data insights into business value

Crafting narratives with data

Building dashboards (Power BI, Tableau)

Knowing KPIs and business metrics

React ❤️ for more
11
Steps to become a data analyst

Learn the Basics of Data Analysis:
Familiarize yourself with foundational concepts in data analysis, statistics, and data visualization. Online courses and textbooks can help.
Free books & other useful data analysis resources - https://news.1rj.ru/str/learndataanalysis

Develop Technical Skills:
Gain proficiency in essential tools and technologies such as:

SQL: Learn how to query and manipulate data in relational databases.
Free Resources- @sqlanalyst

Excel: Master data manipulation, basic analysis, and visualization.
Free Resources- @excel_analyst

Data Visualization Tools: Become skilled in tools like Tableau, Power BI, or Python libraries like Matplotlib and Seaborn.
Free Resources- @PowerBI_analyst

Programming: Learn a programming language like Python or R for data analysis and manipulation.
Free Resources- @pythonanalyst

Statistical Packages: Familiarize yourself with packages like Pandas, NumPy, and SciPy (for Python) or ggplot2 (for R).

Hands-On Practice:
Apply your knowledge to real datasets. You can find publicly available datasets on platforms like Kaggle or create your datasets for analysis.

Build a Portfolio:
Create data analysis projects to showcase your skills. Share them on platforms like GitHub, where potential employers can see your work.

Networking:
Attend data-related meetups, conferences, and online communities. Networking can lead to job opportunities and valuable insights.

Data Analysis Projects:
Work on personal or freelance data analysis projects to gain experience and demonstrate your abilities.

Job Search:
Start applying for entry-level data analyst positions or internships. Look for job listings on company websites, job boards, and LinkedIn.
Jobs & Internship opportunities: @getjobss

Prepare for Interviews:
Practice common data analyst interview questions and be ready to discuss your past projects and experiences.

Continual Learning:
The field of data analysis is constantly evolving. Stay updated with new tools, techniques, and industry trends.

Soft Skills:
Develop soft skills like critical thinking, problem-solving, communication, and attention to detail, as they are crucial for data analysts.

Never ever give up:
The journey to becoming a data analyst can be challenging, with complex concepts and technical skills to learn. There may be moments of frustration and self-doubt, but remember that these are normal parts of the learning process. Keep pushing through setbacks, keep learning, and stay committed to your goal.

ENJOY LEARNING 👍👍
3🔥2👏1
Data Analyst: Analyzes data to provide insights and reports for decision-making.

Data Scientist: Builds models to predict outcomes and uncover deeper insights from data.

Data Engineer: Creates and maintains the systems that store and process data.
5👍1
If you want to Excel in Data Science and become an expert, master these essential concepts:

Core Data Science Skills:

• Python for Data Science – Pandas, NumPy, Matplotlib, Seaborn
• SQL for Data Extraction – SELECT, JOIN, GROUP BY, CTEs, Window Functions
• Data Cleaning & Preprocessing – Handling missing data, outliers, duplicates
• Exploratory Data Analysis (EDA) – Visualizing data trends

Machine Learning (ML):

• Supervised Learning – Linear Regression, Decision Trees, Random Forest
• Unsupervised Learning – Clustering, PCA, Anomaly Detection
• Model Evaluation – Cross-validation, Confusion Matrix, ROC-AUC
• Hyperparameter Tuning – Grid Search, Random Search

Deep Learning (DL):

• Neural Networks – TensorFlow, PyTorch, Keras
• CNNs & RNNs – Image & sequential data processing
• Transformers & LLMs – GPT, BERT, Stable Diffusion

Big Data & Cloud Computing:

• Hadoop & Spark – Handling large datasets
• AWS, GCP, Azure – Cloud-based data science solutions
• MLOps – Deploy models using Flask, FastAPI, Docker

Statistics & Mathematics for Data Science:

• Probability & Hypothesis Testing – P-values, T-tests, Chi-square
• Linear Algebra & Calculus – Matrices, Vectors, Derivatives
• Time Series Analysis – ARIMA, Prophet, LSTMs

Real-World Applications:

• Recommendation Systems – Personalized AI suggestions
• NLP (Natural Language Processing) – Sentiment Analysis, Chatbots
• AI-Powered Business Insights – Data-driven decision-making

React with ❤️ for more
9👍1
Ever wondered what the difference is between a Data Analyst and a Data Scientist? Both roles are in high demand, but they tackle data in different ways.
10
SQL Cheatsheet 📝

This SQL cheatsheet is designed to be your quick reference guide for SQL programming. Whether you’re a beginner learning how to query databases or an experienced developer looking for a handy resource, this cheatsheet covers essential SQL topics.

1. Database Basics
- CREATE DATABASE db_name;
- USE db_name;

2. Tables
- Create Table: CREATE TABLE table_name (col1 datatype, col2 datatype);
- Drop Table: DROP TABLE table_name;
- Alter Table: ALTER TABLE table_name ADD column_name datatype;

3. Insert Data
- INSERT INTO table_name (col1, col2) VALUES (val1, val2);

4. Select Queries
- Basic Select: SELECT * FROM table_name;
- Select Specific Columns: SELECT col1, col2 FROM table_name;
- Select with Condition: SELECT * FROM table_name WHERE condition;

5. Update Data
- UPDATE table_name SET col1 = value1 WHERE condition;

6. Delete Data
- DELETE FROM table_name WHERE condition;

7. Joins
- Inner Join: SELECT * FROM table1 INNER JOIN table2 ON table1.col = table2.col;
- Left Join: SELECT * FROM table1 LEFT JOIN table2 ON table1.col = table2.col;
- Right Join: SELECT * FROM table1 RIGHT JOIN table2 ON table1.col = table2.col;

8. Aggregations
- Count: SELECT COUNT(*) FROM table_name;
- Sum: SELECT SUM(col) FROM table_name;
- Group By: SELECT col, COUNT(*) FROM table_name GROUP BY col;

9. Sorting & Limiting
- Order By: SELECT * FROM table_name ORDER BY col ASC|DESC;
- Limit Results: SELECT * FROM table_name LIMIT n;

10. Indexes
- Create Index: CREATE INDEX idx_name ON table_name (col);
- Drop Index: DROP INDEX idx_name;

11. Subqueries
- SELECT * FROM table_name WHERE col IN (SELECT col FROM other_table);

12. Views
- Create View: CREATE VIEW view_name AS SELECT * FROM table_name;
- Drop View: DROP VIEW view_name;
5🔥1