Data Science & Machine Learning – Telegram
Data Science & Machine Learning
72.5K subscribers
770 photos
2 videos
68 files
680 links
Join this channel to learn data science, artificial intelligence and machine learning with funny quizzes, interesting projects and amazing resources for free

For collaborations: @love_data
Download Telegram
Fake_News_Detection_Machine_learning_project.rar
8.3 MB
Fake news Detection Machine Learning Project with 92%Accuracy
it contain compressed file in which "jupyter notebook file and dataset"
2👍1
dice_roll.py
445 B
🎲Dice_roll_Simulator_Gui with python in 2 minute 😊
numpy.pdf
1.4 MB
Data_science Numpy cheat sheet
1
Dimensionality reduction techniques

Singular Value Decomposition (SVD)
Principal Component Analysis (PCA)
Linear Discriminant Analysis (LDA)
T-distributed Stochastic Neighbor Embedding (t-SNE)
Autoencoders
Fourier and Wavelet Transforms
What is the curse of dimensionality? Why do we care about it?

Data in only one dimension is relatively tightly packed. Adding a dimension stretches the points across that dimension, pushing them further apart. Additional dimensions spread the data even further making high dimensional data extremely sparse. We care about it, because it is difficult to use machine learning in sparse spaces.
👍1
K-means vs DBScan ML Algorithm

DBScan is more robust to noise.
DBScan is better when the amount of clusters is difficult to guess.
K-means has a lower complexity, i.e. it will be much faster, especially with a larger amount of points.
1
Data Science & Machine Learning
Fake_News_Detection_Machine_learning_project.rar
Start working on any project if you are a beginner and want to grow your career as a data scientist
You will learn much more as you practice and work on projects from yourself
You can find dataset in this channel or go to kaggle to find any random dataset and just work on it
Learning concepts is fine but most of the learnings come from projects
I know that might feel boring at first time but as you move forward, it become interesting
1
👉The Ultimate Guide to the Pandas Library for Data Science in Python
👇👇

https://www.freecodecamp.org/news/the-ultimate-guide-to-the-pandas-library-for-data-science-in-python/amp/

A Visual Intro to NumPy and Data Representation
.
Link : 👇👇
https://jalammar.github.io/visual-numpy/

Matplotlib Cheatsheet 👇👇

https://github.com/rougier/matplotlib-cheatsheet

SQL Cheatsheet 👇👇

https://websitesetup.org/sql-cheat-sheet/
Seeing Theory : A visual introduction to probability and statistics

Link :👇👇
https://seeing-theory.brown.edu/

“The Projects You Should Do to Get a Data Science Job” by Ken Jee
👇👇
https://link.medium.com/Q2DnxSGRO6
Precision is one indicator of a machine learning model's performance – the quality of a positive prediction made by the model. Its formula would be?
Anonymous Quiz
43%
True Positive divided by actual yes
10%
True Positive divided by actual no
43%
True Positive divided by predicted yes
4%
True Positive divided by predicted no
👉A handy notebook on handling missing values

Link : 👇👇
https://www.kaggle.com/parulpandey/a-guide-to-handling-missing-values-in-python

A list of NLP Tutorials

Link : 👇👇
https://github.com/lyeoni/nlp-tutorial


“An Implementation and Explanation of the Random Forest in Python” by Will Koehrsen 👇👇
https://link.medium.com/GCWFv81v95

“How to analyse 100s of GBs of data on your laptop with Python” by Jovan Veljanoski 👇👇
https://link.medium.com/V8xS82Cax6
👍1
F-beta score is always
Anonymous Quiz
27%
Greater than 1
70%
Between 0 and 1
3%
Less than 0
Data Science & Machine Learning
What are precision, recall, and F1-score? Precision and recall are classification evaluation metrics: P = TP / (TP + FP) and R = TP / (TP + FN). Where TP is true positives, FP is false positives and FN is false negatives In both cases the score of 1 is…
Here is the explanation for the quiz

The F-beta score is the weighted harmonic mean of precision and recall, reaching its optimal value at 1 and its worst value at 0. The highest possible value of an F-score is 1.0, indicating perfect precision and recall, and the lowest possible value is 0, if either the precision or the recall is zero. The F-score is commonly used for evaluating information retrieval systems such as search engines, and also for many kinds of machine learning models, in particular in natural language processing.
What is the full form of LSTM?

Hint- LSTM algorithm is used for processing and making predictions based on time series data
Anonymous Quiz
7%
Long story total memory
72%
Long short-term memory
16%
Long short-term machine
4%
None of three
1