DE & ML Digest
@mlbigdata
120
subscribers
9
photos
1
video
34.7K
links
Collection of all articles on Data Engineering and Machine Learning
Contact -
@luminousmen
Download Telegram
Join
DE & ML Digest
120 subscribers
DE & ML Digest
ML
Using python to solve a complicated mathematical problem
Medium
Using python to solve a complicated mathematical problem
A step-by-step guide to getting three biggest rhombus sums in a matrix
DE & ML Digest
ML
Running Spark on Kubernetes: Approaches and Workflow
Medium
Running Spark on Kubernetes: Approaches and Workflow
Best ways to run Spark jobs on Kubernetes for development, data exploration and in production
DE & ML Digest
ML
Python Code Formatting Made Simple With Git Pre-commit Hooks
Medium
Python Code Formatting Made Simple With Git Pre-commit Hooks
Write code that everyone will fall in love with.
DE & ML Digest
ML
Blind Chess Log [0]
Medium
Blind Chess Log [0]
Monte Carlo Tree Search for partially-observed environments
DE & ML Digest
ML
Avoid These Five Behaviors That Make You Look Like A Data Novice
KDnuggets
Avoid These Five Behaviors That Make You Look Like A Data Novice - KDnuggets
If you are new to the Data Science industry or a well-versed veteran in all things data and analytics, there are always key pitfalls that each of us can easily slide into if we are not careful. These behaviors not only make us appear like novices, but they…
DE & ML Digest
ML
Version Control your Large Datasets using Google Drive
Medium
Version Control your Large Datasets using Google Drive
Making reproducible datasets possible
DE & ML Digest
ML
Statistics in Python — Using ANOVA for Feature Selection
Medium
Statistics in Python — Using ANOVA for Feature Selection
Understand how to use ANOVA for comparing between a categorical and numerical variable
DE & ML Digest
Big Data
Data transformation platform Metrolink.ai raises $22M
VentureBeat
Data transformation platform Metrolink.ai raises $22M
Metrolink.ai, a startup developing a data transformation and management platform, has raised $22 million in venture funding.
DE & ML Digest
Big Data
Call analytics provider CallMiner acquires screen capture platform OrecX
VentureBeat
Call analytics provider CallMiner acquires screen capture platform OrecX
Call analytics company CallMiner has acquired OrecX, an audio and screen capture platform, for an undisclosed sum.
DE & ML Digest
Big Data
Creating an IP Lookup Table of Activities in a SIEM Architecture
Databricks
How to Create a SIEM IP Lookup Table With DHCP and VPN Logs
This blog post shows you how to combine and parse Cisco ISE and Infoblox DHCP logs to create an IP Lookup table detailing your network activity and timeline.
DE & ML Digest
ML
A Guide to Machine Learning Pipelines and Orchest
Analytics Vidhya
Orchest | A Guide to Machine Learning Pipelines and Orchest
Orchest is a data pipeline ecosystem that does not require DAGs or any third-party integration with an environment that is simple to navigate
DE & ML Digest
ML
How to Include R and ggplot in a Python Notebook
Medium
How to Include R and ggplot in a Python Notebook
You can mix an match Python and R in the same Jupyter Notebook — here’s how
DE & ML Digest
ML
Training BPE, WordPiece, and Unigram Tokenizers from Scratch using Hugging Face
Medium
Training BPE, WordPiece, and Unigram Tokenizers from Scratch using Hugging Face
Comparing the tokens generated by SOTA tokenization algorithms using Hugging Face’s tokenizers package
DE & ML Digest
ML
Text Analysis in Natural Language Processing using Julia
Analytics Vidhya
Text Analysis in Natural Language Processing using Julia
The article majorly focuses on how to make you comfortable with the outline of Julia text analysis tools with brief explanations
DE & ML Digest
ML
How to connect DataGrip to Apache Druid
Medium
How to connect DataGrip to Apache Druid
A simple workaround for querying Apache Druid using a traditional SQL editor
DE & ML Digest
ML
A Step By Step Implementation of Principal Component Analysis
Medium
A Step By Step Implementation of Principal Component Analysis
A step-by-step tutorial to explain the working of PCA and implementing it from scratch in python
DE & ML Digest
ML
A checklist for submitting your research to arXiv
Medium
A Checklist for Submitting Your Research to arXiv
21 questions for PDF submission to the popular open-access archive of scholarly articles.
DE & ML Digest
ML
Automated Exploratory Data Analysis
Medium
Automated Exploratory Data Analysis
Using the python edatk library to find insights in your data
DE & ML Digest
ML
Real Time Image Segmentation Using 5 Lines of Code
KDnuggets
Real Time Image Segmentation Using 5 Lines of Code
PixelLib Library is a library created to allow easy integration of object segmentation in images and videos using few lines of python code. PixelLib now provides support for PyTorch backend to perform faster, more accurate segmentation and extraction of objects…
DE & ML Digest
ML
Data lake in S3 from MongoDB
Medium
Data lake in S3 from MongoDB
Using Python to upload MongoDB data to AWS S3 to build a data lake
DE & ML Digest
ML
Interactive Exploratory Data Analysis that Generates Python
Medium
Interactive Exploratory Data Analysis that Generates Python
A practical guide to interactive Exploratory Data Analysis on the Avocado dataset
TWeb.init({scrollToPost:'mlbigdata/34059'});