DE & ML Digest
@mlbigdata
120
subscribers
9
photos
1
video
34.7K
links
Collection of all articles on Data Engineering and Machine Learning
Contact -
@luminousmen
Download Telegram
Join
DE & ML Digest
120 subscribers
DE & ML Digest
ML
What’s the difference between a Data Scientist and a Data Analyst?
KDnuggets
What’s the difference between a Data Scientist and a Data Analyst? - KDnuggets
Find out the major differences between a Data Analyst and a Data Scientist, and read the author's pointers on what they would recommend you to do if you wish to make that transition from Data Analyst to Data Scientist.
DE & ML Digest
ML
Cartoon: Data Science for Thanksgiving
KDnuggets
Cartoon: Data Science for Thanksgiving - KDnuggets
A classic KDnuggets Thanksgiving cartoon examines the predicament of one group of fowl Data Scientists.
DE & ML Digest
ML
Top 4 Data Integration Tools for Modern Enterprises
KDnuggets
Top 4 Data Integration Tools for Modern Enterprises - KDnuggets
Maintaining a centralized data repository can simplify your business intelligence initiatives. Here are four data integration tools that can make data more valuable for modern enterprises.
DE & ML Digest
ML
Can You Become a Data Scientist Online?
KDnuggets
Can You Become a Data Scientist Online? - KDnuggets
Until November 29th, you can join over 1.5 million students around the globe and gain the skills of successful data science professionals with unlimited annual access to the 365 Data Science Program at 72% OFF. Read on to learn more!
DE & ML Digest
Big Data
Management Challenges in Big Data
Blog | iamluminousmen
Management Challenges in Big Data
Management challenges tackle privacy, security, governance, and data/metadata management. Another area of knowledge directly connected to the big data
DE & ML Digest
Big Data
ACID vs BASE: Comparison of two Design Philosophies
Blog | iamluminousmen
ACID vs BASE: Comparison of two Design Philosophies
Discover the differences between ACID and BASE design philosophies - from strong consistency to eventual consistency. Find out which suits your project better!
DE & ML Digest
Big Data
Data Challenges in Big Data
Blog | iamluminousmen
Data Challenges in Big Data
Working with data in big data always involves some kind of complexities related to its size, storage, and processing. What skills are needed to deal with them?
DE & ML Digest
Big Data
Analytical Challenges in Big Data
Blog | iamluminousmen
Analytical Challenges in Big Data
Analytics and BI is an approach for making data-driven decisions and provides information that can help businesses
DE & ML Digest
Big Data
Architecturally Significant Requirements
Blog | iamluminousmen
Architecturally Significant Requirements
Discover the crucial Architecturally Significant Requirements (ASR) for distributed systems, including Availability, Durability, Resiliency, Reliability, and Scalability. Learn how these factors impact system design and performance.
DE & ML Digest
Big Data
Machine Learning types
Blog | iamluminousmen
Machine Learning types
Machine Learning is based on the idea that analytic systems can learn to identify patterns and make decisions with minimal human involvement
DE & ML Digest
Big Data
Operational Challenges in Big Data
DE & ML Digest
Big Data
CAP and PACELC theorems in plain English
Blog | iamluminousmen
CAP and PACELC Theorems in Plain English
Understand the CAP and PACELC theorems in distributed systems. Learn how to navigate tradeoffs between consistency, availability, and partition tolerance for optimal system design.
DE & ML Digest
Big Data
Explaining the mechanics of Spark caching
Blog | iamluminousmen
Explaining the mechanics of Spark caching
Caching... There is so much in that word - the pain of invalidation and the joy of reusing computation. In Spark, this is known as an optimization technique
DE & ML Digest
Big Data
HDFS vs Cloud-based Object storage(S3)
Blog | iamluminousmen
HDFS vs Cloud-based Object storage(S3)
I am very annoyed that all sorts of big data engineers confuse S3 and HDFS systems, assuming that S3 is the same as HDFS. That’s not true.
DE & ML Digest
Big Data
Get Hive count in seconds
DE & ML Digest
Big Data
What is Serverless Architecture and what are its benefits?
Blog | iamluminousmen
What is Serverless Architecture and what are its benefits?
So much hype around serverless architectures but what it's really bringing to the table for us? Is it the next standard in application development?
DE & ML Digest
Big Data
Spark tips. Caching
Blog | iamluminousmen
Spark Tips. Caching
Another portion of tips to Apache Spark usage, now it's about caching and checkpointing data
DE & ML Digest
Big Data
Things to consider while running Google Cloud Dataproc
Blog | iamluminousmen
Things to consider while running Google Cloud Dataproc
There are many pitfalls that inexperienced engineers may encounter when building pipelines based on Cloud Dataproc, let's look into them.
DE & ML Digest
Big Data
MLflow for Bayesian Experiment Tracking
Databricks
MLflow for Bayesian Experiment Tracking
Learn how to use MLflow for performing reproducible Bayesian experiments.
DE & ML Digest
Big Data
Introducing Apache Spark
™
3.2
Databricks
Introducing Apache Spark
™
3.2
Learn more about the latest release of Apache Spark
™
, version 3.2, including pandas API on Spark, Adaptive Query Execution, and ANSI mode and how you can begin using it through Databricks Runtime 10.0.
DE & ML Digest
Big Data
GPU-accelerated Sentiment Analysis Using Pytorch and Huggingface on Databricks
Databricks
GPU-accelerated Sentiment Analysis Using Pytorch and Huggingface on Databricks
Learn more about the Pytorch-based GPU-accelerated sentiment analysis package from Huggingface and how it leverages the Databricks platform to simplify and scale the analysis of text for sentiment.
TWeb.init({scrollToPost:'mlbigdata/34731'});