DE & ML Digest
@mlbigdata
120 subscribers, 9 photos, 1 video, 34.7K links
Collection of all articles on Data Engineering and Machine Learning
Contact: @luminousmen
Big Data
Refactoring, Plug-in, Performance Improves By 20 times, Apache DolphinScheduler 2.0 alpha Release Highlights Check!
reddit
Posted in r/bigdata by u/DolphinScheduler1 • 2 points and 0 comments
Big Data
Big Data University Assignment
reddit
I'm writing a university assignment where I have to select a company that is currently facing some kind of strategic challenge or issue and then...
Big Data
Schedule your DAGs like never before with Airflow timetables!
reddit
Posted in r/bigdata by u/marclamberti • 1 point and 0 comments
Big Data
7 Ways the IoT Makes Farming More Precise
reddit
Posted in r/bigdata by u/Big_Data_Path • 1 point and 0 comments
Big Data
Data Challenges in Big Data
Blog | iamluminousmen
Working with big data always involves complexities related to its size, storage, and processing. What skills are needed to deal with them?
Big Data
Operational Challenges in Big Data
Big Data
Analytical Challenges in Big Data
Blog | iamluminousmen
Analytics and BI is an approach for making data-driven decisions and provides information that can help businesses
Big Data
Management Challenges in Big Data
Blog | iamluminousmen
Management challenges tackle privacy, security, governance, and data/metadata management. Another area of knowledge directly connected to the big data
Big Data
Machine Learning types
Blog | iamluminousmen
Machine Learning is based on the idea that analytic systems can learn to identify patterns and make decisions with minimal human involvement
Big Data
ACID vs BASE: Comparison of two Design Philosophies
Blog | iamluminousmen
Discover the differences between ACID and BASE design philosophies - from strong consistency to eventual consistency. Find out which suits your project better!
Big Data
Architecturally Significant Requirements
Blog | iamluminousmen
Discover the crucial Architecturally Significant Requirements (ASR) for distributed systems, including Availability, Durability, Resiliency, Reliability, and Scalability. Learn how these factors impact system design and performance.
Big Data
CAP and PACELC Theorems in Plain English
Blog | iamluminousmen
Understand the CAP and PACELC theorems in distributed systems. Learn how to navigate tradeoffs between consistency, availability, and partition tolerance for optimal system design.
Big Data
Explaining the mechanics of Spark caching
Blog | iamluminousmen
Caching... There is so much in that word - the pain of invalidation and the joy of reusing computation. In Spark, this is known as an optimization technique
Big Data
HDFS vs Cloud-based Object storage (S3)
Blog | iamluminousmen
I am very annoyed that all sorts of big data engineers confuse S3 and HDFS, assuming that S3 is the same as HDFS. That’s not true.
Big Data
Get Hive count in seconds
Big Data
What is Serverless Architecture and what are its benefits?
Blog | iamluminousmen
There is so much hype around serverless architectures, but what do they really bring to the table? Are they the next standard in application development?
Big Data
Things to consider while running Google Cloud Dataproc
Blog | iamluminousmen
There are many pitfalls that inexperienced engineers may encounter when building pipelines based on Cloud Dataproc. Let's look into them.
Big Data
Spark Tips. Caching
Blog | iamluminousmen
Another batch of tips on Apache Spark usage, this time about caching and checkpointing data.
Big Data
AWS Cloud Builders – Career Transformation & Personal Growth
Amazon
Long-time readers of this blog know that I firmly believe in the power of education to improve lives. AWS Training and Certification equips people and organizations around the world with cloud computing education to build and validate cloud computing skills.…
Big Data
Amazon QuickSight Q – Business Intelligence Using Natural Language Questions
Amazon
Making sense of business data so that you can get value out of it is worthwhile yet still challenging. Even though the term Business Intelligence (BI) has been around since the mid-1800s (according to Wikipedia) adoption of contemporary BI tools within enterprises…
Big Data
New for Amazon Connect: Voice ID, Wisdom, and Outbound Communications
Amazon
During the AWS re:Invent conference last year, I wrote about new capabilities added to Amazon Connect. Today, I am happy to announce the general availability of two of these capabilities, Voice ID and Wisdom, and the launch of a new one. High-volume outbound…