DE & ML Digest
@mlbigdata
120
subscribers
9
photos
1
video
34.7K
links
Collection of all articles on Data Engineering and Machine Learning
Contact -
@luminousmen
Download Telegram
Join
DE & ML Digest
120 subscribers
DE & ML Digest
Big Data
Классификация кассовых чеков
Хабр
Классификация кассовых чеков
Банки получают содержание кассовых чеков клиентов по транзакциям, совершенных по собственным картам через Операторов Фискальных Данных с согласия клиента. Данные приходят в сыром текстовом формате,...
DE & ML Digest
Big Data
What is the Microsoft Azure Portal?
reddit
What is the Microsoft Azure Portal?
Posted in r/bigdata by u/Big_Data_Path • 1 point and 0 comments
DE & ML Digest
Big Data
Почему на удалении от крупных городов избиратели ходят на участки охотнее и голосуют за партию власти
Хабр
Почему на удалении от крупных городов избиратели ходят на участки охотнее и голосуют за партию власти
Иногда в СМИ можно встретить мнение, что отношение к выборам и электоральные предпочтения людей, проживающих в глубинке и в крупных городах, сильно отличаются. Возможно, источником такой информации...
DE & ML Digest
Big Data
Кастомные агрегаторы в Spark SQL
Хабр
Кастомные агрегаторы в Spark SQL
Данная статья является гайдом по использованию кастомных агрегаторов в Spark SQL API. Она “выросла” из моих заметок, которые я делал себе с начала работы со Spark. Сейчас, по мере накопления опыта,...
DE & ML Digest
Big Data
Cheapest cloud storage for people looking to store large amounts of data on budget.
reddit
Cheapest cloud storage for people looking to store large amounts...
Hello everyone, I'm very happy to announce that stucloud is finally ready for beta testing ! What's stucloud ? It's a cloud storage on budget...
DE & ML Digest
Big Data
ElasticSearch: отказоустойчивый сервер отказал
Хабр
ElasticSearch: отказоустойчивый сервер отказал
Всем привет, меня зовут Илья, я работаю в компании DINS на должности инженера отдела мониторинга. В этой статье расскажу о нашей боли при работе с ElasticSearch. Мне не удалось найти решение этой...
DE & ML Digest
Big Data
Python Financial Stock analysis - Stock Data using Yahoo Finance and Jupyter
reddit
Python Financial Stock analysis - Stock Data using Yahoo Finance...
Posted in r/bigdata by u/Best_Fold_2554 • 17 points and 0 comments
DE & ML Digest
Big Data
Converting Data into Insights: 4 Steps To Improve Your Business Efficiency
reddit
Converting Data into Insights: 4 Steps To Improve Your Business...
Posted in r/bigdata by u/Big_Data_Path • 1 point and 0 comments
DE & ML Digest
Big Data
Refactoring, Plug-in, Performance Improves By 20 times, Apache DolphinScheduler 2.0 alpha Release Highlights Check!
reddit
Refactoring, Plug-in, Performance Improves By 20 times, Apache...
Posted in r/bigdata by u/DolphinScheduler1 • 2 points and 0 comments
DE & ML Digest
Big Data
Big Data University Assignment
reddit
Big Data University Assignment
I'm writing a university assignment where I have to select a company that is currently facing some kind of strategic challenge or issue and then...
DE & ML Digest
Big Data
Schedule your DAGs like never before with Airflow timetables!
reddit
Schedule your DAGs like never before with Airflow timetables!
Posted in r/bigdata by u/marclamberti • 1 point and 0 comments
DE & ML Digest
Big Data
7 Ways the IoT Makes Farming More Precise
reddit
7 Ways the IoT Makes Farming More Precise
Posted in r/bigdata by u/Big_Data_Path • 1 point and 0 comments
DE & ML Digest
Big Data
Data Challenges in Big Data
Blog | iamluminousmen
Data Challenges in Big Data
Working with data in big data always involves some kind of complexities related to its size, storage, and processing. What skills are needed to deal with them?
DE & ML Digest
Big Data
Operational Challenges in Big Data
DE & ML Digest
Big Data
Analytical Challenges in Big Data
Blog | iamluminousmen
Analytical Challenges in Big Data
Analytics and BI is an approach for making data-driven decisions and provides information that can help businesses
DE & ML Digest
Big Data
Management Challenges in Big Data
Blog | iamluminousmen
Management Challenges in Big Data
Management challenges tackle privacy, security, governance, and data/metadata management. Another area of knowledge directly connected to the big data
DE & ML Digest
Big Data
Machine Learning types
Blog | iamluminousmen
Machine Learning types
Machine Learning is based on the idea that analytic systems can learn to identify patterns and make decisions with minimal human involvement
DE & ML Digest
Big Data
ACID vs BASE: Comparison of two Design Philosophies
Blog | iamluminousmen
ACID vs BASE: Comparison of two Design Philosophies
Discover the differences between ACID and BASE design philosophies - from strong consistency to eventual consistency. Find out which suits your project better!
DE & ML Digest
Big Data
Architecturally Significant Requirements
Blog | iamluminousmen
Architecturally Significant Requirements
Discover the crucial Architecturally Significant Requirements (ASR) for distributed systems, including Availability, Durability, Resiliency, Reliability, and Scalability. Learn how these factors impact system design and performance.
DE & ML Digest
Big Data
CAP and PACELC theorems in plain English
Blog | iamluminousmen
CAP and PACELC Theorems in Plain English
Understand the CAP and PACELC theorems in distributed systems. Learn how to navigate tradeoffs between consistency, availability, and partition tolerance for optimal system design.
DE & ML Digest
Big Data
Explaining the mechanics of Spark caching
Blog | iamluminousmen
Explaining the mechanics of Spark caching
Caching... There is so much in that word - the pain of invalidation and the joy of reusing computation. In Spark, this is known as an optimization technique
TWeb.init({scrollToPost:'mlbigdata/34539'});