[databases]
https://jack-vanlightly.com/analyses/2024/4/24/understanding-apache-hudi-consistency-model-part-1
https://jack-vanlightly.com/analyses/2024/4/24/understanding-apache-hudi-consistency-model-part-1
Jack Vanlightly
Understanding Apache Hudi's Consistency Model Part 1 — Jack Vanlightly
Apache Hudi is one of the leading three table formats (Apache Iceberg and Delta Lake being the other two). Whereas Apache Iceberg internals are relatively easy to understand, I found that Apache Hudi was more complex and hard to reason about. As a distributed…
[grafana]
I found this usage of Grafana quite inspiring:
https://grafana.com/about/events/grafanacon/2024/grafana-used-to-monitor-japan-slim-moon-lander/
I found this usage of Grafana quite inspiring:
https://grafana.com/about/events/grafanacon/2024/grafana-used-to-monitor-japan-slim-moon-lander/
Grafana Labs
Grafana in space: Monitoring Japan's SLIM moon lander in real time | Grafana Labs
JAXA Associate Senior Researcher Satoshi Nakahira presents an overview of the ISAS space science missions and the SLIM lunar lander.
❤🔥1
[databases]
https://www.uber.com/en-NL/blog/auto-categorizing-data-through-ai-ml/
Data categorization–the process of classifying data based on its characteristics and essence–is a foundational pillar of any privacy or security program. The effectiveness of fine-grained data categorization is pivotal in implementing privacy and security controls, such as access policies and encryption, as well as managing the lifecycle of data assets, encompassing retention and deletion. This blog delves into Uber’s approach to achieving data categorization at scale by leveraging various AI/ML techniques.
https://www.uber.com/en-NL/blog/auto-categorizing-data-through-ai-ml/
Data categorization–the process of classifying data based on its characteristics and essence–is a foundational pillar of any privacy or security program. The effectiveness of fine-grained data categorization is pivotal in implementing privacy and security controls, such as access policies and encryption, as well as managing the lifecycle of data assets, encompassing retention and deletion. This blog delves into Uber’s approach to achieving data categorization at scale by leveraging various AI/ML techniques.
[llm][usecase][text-to-sql]
https://medium.com/pinterest-engineering/how-we-built-text-to-sql-at-pinterest-30bad30dabff
https://medium.com/pinterest-engineering/how-we-built-text-to-sql-at-pinterest-30bad30dabff
Medium
How we built Text-to-SQL at Pinterest
Adam Obeng | Data Scientist, Data Platform Science; J.C. Zhong | Tech Lead, Analytics Platform; Charlie Gu | Sr. Manager, Engineering
👍1
[news][ai][hackaton]
Great projects out of the Mistral AI hackaton which took place in Paris.
https://x.com/alexreibman/status/1796349663710511114?s=46&t=eNN3Y-GKeBSlFyyj1ozvgg
Great projects out of the Mistral AI hackaton which took place in Paris.
https://x.com/alexreibman/status/1796349663710511114?s=46&t=eNN3Y-GKeBSlFyyj1ozvgg
👍3
[distributed systems][kafka]
Kora: A Cloud-Native Event Streaming Platform For Kafka
https://www.vldb.org/pvldb/vol16/p3822-povzner.pdf
Kora: A Cloud-Native Event Streaming Platform For Kafka
https://www.vldb.org/pvldb/vol16/p3822-povzner.pdf
[memory]
What Every Programmer Should Know About Memory
This paper explains the structure of memory subsys- tems in use on modern commodity hardware, illustrating why CPU caches were developed, how they work, and what programs should do to achieve optimal performance by utilizing them.
https://people.freebsd.org/~lstewart/articles/cpumemory.pdf
What Every Programmer Should Know About Memory
This paper explains the structure of memory subsys- tems in use on modern commodity hardware, illustrating why CPU caches were developed, how they work, and what programs should do to achieve optimal performance by utilizing them.
https://people.freebsd.org/~lstewart/articles/cpumemory.pdf
🔥2
[learning][distributed systems]
Colleague shared an amazing thing you can try to study distributed systems by building.
https://fly.io/dist-sys/1/
Colleague shared an amazing thing you can try to study distributed systems by building.
https://fly.io/dist-sys/1/
Fly
Challenge #1: Echo
Documentation and guides from the team at Fly.io.
🔥3
[video]
How computers work
https://www.youtube.com/watch?v=HaBMAD-Dr8M&list=PLnAxReCloSeTJc8ZGogzjtCtXl_eE6yzA&index=2
How computers work
https://www.youtube.com/watch?v=HaBMAD-Dr8M&list=PLnAxReCloSeTJc8ZGogzjtCtXl_eE6yzA&index=2
YouTube
Logic gates - From transistors to logic gates NAND, AND, NOR, OR, NOT, XOR how computers work PART 1
Logic Gates - This video describes how the main logic gates are built starting from transistors in C-MOS technology, mostly used in CPU and RAM Memory. We see the NAND, AND, OR, NOR, NOT, XOR gates. At the end we see how built a three inputs AND gate and…
👍1🔥1
[distributed systems][paper]
Event-Based Programming without Inversion of Control
https://lampwww.epfl.ch/~odersky/papers/jmlc06.pdf
Event-Based Programming without Inversion of Control
https://lampwww.epfl.ch/~odersky/papers/jmlc06.pdf
[asyncio][python]
https://www.roguelynn.com/words/asyncio-we-did-it-wrong/
https://www.roguelynn.com/words/asyncio-we-did-it-wrong/
roguelynn
asyncio: We Did It Wrong
"The concurrent Python programmer’s dream", the answer to everyone's asynchronous prayers. The `asyncio` module has various layers of abstraction allowing developers as much control as they need and are comfortable with. But it's easy to get lulled into a…