
[asyncio][python]
https://www.roguelynn.com/words/asyncio-we-did-it-wrong/

"The concurrent Python programmer’s dream", the answer to everyone's asynchronous prayers. The `asyncio` module has various layers of abstraction allowing developers as much control as they need and are comfortable with. But it's easy to get lulled into a…

635 views, 15:43

Engineer Readings

[paper][GC][state machine]
https://arxiv.org/html/2405.11182v1

In this paper, the authors quantify the overhead of running a state machine replication system for cloud systems written in a language with garbage collection (GC). To this end, they (1) design a canonical cloud system—a distributed, consensus-based, linearizable key-value store—from scratch, (2) implement it in C++, Java, Rust, and Go, and (3) evaluate the implementations under update-heavy and read-heavy workloads on AWS with different resource constraints, aiming to maximize throughput while maintaining low tail latency. The results show that GC incurs a non-trivial cost, even with ample memory. With limited memory, languages with manual memory management can achieve an order of magnitude higher throughput than those with GC on the same hardware. A key observation is that if a cloud system is expected to scale significantly, building it in a language with manual memory management, despite the higher development cost, may lead to substantial cloud cost savings in the long run.

🔥2

804 views, edited 18:53

Engineer Readings

[ethz][computer architecture][lectures]

https://safari.ethz.ch/architecture/fall2022/doku.php?id=schedule

🔥1

686 views, 08:26

Engineer Readings

[paper][ClickHouse]

This paper presents an overview of ClickHouse, a popular open-source OLAP database designed for high-performance analytics over petabyte-scale data sets with high ingestion rates. Its storage layer combines a data format based on traditional log-structured merge (LSM) trees with novel techniques for continuous transformation (e.g. aggregation, archiving) of historical data in the background. Queries are written in a convenient SQL dialect and processed by a state-of-the-art vectorized query execution engine with optional code compilation. ClickHouse makes aggressive use of pruning techniques to avoid evaluating irrelevant data in queries. Other data management systems can be integrated at the table function, table engine, or database engine level. Real-world benchmarks demonstrate that ClickHouse is amongst the fastest analytical databases on the market.

https://www.vldb.org/pvldb/vol17/p3731-schulze.pdf

👍2

775 views, 13:32

Engineer Readings

[cicd][uber]

https://www.uber.com/en-NL/blog/continuous-deployment/

“Uber’s business runs on a myriad of microservices. Ensuring that changes to all of these services are deployed safely and in a timely manner is critical. By utilizing continuous deployment to automate this process, we ensure that new features, library updates, and security patches are all delivered to production without unnecessary delays, improving the overall quality of code serving our business.
In this article, we share how we reimagined continuous deployment of microservices at Uber to improve our deployment automation and the user experience of managing microservices, while tackling some of the peculiar challenges of working with large monorepos with increasing commit volumes.”

👍3❤1

693 views, edited 07:24

Engineer Readings

[ai][moshi]

Moshi is made of three main components: Helium, a 7B language model trained on 2.1T tokens; Mimi, a neural audio codec that models semantic and acoustic information; and a new multi-stream architecture that jointly models audio from the user and Moshi on separate channels.

https://kyutai.org/Moshi.pdf

https://github.com/kyutai-labs/moshi

https://huggingface.co/kmhf

636 views, 07:58

Engineer Readings

[llm][comparison]

https://github.com/rasbt/LLMs-from-scratch/blob/main/ch05/07_gpt_to_llama/converting-gpt-to-llama2.ipynb

GitHub

LLMs-from-scratch/ch05/07_gpt_to_llama/converting-gpt-to-llama2.ipynb at main · rasbt/LLMs-from-scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step - rasbt/LLMs-from-scratch

879 views, 09:36

Engineer Readings

[book][MIT]
Mathematics for Computer Science

https://courses.csail.mit.edu/6.042/spring18/mcs.pdf

👍4

907 views, 14:55

Engineer Readings

[dbms]
Database management systems:
https://dsf.berkeley.edu/papers/fntdb07-architecture.pdf

👍1

784 views, 05:41

Engineer Readings

[vision language modeling]

https://scontent.fnic1-2.fna.fbcdn.net/v/t39.2365-6/447839882_313401218505975_3018145354897668074_n.pdf?_nc_cat=101&ccb=1-7&_nc_sid=3c67a6&_nc_ohc=cxdrHzez86cQ7kNvgEYdsnh&_nc_zt=14&_nc_ht=scontent.fnic1-2.fna&_nc_gid=AIm5oltX5H5ZDvBG6sQSiSS&oh=00_AYCzEpR145Byh-86b4MvcBd2P3AJd2cUyn5mPs7nqI1MBg&oe=671DBEE3

699 views, 18:06

Engineer Readings

[virtual machines][hypervisor]

https://x.com/chessman786/status/1855562661074968729?s=46&t=eNN3Y-GKeBSlFyyj1ozvgg

🔥4

589 views, 21:53

Engineer Readings

[ai][transformers.js]
Are we gonna run models in the browser?
Here's a great overview of what folks have achieved so far:
https://www.youtube.com/watch?v=n18Lrbo8VU8

YouTube

Transformers.js: State-of-the-art Machine Learning for the web

Join Joshua Lochner from HuggingFace to learn about Transformers.js, an exciting new JavaScript library that empowers developers to build never-before-seen web applications. It is designed to be functionally equivalent to Hugging Face's Python transformers…

688 views, 08:22

Engineer Readings

[infra][uber][odin]
https://www.uber.com/en-DK/blog/the-accounter/

👍1

563 views, 22:14

Engineer Readings

[hardware][ai][network]
Quick overview of the hardware for AI infra (switches) at Meta
https://engineering.fb.com/2024/10/15/data-infrastructure/open-future-networking-hardware-ai-ocp-2024-meta/

Engineering at Meta

OCP Summit 2024: The open future of networking hardware for AI

At Open Compute Project Summit (OCP) 2024, we’re sharing details about our next-generation network fabric for our AI training clusters. We’ve expanded our network hardware portfolio and are contrib…

497 views, 09:43

Engineer Readings

[competitive programming][book]
When we are on the open market looking for a job, we aim to be good enough to pass the gates. But what if we aimed for perfection?

https://cses.fi/book/book.pdf

👍2

505 views, edited 20:14

Engineer Readings

[scalability][db]
https://www.notion.so/blog/building-and-scaling-notions-data-lake

Notion

How Notion built and grew our data lake to keep up with rapid growth

403 views, 19:39

Engineer Readings

[leadership][it’s not ai]
Great to see how leaders express their thoughts on getting things done, as well as how they treat themselves at times when they need to make a product successful. Some things you can read between the lines. It’s a 3-minute read, and it’s worth it.
https://www.notion.so/blog/5-principles-for-effective-ai-leadership-without-deep-expertise

Notion

5 principles for effective AI leadership without deep expertise

In leadership roles, especially technical-leadership roles, there are few subjects you will be asked about more often than AI. But what if, like me until recently, you have lots of technical experience but have yet to dive meaningfully into AI development?

418 views, 19:54

Engineer Readings

[mediapipe][cross platform ai]

https://youtu.be/tVvKlx-oVqc?si=iv2jXEQJQfKhip6k

YouTube

MediaPipe Web: Bringing cross-platform AI tech to the browser

Tyler Mullen (Staff SWE, Google) will teach you about MediaPipe's cross-platform approach to building AI pipelines and bringing them to the browser. He'll highlight some of the benefits of our method and talk about a few of the major products we help power…

453 views, 20:11

Engineer Readings

[data structures][paper]

Cache-Oblivious Algorithms and Data Structures

https://erikdemaine.org/papers/BRICS2002/paper.pdf