NEW BOT Телеграм, страница - 212324138

Engineer Readings

@engineerreadings

522 subscribers

6 photos

3 files

666 links

Download Telegram

About

Blog

Apps

Platform

Engineer Readings

522 subscribers

Engineer Readings

[ai]

https://ai-2027.com/

A research-backed AI scenario forecast.

🔥3💩1

486 views06:42

Engineer Readings

[mcp][security]

https://invariantlabs.ai/blog/mcp-security-notification-tool-poisoning-attacks

invariantlabs.ai

MCP Security Notification: Tool Poisoning Attacks

We have discovered a critical vulnerability in the Model Context Protocol (MCP) that allows for

👍1

499 views19:41

Engineer Readings

[ai research][stanford]

https://storm.genie.stanford.edu/

432 views16:12

Engineer Readings

[agent][protocol]

https://developers.googleblog.com/en/a2a-a-new-era-of-agent-interoperability/

Google for Developers Blog - News about Web, Mobile, AI and Cloud

Explore A2A, Google's new open protocol empowering developers to build interoperable AI solutions.

🔥2

523 views13:46

Engineer Readings

[voice ai][learnings]

https://voiceaiandvoiceagents.com/

Voiceaiandvoiceagents

Voice AI & Voice Agents | An Illustrated Primer

A comprehensive guide to voice AI in 2025

🔥3

606 views03:08

Engineer Readings

[llm][model training]

https://magazine.sebastianraschka.com/p/the-state-of-llm-reasoning-model-training

Sebastianraschka

The State of Reinforcement Learning for LLM Reasoning

Understanding GRPO and New Insights from Reasoning Model Papers

616 views17:55

Engineer Readings

[hpc][architecture]

https://temitayoadefemi.github.io/Papers/High%20Performance%20Architectures.pdf

699 views21:38

Engineer Readings

[data][validation][storage]

https://dropbox.tech/infrastructure/pocket-watch

Pocket watch: Verifying exabytes of data

❤2🔥1

608 views06:22

Engineer Readings

[durable objects][cloudflare]

https://blog.cloudflare.com/detecting-workers-builds-errors-across-1-million-durable-durable-objects/

The Cloudflare Blog

Let’s DO this: detecting Workers Builds errors across 1 million Durable Objects

Workers Builds, our CI/CD product for deploying Workers, monitors build issues by analyzing build failure metadata spread across over one million Durable Objects.

642 views21:41

Engineer Readings

[llm][injection]

https://arxiv.org/abs/2506.08837

Design Patterns for Securing LLM Agents against Prompt Injections

As AI agents powered by Large Language Models (LLMs) become increasingly versatile and capable of addressing a broad spectrum of tasks, ensuring their security has become a critical challenge....

592 views21:27

Engineer Readings

[cursor][architecture]

https://newsletter.pragmaticengineer.com/p/cursor

Pragmaticengineer

Real-world engineering challenges: building Cursor

Cursor has grown 100x in load in just a year, sees 1M+ QPS for its data layer, and serves billions of code completions, daily. A deepdive into how it’s built with cofounder, Sualeh Asif

734 views12:32

Engineer Readings

[zero latency][sqlite]

https://blog.cloudflare.com/sqlite-in-durable-objects/

The Cloudflare Blog

Zero-latency SQLite storage in every Durable Object

Traditional cloud storage is inherently slow because it is accessed over a network and must synchronize many clients. But what if we could instead put your application code deep into the storage layer, such that your code runs where the data is stored? Durable…

947 views20:05

Engineer Readings

[search][llm]

https://arxiv.org/abs/2506.05213

https://github.com/NathanHerr/LLM-First-Search

LLM-First Search: Self-Guided Exploration of the Solution Space

Large Language Models (LLMs) have demonstrated remarkable improvements in reasoning and planning through increased test-time compute, often by framing problem-solving as a search process. While...

905 viewsedited 19:16

Engineer Readings

[algorithms]

https://en.algorithmica.org/hpc/

👍1

819 views08:52

Engineer Readings

[building your own VM]

https://www.jmeiners.com/lc3-vm/

676 views16:30

Engineer Readings

[r2 sql][db][cloudflare]

https://blog.cloudflare.com/r2-sql-deep-dive/

The Cloudflare Blog

R2 SQL: a deep dive into our new distributed query engine

R2 SQL provides a built-in, serverless way to run ad-hoc analytic queries against your R2 Data Catalog. This post dives deep under the Iceberg into how we built this distributed engine, from its metadata-driven planner to its parallel execution model.

620 views06:14

Engineer Readings

[nvidia][gpu][architecture]
Inside NVIDIA GPUs: Anatomy of high performance matmul kernels

https://t.co/4jLomyexEu

Inside NVIDIA GPUs: Anatomy of high performance matmul kernels - Aleksa Gordić

From GPU architecture and PTX/SASS to warp-tiling and deep asynchronous tensor core pipelines.

🔥3

694 views08:26

Engineer Readings

[oracle][ai][db]
Oracle released Oracle AI database.
Notes:

https://www.oracle.com/news/announcement/ai-world-database-26ai-powers-the-ai-for-data-revolution-2025-10-14/

Oracle AI Database 26ai Powers the AI for Data Revolution

Oracle AI Database 26ai architects AI into the core of data management, furthering Oracle’s commitment to help customers securely bring AI to all their data, everywhere.

626 views12:40

Engineer Readings

[debugging] Hash-Based Bisect Debugging in Compilers and Runtimes

https://research.swtch.com/bisect

600 viewsedited 07:32

Engineer Readings

Happy New Year everyone! 🎄

🎄16

354 views08:26

Engineer Readings

[ai chips] Nvidia's latest move in the AI hardware race: specialized chips for inference

Nvidia just announced the Rubin CPX - a GPU specifically optimized for the prefill phase of inference. This is fascinating because it challenges the "one chip fits all" approach we've seen dominating AI infrastructure.
The core insight: prefill (generating the first token) is compute-heavy but barely uses memory bandwidth, while decode (generating subsequent tokens) is the opposite - memory-bound with underutilized compute. Running both on the same high-end GPU with expensive HBM wastes resources.
Rubin CPX uses cheaper GDDR7 instead of HBM (cutting memory cost by 50%+), drops NVLink for simple PCIe, but maintains strong FP4 compute - 20 PFLOPS dense. It's designed to be drastically cheaper per unit while being better suited for its specific workload.
The competitive angle is brutal: AMD and others were just catching up with rack-scale designs, and now they need to develop specialized prefill chips too, pushing their roadmaps back another cycle.
This disaggregated approach (separate hardware for prefill/decode) hints at where inference infrastructure is heading - not just software optimization, but purpose-built silicon for different phases of the same task.

https://newsletter.semianalysis.com/p/another-giant-leap-the-rubin-cpx-specialized-accelerator-rack

Another Giant Leap: The Rubin CPX Specialized Accelerator & Rack

New Prefill Specialized GPU, Rack Architecture, BOM, Disaggregated PD, Higher Perf per TCO, Lower TCO, GDDR7 & HBM Market Trends

244 views11:11