DevOps & SRE notes – Telegram
DevOps & SRE notes
12K subscribers
38 photos
19 files
2.49K links
Helpfull articles and tools for DevOps&SRE

WhatsApp: https://whatsapp.com/channel/0029Vb79nmmHVvTUnc4tfp2F

For paid consultation (RU/EN), contact: @tutunak


All ways to support https://telegra.ph/How-support-the-channel-02-19
Download Telegram
DevOps & SRE notes
How did you start your morning? Cloudflare decided that you’d had too much of the internet.
A change made to how Cloudflare's Web Application Firewall parses requests caused Cloudflare's network to be unavailable for several minutes this morning. This was not an attack; the change was deployed by our team to help mitigate the industry-wide vulnerability disclosed this week in React Server Components. We will share more information as we have it today.

https://www.cloudflarestatus.com/incidents/lfrm31y6sw9q
👍4
Cloudflare has had two major outages in less than 30 days. Are big tech companies broken? Can’t they be examples of good role models? Or is it just that shit happens?
Final Results
35%
Yes, everything is broken. They’re no longer a good example of solid engineering practices
65%
No, it’s fine. Shit happens, come on.
👍3💯1
Will Sulzer's report details the process of deploying self-hosted GitHub Action Runners on Google Kubernetes Engine (GKE) using a rootless Docker-in-Docker setup. The instructions focus on achieving this with minimal privileges for enhanced security.
https://medium.com/google-cloud/github-action-runners-on-gke-with-dind-rootless-bd54e23516c9
👍2🔥2
This analysis explores how eBPF (extended Berkeley Packet Filter) can be used to gain insights into real-time SSL/TLS encrypted traffic. The author, TJ. Podobnik, discusses how this technology allows for monitoring without compromising security.
https://medium.com/all-things-ebpf/what-insights-can-ebpf-provide-into-real-time-ssl-tls-encrypted-traffic-and-how-435c8ad33efc
👍5
This post by Brian Chambers reflects on the lessons learned from launching an edge compute platform at Chick-fil-A. It discusses the challenges and successes of developing and scaling the platform from within the Enterprise Architecture team.
https://medium.com/chick-fil-atech/what-we-learned-from-launching-edge-compute-from-enterprise-architecture-1dc34e49482f
👍1
This article discusses the importance of the "what went well" section in incident write-ups, arguing that it's more than just a morale booster. Lorin Hochstein suggests that detailing successful improvisations and diagnostic work can be a powerful learning tool for future incident responders.
https://surfingcomplexity.blog/2025/06/14/what-went-well-is-more-than-just-a-pat-on-the-back/
👍3
Forwarded from DevOps & SRE notes (tutunak)
Looking for a hosting platform to practice with Linux, Kubernetes, etc.? Register using my referral link on DigitalOcean and get $200 in credit for 60 days. By registering through my referral link, you also support this Telegram channel.

👉 Register
🔥43👍3👏1
This piece, "The MTTI Manifesto," argues for the importance of a new metric in incident response: Mean Time to Isolate. The author contends that the majority of outage time is spent identifying the problem's source, not fixing it, and that focusing on MTTI can drive significant improvements in system architecture and observability.
https://www.oldschoolburke.com/the-mtti-manifesto/
👍5
This write-up explores the emerging discipline of AI Reliability Engineering (AIRe) as the "Third Age of SRE." It argues that the unique challenges of AI workloads, such as their probabilistic nature and new failure modes like model decay, require an evolution of traditional Site Reliability Engineering principles.
https://thenewstack.io/ai-reliability-engineering-welcome-to-the-third-age-of-sre/
This dispatch offers a detailed walkthrough for backend engineers on creating a Kubernetes Operator using Go and Kubebuilder. The author, Amr Elhewy, simplifies complex DevOps concepts by building a practical "PodTracker" operator that sends Slack notifications for new pod creations.
https://hewi.blog/a-backend-engineer-lost-in-the-devops-world-making-a-kubernetes-operator-with-go
🔥3