DevOps & SRE notes – Telegram
DevOps & SRE notes
12K subscribers
41 photos
19 files
2.5K links
Helpful articles and tools for DevOps&SRE

WhatsApp: https://whatsapp.com/channel/0029Vb79nmmHVvTUnc4tfp2F

For paid consultation (RU/EN), contact: @tutunak


All ways to support https://telegra.ph/How-support-the-channel-02-19
Download Telegram
https://www.datadoghq.com/blog/kubernetes-cpu-requests-limits/
The blog post delves into the intricacies of CPU requests and limits within Kubernetes. It discusses how these parameters affect container scheduling and resource allocation, and offers insights into troubleshooting and optimizing application performance.
👍3
We recently upgraded from Postgres 11.9 to 15.3 with zero downtime by using logical replication, a suite of support noscripts, and tools in Elixir & Erlang’s BEAM virtual machine. This post will go into far too much detail explaining how we did it, and considerations you might need to make along the way if you try to do the same.

https://knock.app/blog/zero-downtime-postgres-upgrades
👍5
Engineers and major companies are pushing a technology called L4S that they say could make the web feel dramatically faster. But how?
https://www.theverge.com/23655762/l4s-internet-apple-comcast-latency-speed-bandwidth
👍52
This is the story of how we upgraded our fleet of 1200+ MySQL hosts to 8.0. Upgrading the fleet with no impact to our Service Level Objectives (SLO) was no small feat–planning, testing and the upgrade itself took over a year and collaboration across multiple teams within GitHub.
https://github.blog/2023-12-07-upgrading-github-com-to-mysql-8-0/
👍3
https://questdb.io/blog/questdb-cloud-kubernetes-operator/

The article from QuestDB discusses their transition to a Kubernetes Operator for their cloud operations. The switch involves moving from a provisioner-based system to a Kubernetes Operator, which manages complex operations and orchestrates features like automatic node failure recovery, High Availability, and Cold Storage. This new system simplifies provisioning by reducing code complexity and improving unit testing. It also offers benefits like auto-healing, easier upgrades, and improved control plane locality. The operator is designed to be a drop-in replacement for their current system, ensuring seamless transition and enhanced automation capabilities.
👍2