DevOps&SRE Library

Failure is inevitable: Learning from a large outage, and building for reliability in depth at Datadog

https://www.datadoghq.com/blog/engineering/rethinking-reliability


Why we're leaving serverless

Every millisecond matters when you're in the critical path of API authentication. After two years of fighting serverless limitations, we rebuilt our entire API stack and slashed the end-to-end latency.

https://www.unkey.com/blog/serverless-exit


Advancing Our Chef Infrastructure: Safety Without Disruption

Building a safer, more reliable path forward for Chef at Slack

https://slack.engineering/advancing-our-chef-infrastructure-safety-without-disruption


Container CPU Requests & Limits Explained with GOMAXPROCS Tuning

In this article, we’re going to cover a few things that might’ve puzzled you if you’ve been running your applications, especially Go applications, in Kubernetes:

- How Kubernetes and the Linux kernel handle CPU stuff for containers
- What the Go runtime does with CPU, and whether you should bother setting GOMAXPROCS
- Which metrics are actually worth paying attention to

Maybe you’ve seen some of these metrics before while keeping an eye on your applications, but didn’t fully know what to make of them. This should help clear that up.
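
As a rough illustration of the relationship between a container CPU limit and GOMAXPROCS, here is a minimal Go sketch. It is not taken from the linked article. It assumes the limit is exposed as a cgroup v2 `cpu.max` value (`<quota> <period>` in microseconds, or `max <period>` when unlimited); the helper name `procsFromCPUMax` and the choice to round up are assumptions made for this example.

```go
package main

import (
	"fmt"
	"math"
	"runtime"
	"strconv"
	"strings"
)

// procsFromCPUMax is a hypothetical helper: it derives a GOMAXPROCS candidate
// from the contents of a cgroup v2 cpu.max file. It falls back to hostCPUs
// when there is no limit or the input cannot be parsed.
func procsFromCPUMax(cpuMax string, hostCPUs int) int {
	fields := strings.Fields(cpuMax)
	if len(fields) != 2 || fields[0] == "max" {
		return hostCPUs // no CPU limit configured
	}
	quota, errQ := strconv.ParseFloat(fields[0], 64)
	period, errP := strconv.ParseFloat(fields[1], 64)
	if errQ != nil || errP != nil || quota <= 0 || period <= 0 {
		return hostCPUs
	}
	// Rounding up is an assumption of this sketch; other policies round down.
	procs := int(math.Ceil(quota / period))
	if procs > hostCPUs {
		procs = hostCPUs
	}
	return procs
}

func main() {
	fmt.Println("host CPUs:", runtime.NumCPU())
	// GOMAXPROCS(0) reports the current setting without changing it.
	fmt.Println("current GOMAXPROCS:", runtime.GOMAXPROCS(0))
	// A limit of 2.5 CPUs: 250000us of quota per 100000us period.
	fmt.Println("candidate from limit:", procsFromCPUMax("250000 100000", runtime.NumCPU()))
}
```

Comparing the candidate value with the current GOMAXPROCS gives a quick hint as to whether the runtime is scheduling more threads than the container limit allows.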

https://victoriametrics.com/blog/kubernetes-cpu-go-gomaxprocs


zmx

session persistence for terminal processes

https://github.com/neurosnap/zmx


Running our Docker registry on-prem with Harbor

On hosting images without the price tag.

https://dev.37signals.com/running-our-docker-registry-on-prem-with-harbor


fizzy

This is the source code of Fizzy, the Kanban tracking tool for issues and ideas by 37signals.

https://github.com/basecamp/fizzy


VERT

VERT is a file conversion utility that uses WebAssembly to convert files on your device instead of a cloud.

https://github.com/VERT-sh/VERT


VictoriaLogs vs Loki - Benchmarking Results

TL;DR – After side‑by‑side testing on a 500 GB/7‑day workload, VictoriaLogs cut query latencies by 94 %, shrank storage by ≈40 %, and used < 50 % of the CPU & RAM we previously allocated to Loki. This post explains why we switched.

https://truefoundry.com/blog/victorialogs-vs-loki


What I Really Mean When I Say “Good Communication” in Incident Response

“Good communication” is one of those phrases everyone nods along to — until the incident hits, and suddenly comms unravel before your eyes.

So here’s what I actually mean when I say communication matters.

https://uptimelabs.io/articles/good-communication-in-incident-response


The JVM Pause That Wasn't: A War Story

A high-throughput Java service was stalling. The culprit? Stop-the-World GC pauses were blocked by synchronous log writes to a busy disk.

https://dzone.com/articles/the-jvm-pause-that-wasnt-a-war-story


Kubernetes Informers are so easy... to misuse!

https://render.com/blog/kubernetes-informers


Importance of Graceful Shutdown in Kubernetes

https://dev.to/criteo_tech_community/importance-of-graceful-shutdown-in-kubernetes-2ikb


Breaking Boundaries - Kubernetes Namespaces and multi-tenancy

https://blog.amberwolf.com/blog/2025/september/kubernetes_namespace_boundaries


How our small company migrated from Docker Swarm to Kubernetes

https://medium.com/coreteq/how-our-small-company-migrated-from-docker-swarm-to-kubernetes-fb683c6f5c3a


How We Leverage Kubernetes to Deploy Our Cloud Infrastructure

https://tech-blog.goflink.com/how-we-leverage-kubernetes-to-deploy-our-cloud-infrastructure-abd32cfb7bdf


Centralizing Helm Charts: Moving Beyond Ingress with HTTPProxy

https://medium.com/@deekshaarora21/centralizing-helm-charts-why-ingress-broke-and-how-httpproxy-fixed-it-9b5b0e33bba4


When DIY Beats Managed Kubernetes

https://lakshminp.substack.com/p/when-diy-beats-managed-kubernetes


How our Edge Kubernetes Platform has Evolved

https://medium.com/chick-fil-atech/how-our-edge-kubernetes-platform-has-evolved-12609006bc92


pvc-autoresizer

pvc-autoresizer resizes PersistentVolumeClaims (PVCs) when the free amount of storage is below the threshold.

It queries volume usage metrics from Prometheus, which in turn collects them from the kubelet.
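
The threshold check described above can be sketched in Go as follows. This is only an illustration of the idea, not pvc-autoresizer's actual code: the function `nextSize`, its parameters (threshold and increase as percentages, plus an upper limit), and the sample numbers are all hypothetical.

```go
package main

import "fmt"

// nextSize is a hypothetical helper: it returns the size in bytes a volume
// claim should request next, or the current capacity when no resize is needed.
func nextSize(capacity, available, thresholdPct, increasePct, limit int64) int64 {
	if capacity <= 0 {
		return capacity
	}
	// Resize only when free space has dropped below thresholdPct of capacity.
	if available*100 >= capacity*thresholdPct {
		return capacity
	}
	grown := capacity + capacity*increasePct/100
	if grown > limit {
		grown = limit // never request more than the configured ceiling
	}
	if grown < capacity {
		return capacity // volume claims are never shrunk in this sketch
	}
	return grown
}

func main() {
	const gib = int64(1) << 30
	// 10 GiB volume with 0.5 GiB free, 10% threshold, 20% increase, 100 GiB ceiling.
	fmt.Println(nextSize(10*gib, gib/2, 10, 20, 100*gib)/gib, "GiB")
}
```

In a real controller the `capacity` and `available` values would come from the Prometheus metrics mentioned above rather than from constants.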

https://github.com/topolvm/pvc-autoresizer


Terraform governing with OPA

https://devopsonthetrail.com/terraform-governing-with-opa
