DevOps&SRE Library – Telegram
A library of articles on DevOps and SRE.

Advertising: @ostinostin
Content: @mxssl

Roskomnadzor (RKN): https://www.gosuslugi.ru/snet/67704b536aa9672b963777b3
VictoriaLogs vs Loki - Benchmarking Results

TL;DR – After side‑by‑side testing on a 500 GB/7‑day workload, VictoriaLogs cut query latencies by 94 %, shrank storage by ≈40 %, and used < 50 % of the CPU & RAM we previously allocated to Loki. This post explains why we switched.


https://truefoundry.com/blog/victorialogs-vs-loki
What I Really Mean When I Say “Good Communication” in Incident Response

“Good communication” is one of those phrases everyone nods along to — until the incident hits, and suddenly comms unravel before your eyes.

So here’s what I actually mean when I say communication matters.


https://uptimelabs.io/articles/good-communication-in-incident-response
The JVM Pause That Wasn't: A War Story

A high-throughput Java service was stalling. The culprit? Stop-the-World GC pauses were blocked by synchronous log writes to a busy disk.


https://dzone.com/articles/the-jvm-pause-that-wasnt-a-war-story
Kubernetes Informers are so easy... to misuse!

https://render.com/blog/kubernetes-informers
Breaking Boundaries - Kubernetes Namespaces and multi-tenancy

https://blog.amberwolf.com/blog/2025/september/kubernetes_namespace_boundaries
pvc-autoresizer

pvc-autoresizer resizes PersistentVolumeClaims (PVCs) when the free amount of storage is below the threshold.

It queries volume usage metrics from Prometheus, which collects them from the kubelet.
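
The threshold check described above can be sketched roughly as follows. This is a minimal illustration, not pvc-autoresizer's actual code: the function name `next_pvc_size`, the 10% threshold, the 10% increase, and rounding up to whole GiB are all assumptions made for the example. The two inputs correspond to what the kubelet reports through gauges such as `kubelet_volume_stats_capacity_bytes` and `kubelet_volume_stats_available_bytes`.

```python
GIB = 1024 ** 3


def next_pvc_size(capacity_bytes, available_bytes,
                  threshold_percent=10, increase_percent=10,
                  storage_limit_bytes=None):
    """Return the new requested size in bytes, or None if no resize is needed.

    Hypothetical helper: the names and default values are assumptions
    for illustration, not taken from pvc-autoresizer itself.
    """
    if capacity_bytes <= 0:
        raise ValueError("capacity_bytes must be positive")

    # Enough free space left: free share is at or above the threshold.
    if available_bytes * 100 >= capacity_bytes * threshold_percent:
        return None

    # Grow by a percentage of the current capacity (integer ceiling division).
    increase = -(-capacity_bytes * increase_percent // 100)
    new_size = capacity_bytes + increase

    # Round up to a whole GiB so the request is a clean quantity.
    new_size = -(-new_size // GIB) * GIB

    # Never exceed the configured upper bound, if there is one.
    if storage_limit_bytes is not None:
        new_size = min(new_size, storage_limit_bytes)

    # A PVC can only grow; if the limit leaves no room, do nothing.
    return new_size if new_size > capacity_bytes else None


if __name__ == "__main__":
    # 10 GiB volume with 0.5 GiB free (5% free, below the 10% threshold).
    print(next_pvc_size(10 * GIB, GIB // 2) // GIB)
```

Integer arithmetic is used throughout so that the percentage math does not depend on floating-point rounding.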


https://github.com/topolvm/pvc-autoresizer
Why I Use Terragrunt Over Terraform/OpenTofu in 2025

Terragrunt vs Terraform: why I chose Terragrunt to eliminate code duplication, automate state management, orchestrate deployments, and enable pattern-level reuse.


https://www.axelmendoza.com/posts/terraform-vs-terragrunt
Introducing Observable Load Testing = Locust + OpenTelemetry!

https://medium.com/locust-cloud/observable-load-testing-locust-opentelemetry-c5fced129d84
Patterns for Deploying OTel Collector at Scale

As applications grow, the question quickly shifts from what OTel can do to how we can deploy it effectively at scale. In this post, we’ll explore some deployment patterns for the OTel Collector!


https://newsletter.signoz.io/p/patterns-for-deploying-otel-collector
Better observability, deeper insights: OpenSearch’s new Piped Processing Language capabilities

https://opensearch.org/blog/better-observability-deeper-insights-opensearchs-new-piped-processing-language-capabilities
The "Meh-trics" Reloaded: Why I Was 100% Wrong About Metrics (and Also 100% Right)

https://www.honeycomb.io/blog/the-meh-trics-reloaded
renovate

Renovate is an automated dependency update tool. It helps to update dependencies in your code without needing to do it manually. When Renovate runs on your repo, it looks for references to dependencies (both public and private) and, if there are newer versions available, Renovate can create pull requests to update your versions automatically.
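
The "look for references, compare against newer versions" step can be pictured with a toy sketch. It is only an illustration of the idea, not how Renovate is implemented: `propose_updates`, the dictionary inputs, and the plain dotted numeric version format are all hypothetical. A real tool would read manifests, query package registries, and open pull requests.

```python
def parse_version(text):
    """Turn a plain dotted version such as '2.10.0' into (2, 10, 0)."""
    return tuple(int(part) for part in text.split("."))


def propose_updates(pinned, available):
    """Return {name: newest_version} for every dependency that is behind.

    pinned:    {name: version currently referenced in the repo}
    available: {name: [versions published upstream]}
    Both structures are hypothetical stand-ins for manifest and registry data.
    """
    updates = {}
    for name, current in pinned.items():
        candidates = available.get(name, [])
        if not candidates:
            continue  # nothing known upstream, so nothing to propose
        newest = max(candidates, key=parse_version)
        # Compare numerically so that 2.10.0 is treated as newer than 2.9.0.
        if parse_version(newest) > parse_version(current):
            updates[name] = newest
    return updates


if __name__ == "__main__":
    pinned = {"libfoo": "2.9.0", "libbar": "3.0.0"}
    available = {"libfoo": ["2.9.0", "2.10.0"], "libbar": ["2.3.3", "3.0.0"]}
    print(propose_updates(pinned, available))
```

Each entry in the result would correspond to one proposed update; dependencies that are already on the newest known version are left out.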


https://github.com/renovatebot/renovate
walrus

Walrus is a distributed message streaming platform built on a high-performance log storage engine. It provides fault-tolerant streaming with automatic leadership rotation, segment-based partitioning, and Raft consensus for metadata coordination.


https://github.com/nubskr/walrus