DevOps&SRE Library – Telegram
DevOps&SRE Library
18.4K subscribers
459 photos
3 videos
2 files
5K links
Библиотека статей по теме DevOps и SRE.

Реклама: @ostinostin
Контент: @mxssl

РКН: https://www.gosuslugi.ru/snet/67704b536aa9672b963777b3
Download Telegram
Things that go wrong with disk IO

There are a few interesting scenarios to keep in mind when writing applications (not just databases!) that read and write files, particularly in transactional contexts where you actually care about the integrity of the data and when you are editing data in place (versus copy-on-write for example).


https://notes.eatonphil.com/2025-03-27-things-that-go-wrong-with-disk-io.html
Hot Take: I Want Execs Closer to Incidents, Not Farther

https://uptimelabs.io/hot-take-i-want-execs-closer-to-incidents-not-farther
Improving Kubernetes-Mixin API Server Rules Consistency

A journey into troubleshooting an insidious, and subtle, issue that may occur with Prometheus Recording Rules


https://medium.com/codex/improving-kubernetes-mixin-api-server-rules-consistency-1c0d727e8160
cloudflare-operator

A Kubernetes Operator to create and manage Cloudflare Tunnels and DNS records
for (HTTP/TCP/UDP*) Service Resources


https://github.com/adyanth/cloudflare-operator
kubectl-cond

A kubectl plugin to print Kubernetes object resource conditions in a more human-readable format.


https://github.com/ahmetb/kubectl-cond
gravity

Fully-replicated DNS, DHCP and TFTP Server backed by etcd.


https://github.com/BeryJu/gravity
GitOps: How to Manage Dynamic Network Policy Changes at Scale Across 25 Clusters?

https://itnext.io/gitops-how-to-manage-dynamic-network-policy-changes-at-scale-across-25-clusters-0727ce1145e5
Top-3 Helm Plugins: Helm Secrets, Helm Diff and Helm Git

https://dev.to/mkdev/top-3-helm-plugins-helm-secrets-helm-diff-and-helm-git-2ngb
krs

A GenAI-powered Kubetools Recommender system for your Kubernetes cluster.


https://github.com/kubetoolsca/krs
dragonfly

Provide efficient, stable and secure file distribution and image acceleration based on p2p technology to be the best practice and standard solution in cloud native architectures.


https://github.com/dragonflyoss/dragonfly
hetzner-k3s

This is a CLI tool designed to make it incredibly fast and easy to create and manage Kubernetes clusters on Hetzner Cloud using k3s, a lightweight Kubernetes distribution from Rancher. In a test run, I created a 500-node highly available cluster (3 masters, 497 worker nodes) in just under 11 minutes - though this was with only the public network, as private networks are limited to 100 instances per network. I think this might be a world record!


https://github.com/vitobotta/hetzner-k3s
cloudflare-tunnel-ingress-controller

TLDR; This project simplifies exposing Kubernetes services to the internet easily and securely using Cloudflare Tunnel.


https://github.com/STRRL/cloudflare-tunnel-ingress-controller
go-ansible

Go-ansible is a Go package that allows executing Ansible commands, such as ansible-playbook, ansible-inventory, or ansible, directly from Golang applications. It offers a variety of options for each command, facilitating seamless integration of Ansible functionality into your projects. It is important to highlight that go-ansible is not an alternative implementation of Ansible, but rather a wrapper around the Ansible commands. Let's dive in and explore the capabilities of go-ansible together.


https://github.com/apenella/go-ansible
Turning Non-Prod Incidents into Resilience-Building Opportunities

https://uptimelabs.io/turning-non-prod-incidents-into-resilience-building-opportunities
How Should You Compensate Your Employees for Being On Call?

https://www.pagerduty.com/blog/how-should-you-compensate-your-employees