DevOps&SRE Library – Telegram
A library of articles on DevOps and SRE.

Advertising: @ostinostin
Content: @mxssl

RKN (Roskomnadzor): https://www.gosuslugi.ru/snet/67704b536aa9672b963777b3
coredns-manager-operator

With the CoreDNS Manager Operator, you can handle internal DNS directly within your Kubernetes cluster, simplifying the process and reducing infrastructure needs.


https://github.com/monkale-io/coredns-manager-operator
chartdb

ChartDB is a powerful, web-based database diagramming editor. Instantly visualize your database schema with a single "Smart Query." Customize diagrams, export SQL scripts, and access all features with no account required. Experience seamless database design.


https://github.com/chartdb/chartdb
umami

Umami is a simple, fast, privacy-focused alternative to Google Analytics.


https://github.com/umami-software/umami
psitransfer

Simple, open-source, self-hosted file sharing solution. It's an alternative to paid services like Dropbox and WeTransfer.


https://github.com/psi-4ward/psitransfer
The Continuous Delivery Pipeline Problem

A Perspective on the Current State of Continuous Delivery


https://manifesto.getglu.dev
How Dropbox Saved Millions of Dollars by Building a Load Balancer

Dropbox saved resources by creating a superior version of a tool everyone uses


https://newsletter.betterstack.com/p/how-dropbox-saved-millions-of-dollars
Break Stuff on Purpose

Strengthen your system’s ability to recover by intentionally causing and resolving failures


https://slack.engineering/break-stuff-on-purpose
New Production Readiness Check experience in Mercari

My team, Marketplace SRE, is part of the Platform Division, which provides the platform for the Mercari Group as a whole. This article discusses improvements made to the process called Production Readiness Check, which supports the reliability of our services, and how it changed the developer experience.


https://engineering.mercari.com/en/blog/entry/20241213-new-production-readiness-check-experience-in-mercari
Uptime, status pages, and transparency calculus

When you first create a status page, it’s probably because you want to communicate outages to your customers. The faster you can share details about an outage, the sooner your customers know what’s going on, and the more effectively they can handle the outage.

Communicating promptly – in clear language – builds trust. And as a young company with a customer-centric focus, that’s your top priority.

So why is it that as an industry, we no longer fully trust the status page of large service providers?


https://blog.lawrencejones.dev/status-pages
Preventing Out-of-Memory (OOM) Kills in Kubernetes: Tips for Optimizing Container Memory Management

https://causely.ai/kubernetes-oom-killer-tips
Adding latency: one step, two step, oops

https://blog.lawrencejones.dev/latency
How to support a growing Kubernetes cluster with a small etcd

https://www.datadoghq.com/blog/managing-etcd-storage
The Evolution of SRE at Google

Using STAMP to improve resilience in Google production systems


https://www.usenix.org/publications/loginonline/evolution-sre-google

CAST summary notes for tech teams: https://github.com/joelparkerhenderson/causal-analysis-based-on-system-theory
Ghostty

Ghostty is a fast, feature-rich, and cross-platform terminal emulator that uses platform-native UI and GPU acceleration.


https://ghostty.org
Mastering GitOps with Flux

At Adore Me, we’ve harnessed the power of GitOps and Flux to streamline our complex infrastructure. This approach ensures consistency, security, and efficiency, enabling continuous innovation and seamless deployment across our Kubernetes clusters.


https://adoreme.tech/mastering-gitops-with-flux-adoreme-024b56ac397b
From Chaos to Control: The Importance of Tailored Autoscaling in Kubernetes

https://dev.to/check/from-chaos-to-control-the-importance-of-tailored-autoscaling-in-kubernetes-2kpn
Mastering Graceful Shutdowns in Go: A Comprehensive Guide for Kubernetes

https://hackernoon.com/mastering-graceful-shutdowns-in-go-a-comprehensive-guide-for-kubernetes
Internal Developer Platforms: A Real Thing or Just a Trend?

https://itnext.io/internal-developer-platforms-a-real-thing-or-just-a-trend-ee9c97870dcc