DevOps&SRE Library – Telegram
A library of articles on DevOps and SRE.

Advertising: @ostinostin
Content: @mxssl

RKN (Roskomnadzor): https://www.gosuslugi.ru/snet/67704b536aa9672b963777b3
New Production Readiness Check experience in Mercari

My team, Marketplace SRE, is part of the Platform Division, which provides the platform for the Mercari Group as a whole. This article discusses improvements to the process called Production Readiness Check, which supports the reliability of our services, and how those improvements changed the developer experience.


https://engineering.mercari.com/en/blog/entry/20241213-new-production-readiness-check-experience-in-mercari
Uptime, status pages, and transparency calculus

When you first create a status page, it’s probably because you want to communicate outages to your customers. The faster you can share details about an outage, the sooner your customers know what’s going on, and the more effectively they can handle the outage.

Communicating promptly, in clear language, builds trust. And as a young company with a customer-centric focus, that’s your top priority.

So why is it that, as an industry, we no longer fully trust the status pages of large service providers?


https://blog.lawrencejones.dev/status-pages
Preventing Out-of-Memory (OOM) Kills in Kubernetes: Tips for Optimizing Container Memory Management

https://causely.ai/kubernetes-oom-killer-tips
Adding latency: one step, two step, oops

https://blog.lawrencejones.dev/latency
How to support a growing Kubernetes cluster with a small etcd

https://www.datadoghq.com/blog/managing-etcd-storage
The Evolution of SRE at Google

Using STAMP to improve resilience in Google production systems


https://www.usenix.org/publications/loginonline/evolution-sre-google

CAST summary notes for tech teams: https://github.com/joelparkerhenderson/causal-analysis-based-on-system-theory
Ghostty

Ghostty is a fast, feature-rich, and cross-platform terminal emulator that uses platform-native UI and GPU acceleration.


https://ghostty.org
Mastering GitOps with Flux

At Adore Me, we’ve harnessed the power of GitOps and Flux to streamline our complex infrastructure. This approach ensures consistency, security, and efficiency, enabling continuous innovation and seamless deployment across our Kubernetes clusters.


https://adoreme.tech/mastering-gitops-with-flux-adoreme-024b56ac397b
From Chaos to Control: The Importance of Tailored Autoscaling in Kubernetes

https://dev.to/check/from-chaos-to-control-the-importance-of-tailored-autoscaling-in-kubernetes-2kpn
Mastering Graceful Shutdowns in Go: A Comprehensive Guide for Kubernetes

https://hackernoon.com/mastering-graceful-shutdowns-in-go-a-comprehensive-guide-for-kubernetes
Internal Developer Platforms: A Real Thing or Just a Trend?

https://itnext.io/internal-developer-platforms-a-real-thing-or-just-a-trend-ee9c97870dcc
Generating, transforming, and patching Kubernetes configuration with Kustomize

https://itnext.io/generating-transforming-and-patching-kubernetes-configuration-with-kustomize-fb7b02476a1b
Network Policies

In this article, we talk about network policies in Kubernetes and how we can use them to secure our application workloads.


https://aditya-tanwar.hashnode.dev/kubernetes-network-policies
caddy-waf

A simple Web Application Firewall (WAF) middleware for the Caddy server, designed to provide comprehensive protection against web attacks. This middleware integrates seamlessly with Caddy and offers a wide range of security features to safeguard your applications.


https://github.com/fabriziosalmi/caddy-waf
s3-file-share-for-free

A modern web application for easily sharing and managing files using Amazon S3 storage with a clean and intuitive user interface.


https://github.com/rohitg00/s3-file-share-for-free
diskover

Diskover is an open source file system indexer that uses Elasticsearch to index and manage data across heterogeneous storage systems. With Diskover, users can search and organize files more effectively, and system administrators can manage storage infrastructure, provision storage efficiently, monitor and report on storage usage, and make informed decisions about new infrastructure purchases.


https://github.com/diskoverdata/diskover-community
ggh

Recall your SSH sessions (also search your SSH config file)


https://github.com/byawitz/ggh
zasper

Zasper is an IDE designed from the ground up to support massive concurrency. It provides a minimal memory footprint, exceptional speed, and the ability to handle numerous concurrent connections.


https://github.com/zasper-io/zasper