DevOps&SRE Library – Telegram
DevOps&SRE Library
18.4K subscribers
459 photos
3 videos
2 files
5K links
Библиотека статей по теме DevOps и SRE.

Реклама: @ostinostin
Контент: @mxssl

РКН: https://www.gosuslugi.ru/snet/67704b536aa9672b963777b3
Download Telegram
Upgrading Kubernetes - A Practical Guide

One common question I see on Mastodon and Reddit is "I've inherited a cluster, how do I safely upgrade it". It's surprising that this still isn't a better understood process given the widespread adoption of k8s, but I've had to take over legacy clusters a few times and figured I would write up some of the tips and tricks I've found over the years to make the process easier.


https://matduggan.com/upgrading-kubernetes-safely
Why We Developed Own Kubernetes Controller to Copy Secrets

Kubernetes is a superior platform for deploying and managing applications, but sometimes simple tasks like having the same data — in our case Secrets — between namespaces can cause slight problems and you have to build your own solutions to those problems.


https://medium.com/lonto-digital-services-integrator/why-we-developed-own-kubernetes-controller-to-copy-secrets-e46368ae6db9
Kubernetes YAML tips and tricks

TL;DR: In this article, you will learn tips and gotchas for writing YAML manifests for Kubernetes resources.


https://itnext.io/kubernetes-yaml-tips-and-tricks-904a2c0b2b81
Not Just Scale

It seems like everywhere I look on the internet these days, somebody’s making some form of the following argument:

You don’t need distributed systems! Computers are so fast these days you can serve all your customers off a single machine!

This argument is silly and reductive.

But first, let’s look for the kernel of truth.

One Machine Is All You Need?


https://brooker.co.za/blog/2024/06/04/scale.html
eks-node-viewer

eks-node-viewer is a tool for visualizing dynamic node usage within a cluster. It was originally developed as an internal tool at AWS for demonstrating consolidation with Karpenter. It displays the scheduled pod resource requests vs the allocatable capacity on the node. It does not look at the actual pod resource usage.


https://github.com/awslabs/eks-node-viewer
kubernetes-learning-path

A roadmap to learn Kubernetes from scratch (Beginner to Advanced level)


https://github.com/techiescamp/kubernetes-learning-path
Modern Git Commands and Features You Should Be Using

https://martinheinz.dev/blog/109
Why didn't you status?

Demystifying the public status process.


https://hross.substack.com/p/why-didnt-you-status
pgmq

A lightweight message queue. Like AWS SQS and RSMQ but on Postgres.


https://github.com/tembo-io/pgmq
gitswitch

GitSwitch, the ultimate solution for managing multiple Git users across different vendors


https://github.com/target-ops/gitswitch
Anomaly Alerting in Prometheus

Exploring how to do anomaly alerting on seasonal data in Prometheus.


https://karlstoney.com/response-time-anomaly-alert
Key metrics for monitoring AWS WAF

https://www.datadoghq.com/blog/aws-waf-metrics
overmind-cli

Overmind is a powerful tool for real-time impact analysis on Terraform changes. Overmind can identify the blast radius and uncover potential risks with overmind terrafrom plan before they harm your infrastructure, allowing anyone to make changes with confidence. We also track the impacts of the changes you make with overmind teraform apply, so that you can be sure that your changes haven't had any unexpected downstream impact.


https://github.com/overmindtech/cli
neosync

Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments.


https://github.com/nucleuscloud/neosync