DevOps&SRE Library – Telegram
A library of articles on DevOps and SRE.

Advertising: @ostinostin
Content: @mxssl

RKN (Roskomnadzor) registration: https://www.gosuslugi.ru/snet/67704b536aa9672b963777b3
pgmq

A lightweight message queue. Like AWS SQS and RSMQ but on Postgres.


https://github.com/tembo-io/pgmq
gitswitch

GitSwitch, the ultimate solution for managing multiple Git users across different vendors


https://github.com/target-ops/gitswitch
Anomaly Alerting in Prometheus

Exploring how to do anomaly alerting on seasonal data in Prometheus.


https://karlstoney.com/response-time-anomaly-alert
Key metrics for monitoring AWS WAF

https://www.datadoghq.com/blog/aws-waf-metrics
overmind-cli

Overmind is a powerful tool for real-time impact analysis on Terraform changes. Overmind can identify the blast radius and uncover potential risks with overmind terraform plan before they harm your infrastructure, allowing anyone to make changes with confidence. We also track the impacts of the changes you make with overmind terraform apply, so that you can be sure that your changes haven't had any unexpected downstream impact.


https://github.com/overmindtech/cli
neosync

Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments.


https://github.com/nucleuscloud/neosync
sqlite-studio

Single binary, single command SQLite database explorer. SQLite studio supports opening local SQLite DB files, libSQL servers, and PostgreSQL.


https://github.com/frectonz/sqlite-studio
jsondiff

jsondiff is a Go package for computing the diff between two JSON documents as a series of RFC6902 (JSON Patch) operations, which makes it particularly suitable for building the patch response of a Kubernetes Mutating Webhook, for example.


https://github.com/wI2L/jsondiff
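
To illustrate what "a diff as a series of RFC6902 operations" looks like, here is a minimal standard-library sketch. It is not the jsondiff package's API or algorithm: the function names diff and escape are hypothetical, only nested objects are walked, and arrays or scalars that differ are replaced wholesale.

```python
def escape(token: str) -> str:
    # RFC 6901 pointer escaping: "~" becomes "~0", then "/" becomes "~1".
    return token.replace("~", "~0").replace("/", "~1")


def diff(source, target, path=""):
    """Return a list of RFC 6902 operations that turn source into target.

    Simplified sketch (assumption): only dicts are compared key by key;
    any other differing value, including lists, yields one "replace".
    """
    if isinstance(source, dict) and isinstance(target, dict):
        ops = []
        # Keys present only in the source are removed.
        for key in sorted(source.keys() - target.keys()):
            ops.append({"op": "remove", "path": f"{path}/{escape(key)}"})
        # Keys present only in the target are added.
        for key in sorted(target.keys() - source.keys()):
            ops.append(
                {"op": "add", "path": f"{path}/{escape(key)}", "value": target[key]}
            )
        # Keys present in both are compared recursively.
        for key in sorted(source.keys() & target.keys()):
            ops.extend(diff(source[key], target[key], f"{path}/{escape(key)}"))
        return ops
    if source != target:
        return [{"op": "replace", "path": path, "value": target}]
    return []


# Hypothetical example: an object before and after a mutation.
source = {"metadata": {"labels": {"app": "web"}}, "spec": {"replicas": 1}}
target = {"metadata": {"labels": {"app": "web", "team": "sre"}}, "spec": {"replicas": 3}}
patch = diff(source, target)
```

In a mutating webhook, a list like patch would typically be serialized to JSON and returned as the patch for the admitted object; the real package handles arrays and many other cases that this sketch ignores.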
Grafana Loki query acceleration: How we sped up queries without adding resources

https://grafana.com/blog/2024/05/29/grafana-loki-query-acceleration-how-we-sped-up-queries-without-adding-resources
Linux Crisis Tools

When you have an outage caused by a performance issue, you don't want to lose precious time just to install the tools needed to diagnose it. Here is a list of "crisis tools" I recommend installing on your Linux servers by default (if they aren't already), along with the (Ubuntu) package names that they come from.


https://www.brendangregg.com/blog/2024-03-24/linux-crisis-tools.html
5 shortcomings of Helm

1. Helm doesn't provide a mechanism for upgrading Custom Resource Definitions
2. Helm dependency management
3. Helm chart creation is not user-friendly
4. The values.yaml file is an anti-pattern
5. Inability to interact with the Kubernetes API


https://glasskube.eu/en/r/knowledge/5-helm-shortcomings
A/B Testing with Linkerd and Flagger using Dynamic Routing

Linkerd’s dynamic request routing feature was introduced in Linkerd 2.13. It leverages the Kubernetes Gateway API, which reduces the amount of new configuration machinery introduced onto the clusters while still providing a rich and expressive feature set. In this blog post, we will look at Linkerd’s dynamic request routing feature and use it for A/B testing using Flagger.


https://www.infracloud.io/blogs/ab-testing-with-linkerd-flagger-using-dynamic-routing
Maximize Cost Savings by Putting Your Kubernetes Resources to Sleep During Off-Hours

https://www.perfectscale.io/blog/putting-k8s-resources-to-sleep-with-keda
Importance of Kubernetes and the Need for Tainting Nodes

https://semaphoreci.com/blog/kubernetes-taints-tolerations