kubernetes-learning-path
https://github.com/techiescamp/kubernetes-learning-path
A roadmap to learn Kubernetes from scratch (Beginner to Advanced level)
Building an Observability Solution with ClickHouse
Part 1 - Logs: https://clickhouse.com/blog/storing-log-data-in-clickhouse-fluent-bit-vector-open-telemetry
Part 2 - Traces: https://clickhouse.com/blog/storing-traces-and-spans-open-telemetry-in-clickhouse
Why didn't you status?
https://hross.substack.com/p/why-didnt-you-status
Demystifying the public status process.
Trial by Fire: Tales from the SRE Frontlines — Ep2: The Scary ApplicationSet
https://medium.com/adevinta-tech-blog/trial-by-fire-tales-from-the-sre-frontlines-ep2-the-scary-applicationset-ec1a2d491562
Kubernetes Tip: What Happens To Pods Running On Node That Become Unreachable?
https://medium.com/tailwinds-navigator/kubernetes-tip-what-happens-to-pods-running-on-node-that-become-unreachable-3d409f734e5d
pgmq
https://github.com/tembo-io/pgmq
A lightweight message queue. Like AWS SQS and RSMQ but on Postgres.
gitswitch
https://github.com/target-ops/gitswitch
GitSwitch, the ultimate solution for managing multiple Git users across different vendors
Anomaly Alerting in Prometheus
https://karlstoney.com/response-time-anomaly-alert
Exploring how to do anomaly alerting on seasonal data in Prometheus.
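As a rough illustration of what alerting on seasonal data can involve, the sketch below compares a current reading against values observed at the same point in earlier cycles (for example, the same hour in previous weeks) using a z-score. This is a generic technique, not necessarily the method the linked article uses; the function names, the sample numbers, and the threshold of 3 are assumptions made for the example.

```python
from statistics import mean, stdev


def z_score(current: float, baseline: list[float]) -> float:
    """Distance of `current` from the baseline mean, in standard deviations."""
    mu = mean(baseline)
    sigma = stdev(baseline)  # sample standard deviation; needs >= 2 points
    if sigma == 0:
        # A perfectly flat baseline gives no scale to measure against.
        return 0.0
    return (current - mu) / sigma


def is_anomalous(current: float, baseline: list[float], threshold: float = 3.0) -> bool:
    """Flag readings more than `threshold` deviations away from the baseline."""
    return abs(z_score(current, baseline)) > threshold


# Hypothetical response times (ms) seen at the same hour in five earlier weeks.
same_hour_previous_weeks = [100.0, 110.0, 90.0, 105.0, 95.0]

print(is_anomalous(102.0, same_hour_previous_weeks))
print(is_anomalous(180.0, same_hour_previous_weeks))
```

Comparing against the same point in earlier cycles, rather than against the last few minutes, keeps a normal daily or weekly peak from being flagged as an anomaly.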
overmind-cli
https://github.com/overmindtech/cli
Overmind is a tool for real-time impact analysis on Terraform changes. With overmind terraform plan, it identifies the blast radius and uncovers potential risks before they harm your infrastructure, allowing anyone to make changes with confidence. With overmind terraform apply, it also tracks the impact of the changes you make, so that you can be sure they haven't had any unexpected downstream impact.
neosync
https://github.com/nucleuscloud/neosync
Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments.
sqlite-studio
https://github.com/frectonz/sqlite-studio
Single binary, single command SQLite database explorer. SQLite studio supports opening local SQLite DB files, libSQL servers, and PostgreSQL databases.
jsondiff
https://github.com/wI2L/jsondiff
jsondiff is a Go package for computing the diff between two JSON documents as a series of RFC 6902 (JSON Patch) operations. This makes it particularly suitable for creating the patch response of a Kubernetes Mutating Webhook, for example.
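To make the idea of "a diff expressed as RFC 6902 operations" concrete, here is a minimal standard-library sketch in Python. It is not the jsondiff API or its algorithm: the diff function, its simplified handling of arrays (replaced wholesale), and the sample documents are all assumptions made for illustration.

```python
def escape(token: str) -> str:
    # RFC 6901 pointer escaping: "~" becomes "~0", then "/" becomes "~1".
    return token.replace("~", "~0").replace("/", "~1")


def diff(source, target, path=""):
    """Return a list of JSON Patch operations that turn `source` into `target`.

    Simplification: only objects are compared key by key; arrays and scalars
    that differ are replaced as a whole.
    """
    if isinstance(source, dict) and isinstance(target, dict):
        ops = []
        # Keys that disappeared become "remove" operations.
        for key in source:
            if key not in target:
                ops.append({"op": "remove", "path": f"{path}/{escape(key)}"})
        # New keys become "add"; shared keys are compared recursively.
        for key in target:
            child = f"{path}/{escape(key)}"
            if key not in source:
                ops.append({"op": "add", "path": child, "value": target[key]})
            else:
                ops.extend(diff(source[key], target[key], child))
        return ops
    if source == target:
        return []
    return [{"op": "replace", "path": path, "value": target}]


# Hypothetical object before and after a mutation.
before = {"metadata": {"labels": {"app": "web"}}, "spec": {"replicas": 1}}
after = {"metadata": {"labels": {"app": "web", "team": "sre"}}, "spec": {"replicas": 3}}

for op in diff(before, after):
    print(op)
# {'op': 'add', 'path': '/metadata/labels/team', 'value': 'sre'}
# {'op': 'replace', 'path': '/spec/replicas', 'value': 3}
```

A mutating webhook returns a list of operations in this shape, which is why computing the patch from the original and mutated objects is convenient.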
Reducing cardinality load from node_systemd_unit_state
https://medium.com/opsops/reducing-cardinality-load-from-node-systemd-unit-state-8b5ed3be1222
Grafana Loki query acceleration: How we sped up queries without adding resources
https://grafana.com/blog/2024/05/29/grafana-loki-query-acceleration-how-we-sped-up-queries-without-adding-resources
Linux Crisis Tools
https://www.brendangregg.com/blog/2024-03-24/linux-crisis-tools.html
When you have an outage caused by a performance issue, you don't want to lose precious time just to install the tools needed to diagnose it. Here is a list of "crisis tools" I recommend installing on your Linux servers by default (if they aren't already), along with the (Ubuntu) package names that they come from.
KEDA + Kafka: Improve performance by 62.15% at peak loads
https://kedify.io/resources/blog/keda-kafka-improve-performance-by-62-15-at-peak-loads
5 shortcomings of Helm
https://glasskube.eu/en/r/knowledge/5-helm-shortcomings
1. Helm doesn't provide a mechanism for upgrading Custom Resource Definitions
2. Helm dependency management
3. Helm chart creation is not user-friendly
4. The values.yaml file is an anti-pattern
5. Inability to interact with the Kubernetes API