Upgrading Kubernetes - A Practical Guide
https://matduggan.com/upgrading-kubernetes-safely
One common question I see on Mastodon and Reddit is "I've inherited a cluster, how do I safely upgrade it". It's surprising that this still isn't a better understood process given the widespread adoption of k8s, but I've had to take over legacy clusters a few times and figured I would write up some of the tips and tricks I've found over the years to make the process easier.
https://matduggan.com/upgrading-kubernetes-safely
Why We Developed Own Kubernetes Controller to Copy Secrets
https://medium.com/lonto-digital-services-integrator/why-we-developed-own-kubernetes-controller-to-copy-secrets-e46368ae6db9
Kubernetes is a superior platform for deploying and managing applications, but sometimes simple tasks like having the same data — in our case Secrets — between namespaces can cause slight problems and you have to build your own solutions to those problems.
https://medium.com/lonto-digital-services-integrator/why-we-developed-own-kubernetes-controller-to-copy-secrets-e46368ae6db9
Kubernetes YAML tips and tricks
https://itnext.io/kubernetes-yaml-tips-and-tricks-904a2c0b2b81
TL;DR: In this article, you will learn tips and gotchas for writing YAML manifests for Kubernetes resources.
https://itnext.io/kubernetes-yaml-tips-and-tricks-904a2c0b2b81
How to save a fortune with self hosted GitHub runners
https://dev.to/styren/how-to-save-a-fortune-with-self-hosted-github-runners-2m93
https://dev.to/styren/how-to-save-a-fortune-with-self-hosted-github-runners-2m93
Not Just Scale
https://brooker.co.za/blog/2024/06/04/scale.html
It seems like everywhere I look on the internet these days, somebody’s making some form of the following argument:
You don’t need distributed systems! Computers are so fast these days you can serve all your customers off a single machine!
This argument is silly and reductive.
But first, let’s look for the kernel of truth.
One Machine Is All You Need?
https://brooker.co.za/blog/2024/06/04/scale.html
Accessing Kubernetes cluster using SSH tunnel
https://banach.net.pl/posts/2022/accessing-kubernetes-cluster-using-ssh-tunnel
https://banach.net.pl/posts/2022/accessing-kubernetes-cluster-using-ssh-tunnel
eks-node-viewer
https://github.com/awslabs/eks-node-viewer
eks-node-viewer is a tool for visualizing dynamic node usage within a cluster. It was originally developed as an internal tool at AWS for demonstrating consolidation with Karpenter. It displays the scheduled pod resource requests vs the allocatable capacity on the node. It does not look at the actual pod resource usage.
https://github.com/awslabs/eks-node-viewer
kubernetes-learning-path
https://github.com/techiescamp/kubernetes-learning-path
A roadmap to learn Kubernetes from scratch (Beginner to Advanced level)
https://github.com/techiescamp/kubernetes-learning-path
Building an Observability Solution with ClickHouse
Part 1 - Logs: https://clickhouse.com/blog/storing-log-data-in-clickhouse-fluent-bit-vector-open-telemetry
Part 2 - Traces: https://clickhouse.com/blog/storing-traces-and-spans-open-telemetry-in-clickhouse
Part 1 - Logs: https://clickhouse.com/blog/storing-log-data-in-clickhouse-fluent-bit-vector-open-telemetry
Part 2 - Traces: https://clickhouse.com/blog/storing-traces-and-spans-open-telemetry-in-clickhouse
Why didn't you status?
https://hross.substack.com/p/why-didnt-you-status
Demystifying the public status process.
https://hross.substack.com/p/why-didnt-you-status
Trial by Fire: Tales from the SRE Frontlines — Ep2: The Scary ApplicationSet
https://medium.com/adevinta-tech-blog/trial-by-fire-tales-from-the-sre-frontlines-ep2-the-scary-applicationset-ec1a2d491562
https://medium.com/adevinta-tech-blog/trial-by-fire-tales-from-the-sre-frontlines-ep2-the-scary-applicationset-ec1a2d491562
Kubernetes Tip: What Happens To Pods Running On Node That Become Unreachable?
https://medium.com/tailwinds-navigator/kubernetes-tip-what-happens-to-pods-running-on-node-that-become-unreachable-3d409f734e5d
https://medium.com/tailwinds-navigator/kubernetes-tip-what-happens-to-pods-running-on-node-that-become-unreachable-3d409f734e5d
pgmq
https://github.com/tembo-io/pgmq
A lightweight message queue. Like AWS SQS and RSMQ but on Postgres.
https://github.com/tembo-io/pgmq
gitswitch
https://github.com/target-ops/gitswitch
GitSwitch, the ultimate solution for managing multiple Git users across different vendors
https://github.com/target-ops/gitswitch
Anomaly Alerting in Prometheus
https://karlstoney.com/response-time-anomaly-alert
Exploring how to do anomaly alerting on seasonal data in Prometheus.
https://karlstoney.com/response-time-anomaly-alert
overmind-cli
https://github.com/overmindtech/cli
Overmind is a powerful tool for real-time impact analysis on Terraform changes. Overmind can identify the blast radius and uncover potential risks with overmind terrafrom plan before they harm your infrastructure, allowing anyone to make changes with confidence. We also track the impacts of the changes you make with overmind teraform apply, so that you can be sure that your changes haven't had any unexpected downstream impact.
https://github.com/overmindtech/cli
neosync
https://github.com/nucleuscloud/neosync
Open source data anonymization and synthetic data orchestration for developers. Create high fidelity synthetic data and sync it across your environments.
https://github.com/nucleuscloud/neosync