Kubernetes v1.34: Finer-Grained Control Over Container Restarts
https://kubernetes.io/blog/2025/08/29/kubernetes-v1-34-per-container-restart-policy/
With the release of Kubernetes 1.34, a new alpha feature is introduced that gives you more granular control over container restarts within a Pod. This feature, named Container Restart Policy and Rules, allows you to specify a restart policy for each container individually, overriding the Pod's global restart policy. In addition, it also allows you to conditionally restart individual containers based on their exit codes. This feature is available behind the alpha feature gate ContainerRestartRules.
This has been a long-requested feature. Let's dive into how it works and how you can use it.
https://kubernetes.io/blog/2025/08/29/kubernetes-v1-34-per-container-restart-policy/
Understanding the True Cost of a Kubernetes Workload
https://medium.com/life-at-telkomsel/understanding-the-true-cost-of-a-kubernetes-workload-3a81e2b9529b
Trace individual microservice costs by combining Kubernetes metrics, APM, and CUR for granular spending insights
https://medium.com/life-at-telkomsel/understanding-the-true-cost-of-a-kubernetes-workload-3a81e2b9529b
Cloud Cost Optimization: A Senior Engineer’s Guide
https://medium.com/@razkevich8/cloud-cost-optimization-a-senior-engineers-guide-d49ed4606de1
https://medium.com/@razkevich8/cloud-cost-optimization-a-senior-engineers-guide-d49ed4606de1
Battle for Resources or the SSA Path to Kubernetes Diplomacy
https://hackernoon.com/battle-for-resources-or-the-ssa-path-to-kubernetes-diplomacy
https://hackernoon.com/battle-for-resources-or-the-ssa-path-to-kubernetes-diplomacy
SPIFFE & SPIRE: Your Kubernetes Workloads’ Secret Identity Agency
https://medium.com/@mohammedredatarmidi/spiffe-spire-your-kubernetes-workloads-secret-identity-agency-0e8947437871
https://medium.com/@mohammedredatarmidi/spiffe-spire-your-kubernetes-workloads-secret-identity-agency-0e8947437871
Monitoring Kubernetes Cluster with Prometheus and Grafana using ArgoCD
https://jackjapar.com/monitoring-kubernetes-cluster-with-prometheus-and-grafana-using-argocd
https://jackjapar.com/monitoring-kubernetes-cluster-with-prometheus-and-grafana-using-argocd
Failure is inevitable: Learning from a large outage, and building for reliability in depth at Datadog
https://www.datadoghq.com/blog/engineering/rethinking-reliability
https://www.datadoghq.com/blog/engineering/rethinking-reliability
Why we're leaving serverless
https://www.unkey.com/blog/serverless-exit
Every millisecond matters when you're in the critical path of API authentication. After two years of fighting serverless limitations, we rebuilt our entire API stack and slashed the end-to-end latency.
https://www.unkey.com/blog/serverless-exit
Advancing Our Chef Infrastructure: Safety Without Disruption
https://slack.engineering/advancing-our-chef-infrastructure-safety-without-disruption
Building a safer, more reliable path forward for Chef at Slack
https://slack.engineering/advancing-our-chef-infrastructure-safety-without-disruption
Container CPU Requests & Limits Explained with GOMAXPROCS Tuning
https://victoriametrics.com/blog/kubernetes-cpu-go-gomaxprocs
In this article, we’re going to cover a few things that might’ve puzzled you if you’ve been running your applications, especially Go applications, in Kubernetes:
- How Kubernetes and the Linux kernel handle CPU stuff for containers
- What the Go runtime does with CPU, and whether you should bother setting GOMAXPROCS
- Which metrics are actually worth paying attention to
Maybe you’ve seen some of these metrics before while keeping an eye on your applications, but didn’t fully know what to make of them. This should help clear that up.
https://victoriametrics.com/blog/kubernetes-cpu-go-gomaxprocs
5
Running our Docker registry on-prem with Harbor
https://dev.37signals.com/running-our-docker-registry-on-prem-with-harbor
On hosting images without the price tag.
https://dev.37signals.com/running-our-docker-registry-on-prem-with-harbor
fizzy
https://github.com/basecamp/fizzy
This is the source code of Fizzy, the Kanban tracking tool for issues and ideas by 37signals.
https://github.com/basecamp/fizzy
VERT
https://github.com/VERT-sh/VERT
VERT is a file conversion utility that uses WebAssembly to convert files on your device instead of a cloud.
https://github.com/VERT-sh/VERT
1
Victorialogs vs Loki - Benchmarking Results
https://truefoundry.com/blog/victorialogs-vs-loki
TL;DR – After side‑by‑side testing on a 500 GB/7‑day workload, VictoriaLogs cut query latencies by 94 %, shrank storage by ≈40 %, and used < 50 % of the CPU & RAM we previously allocated to Loki. This post explains why we switched.
https://truefoundry.com/blog/victorialogs-vs-loki
What I Really Mean When I Say “Good Communication” in Incident Response
https://uptimelabs.io/articles/good-communication-in-incident-response
“Good communication” is one of those phrases everyone nods along to — until the incident hits, and suddenly comms unravel before your eyes.
So here’s what I actually mean when I say communication matters.
https://uptimelabs.io/articles/good-communication-in-incident-response
The JVM Pause That Wasn't: A War Story
https://dzone.com/articles/the-jvm-pause-that-wasnt-a-war-story
A high-throughput Java service was stalling. The culprit? Stop-the-World GC pauses were blocked by synchronous log writes to a busy disk.
https://dzone.com/articles/the-jvm-pause-that-wasnt-a-war-story
Importance of Graceful Shutdown in Kubernetes
https://dev.to/criteo_tech_community/importance-of-graceful-shutdown-in-kubernetes-2ikb
https://dev.to/criteo_tech_community/importance-of-graceful-shutdown-in-kubernetes-2ikb