NEW BOT Телеграм, страница

Kubernetes v1.34: Finer-Grained Control Over Container Restarts

With the release of Kubernetes 1.34, a new alpha feature is introduced that gives you more granular control over container restarts within a Pod. This feature, named Container Restart Policy and Rules, allows you to specify a restart policy for each container individually, overriding the Pod's global restart policy. In addition, it also allows you to conditionally restart individual containers based on their exit codes. This feature is available behind the alpha feature gate ContainerRestartRules.

This has been a long-requested feature. Let's dive into how it works and how you can use it.

https://kubernetes.io/blog/2025/08/29/kubernetes-v1-34-per-container-restart-policy/

3.58K views15:02

DevOps&SRE Library

Understanding the True Cost of a Kubernetes Workload

Trace individual microservice costs by combining Kubernetes metrics, APM, and CUR for granular spending insights

https://medium.com/life-at-telkomsel/understanding-the-true-cost-of-a-kubernetes-workload-3a81e2b9529b

3.44K views07:02

DevOps&SRE Library

Cloud Cost Optimization: A Senior Engineer’s Guide

https://medium.com/@razkevich8/cloud-cost-optimization-a-senior-engineers-guide-d49ed4606de1

3.16K views15:03

DevOps&SRE Library

Battle for Resources or the SSA Path to Kubernetes Diplomacy

https://hackernoon.com/battle-for-resources-or-the-ssa-path-to-kubernetes-diplomacy

3.81K views07:01

DevOps&SRE Library

SPIFFE & SPIRE: Your Kubernetes Workloads’ Secret Identity Agency

https://medium.com/@mohammedredatarmidi/spiffe-spire-your-kubernetes-workloads-secret-identity-agency-0e8947437871

3.84K views15:02

DevOps&SRE Library

Monitoring Kubernetes Cluster with Prometheus and Grafana using ArgoCD

https://jackjapar.com/monitoring-kubernetes-cluster-with-prometheus-and-grafana-using-argocd

3.83K views07:05

DevOps&SRE Library

Cluster API + Talos + Proxmox = ❤️

https://a-cup-of.coffee/blog/talos-capi-proxmox

3.49K views15:03

DevOps&SRE Library

webdav

A simple and standalone WebDAV server.

https://github.com/hacdias/webdav

3.02K views07:01

DevOps&SRE Library

Failure is inevitable: Learning from a large outage, and building for reliability in depth at Datadog

https://www.datadoghq.com/blog/engineering/rethinking-reliability

3.11K views15:04

DevOps&SRE Library

Why we're leaving serverless

Every millisecond matters when you're in the critical path of API authentication. After two years of fighting serverless limitations, we rebuilt our entire API stack and slashed the end-to-end latency.

https://www.unkey.com/blog/serverless-exit

3.01K views07:01

DevOps&SRE Library

Advancing Our Chef Infrastructure: Safety Without Disruption

Building a safer, more reliable path forward for Chef at Slack

https://slack.engineering/advancing-our-chef-infrastructure-safety-without-disruption

2.91K views15:00

DevOps&SRE Library

Container CPU Requests & Limits Explained with GOMAXPROCS Tuning

In this article, we’re going to cover a few things that might’ve puzzled you if you’ve been running your applications, especially Go applications, in Kubernetes:

- How Kubernetes and the Linux kernel handle CPU stuff for containers
- What the Go runtime does with CPU, and whether you should bother setting GOMAXPROCS
- Which metrics are actually worth paying attention to

Maybe you’ve seen some of these metrics before while keeping an eye on your applications, but didn’t fully know what to make of them. This should help clear that up.

https://victoriametrics.com/blog/kubernetes-cpu-go-gomaxprocs

4.67K views07:04

DevOps&SRE Library

zmx

session persistence for terminal processes

https://github.com/neurosnap/zmx

3.32K views15:04

DevOps&SRE Library

Running our Docker registry on-prem with Harbor

On hosting images without the price tag.

https://dev.37signals.com/running-our-docker-registry-on-prem-with-harbor

2.94K views07:03

DevOps&SRE Library

fizzy

This is the source code of Fizzy, the Kanban tracking tool for issues and ideas by 37signals.

https://github.com/basecamp/fizzy

2.77K views15:03

DevOps&SRE Library

VERT

VERT is a file conversion utility that uses WebAssembly to convert files on your device instead of a cloud.

https://github.com/VERT-sh/VERT

2.9K views07:02

DevOps&SRE Library

Victorialogs vs Loki - Benchmarking Results

TL;DR – After side‑by‑side testing on a 500 GB/7‑day workload, VictoriaLogs cut query latencies by 94 %, shrank storage by ≈40 %, and used < 50 % of the CPU & RAM we previously allocated to Loki. This post explains why we switched.

https://truefoundry.com/blog/victorialogs-vs-loki

3.5K views15:04

DevOps&SRE Library

What I Really Mean When I Say “Good Communication” in Incident Response

“Good communication” is one of those phrases everyone nods along to — until the incident hits, and suddenly comms unravel before your eyes.

So here’s what I actually mean when I say communication matters.

https://uptimelabs.io/articles/good-communication-in-incident-response

3.85K views07:03

DevOps&SRE Library

The JVM Pause That Wasn't: A War Story

A high-throughput Java service was stalling. The culprit? Stop-the-World GC pauses were blocked by synchronous log writes to a busy disk.

https://dzone.com/articles/the-jvm-pause-that-wasnt-a-war-story

3.92K views15:01

DevOps&SRE Library

Kubernetes Informers are so easy... to misuse!

https://render.com/blog/kubernetes-informers

3.9K views07:04

DevOps&SRE Library

Importance of Graceful Shutdown in Kubernetes

https://dev.to/criteo_tech_community/importance-of-graceful-shutdown-in-kubernetes-2ikb

3.68K views15:05

About

Blog

Apps

Platform