This comprehensive guide details the process of setting up a high-availability k3s Kubernetes cluster. It uses keepalived for a virtual IP, a Galera cluster for the database, and Longhorn for distributed block storage to ensure no single point of failure.
https://raymii.org/s/tutorials/High_Available_k3s_kubernetes_cluster_with_keepalived_galera_and_longhorn.html
⚠️ Ingress Nginx will be retired!
To prioritize the safety and security of the ecosystem, Kubernetes SIG Network and the Security Response Committee are announcing the upcoming retirement of Ingress NGINX 🪦
https://kubernetes.io/blog/2025/11/11/ingress-nginx-retirement/
Kubernetes
Ingress NGINX Retirement: What You Need to Know
To prioritize the safety and security of the ecosystem, Kubernetes SIG Network and the Security Response Committee are announcing the upcoming retirement of Ingress NGINX. Best-effort maintenance will continue until March 2026. Afterward, there will be no…
This post by OpenSauced explains how they use Kubernetes Jobs to run OpenSSF Scorecard checks at scale. The system is designed to assess the security posture of nearly any public repository on GitHub.
https://dev.to/opensauced/how-we-use-kubernetes-jobs-to-scale-openssf-scorecard-5bf2
DEV Community
How we use Kubernetes jobs to scale OpenSSF Scorecard
We recently released integrations with the OpenSSF Scorecard on the OpenSauced platform. The OpenSSF...
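To make the Jobs pattern concrete, here is a minimal sketch of fanning out one Job manifest per repository. It is not OpenSauced's actual implementation: the helper name `scorecard_job`, the `backoffLimit` and `ttlSecondsAfterFinished` values, and the container image and arguments are assumptions for illustration, so check the Scorecard documentation before relying on them.

```python
import json
import re


def scorecard_job(repo: str) -> dict:
    """Build a batch/v1 Job manifest that scans one GitHub repository.

    `repo` is an "owner/name" string. This helper is hypothetical.
    """
    # Keep the name a valid DNS label: lowercase alphanumerics and '-',
    # at most 63 characters.
    slug = re.sub(r"[^a-z0-9]+", "-", repo.lower()).strip("-")
    name = f"scorecard-{slug}"[:63].rstrip("-")
    return {
        "apiVersion": "batch/v1",
        "kind": "Job",
        "metadata": {"name": name},
        "spec": {
            "backoffLimit": 2,               # assumed retry budget
            "ttlSecondsAfterFinished": 600,  # assumed cleanup delay
            "template": {
                "spec": {
                    "restartPolicy": "Never",
                    "containers": [
                        {
                            "name": "scorecard",
                            # Image and flags are assumptions, not taken from the post.
                            "image": "gcr.io/openssf/scorecard:stable",
                            "args": [f"--repo=github.com/{repo}", "--format=json"],
                        }
                    ],
                }
            },
        },
    }


if __name__ == "__main__":
    # One manifest per repository; each could be piped to `kubectl apply -f -`.
    for repo in ["kubernetes/kubernetes", "fluxcd/flux2"]:
        print(json.dumps(scorecard_job(repo)))
```

Because each repository gets its own Job, a failed scan retries independently and finished Jobs are cleaned up by the TTL setting.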
This post offers an exploration of the fundamental traffic management capabilities within the Istio service mesh. Author Albert Riu covers core concepts like DestinationRules, VirtualServices, and Gateways to control and route traffic in a microservices architecture.
https://medium.com/@arivermar/exploring-the-basics-of-istio-traffic-management-cee13f0817c2
Medium
Exploring the basics of Istio traffic management
Exploring the basics of Istio traffic management In microservices architecture, the concept of a service mesh has already become firmly established. A service mesh could be defined as an …
Ingress NGINX will be retired, so it is time to choose a Gateway API implementation.
Gateway API Benchmarks provides a common set of tests to evaluate a Gateway API implementation.
https://github.com/howardjohn/gateway-api-bench
GitHub
GitHub - howardjohn/gateway-api-bench: Gateway API Benchmarks provides a common set of tests to evaluate a Gateway API implementation.
Gateway API Benchmarks provides a common set of tests to evaluate a Gateway API implementation. - howardjohn/gateway-api-bench
This post from Causely.ai provides practical tips for dealing with the Out of Memory (OOM) Killer in Kubernetes environments. It covers how to diagnose OOMKilled events and configure resource requests and limits effectively to prevent them.
https://www.causely.ai/blog/kubernetes-oom-killer-tips
www.causely.ai
Preventing Out-of-Memory (OOM) Kills in Kubernetes: Tips for Optimizing Container Memory Management
Running containerized applications at scale with Kubernetes demands careful resource management. One very complicated but common challenge is preventing Out-of-Memory (OOM) kills, which occur when a container’s memory consumption surpasses its allocated limit.…
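As a rough illustration of the diagnosis step, the sketch below scans a pod object, in the shape returned by `kubectl get pod <name> -o json`, for containers whose current or previous termination reason is `OOMKilled`. The function name `oom_killed_containers` and the sample pod are made up for this example and do not come from the Causely post.

```python
def oom_killed_containers(pod: dict) -> list[str]:
    """Return names of containers whose current or last termination was an OOM kill."""
    names = []
    for status in pod.get("status", {}).get("containerStatuses", []):
        # A container can be terminated right now ("state") or have been
        # restarted after an earlier termination ("lastState").
        for key in ("state", "lastState"):
            terminated = (status.get(key) or {}).get("terminated") or {}
            if terminated.get("reason") == "OOMKilled":
                names.append(status["name"])
                break
    return names


# Hypothetical pod status, trimmed to the fields the function reads.
sample_pod = {
    "status": {
        "containerStatuses": [
            {
                "name": "app",
                "restartCount": 3,
                "state": {"running": {}},
                "lastState": {"terminated": {"reason": "OOMKilled", "exitCode": 137}},
            },
            {
                "name": "sidecar",
                "restartCount": 0,
                "state": {"running": {}},
                "lastState": {},
            },
        ]
    }
}

if __name__ == "__main__":
    print(oom_killed_containers(sample_pod))
```

A container that shows up here repeatedly is a candidate for a higher memory limit or a closer look at its actual memory usage.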
In this piece, Lawrence Jones argues that synthetic experiments only go so far with complex systems, and describes adding latency in production to test a change for real.
https://blog.lawrencejones.dev/latency/
blog.lawrencejones.dev
Adding latency: one step, two step, oops
When it comes to complex systems, you can only go so far with synthetic experiments before you need to try something for real, and test in production. There's no substitute for it, and you're likely making the wrong decision if you avoid it. But I can say…
Cloudflare outage report
https://blog.cloudflare.com/18-november-2025-outage/
The Cloudflare Blog
Cloudflare outage on November 18, 2025
Cloudflare suffered a service outage on November 18, 2025. The outage was triggered by a bug in generation logic for a Bot Management feature file causing many Cloudflare services to be affected.
The Adore Me tech team shares their journey and best practices for mastering GitOps with Flux in this detailed publication. They discuss their implementation strategies, challenges overcome, and the benefits of adopting a fully declarative approach to continuous delivery.
https://adoreme.tech/mastering-gitops-with-flux-adoreme-024b56ac397b
Medium
Mastering GitOps with Flux @AdoreMe
At Adore Me, we’ve harnessed the power of GitOps and Flux to streamline our complex infrastructure. This approach ensures consistency…
This article from USENIX's ";login:" magazine explains how Google's SRE teams are adopting the STAMP (System-Theoretic Accident Model and Processes) framework. The shift moves the focus from preventing individual component failures to managing complex system interactions for improved reliability.
https://www.usenix.org/publications/loginonline/evolution-sre-google
USENIX
The Evolution of SRE at Google
This blog post is a guide to implementing SMART-on-FHIR authentication for AWS HealthLake using Terraform. It walks through the necessary configurations for HealthLake, Cognito, and Lambda to create a secure healthcare application.
https://medium.com/@kczpl/how-to-implement-smart-on-fhir-with-aws-healthlake-using-terraform-130389a1c0b8
Medium
How to implement SMART-on-FHIR with AWS HealthLake using Terraform
In this guide, we’ll walk through implementing SMART-on-FHIR authentication with AWS HealthLake using Terraform, making the process clear…