DevOps & SRE notes – Telegram
DevOps & SRE notes
12K subscribers
38 photos
19 files
2.5K links
Helpfull articles and tools for DevOps&SRE

WhatsApp: https://whatsapp.com/channel/0029Vb79nmmHVvTUnc4tfp2F

For paid consultation (RU/EN), contact: @tutunak


All ways to support https://telegra.ph/How-support-the-channel-02-19
Download Telegram
🤣9🔥5
Author dotdc presents Terraflow, a CI/CD orchestrator designed to scale Terraform operations effectively. This report outlines the creation of the tool and how it helps manage complex infrastructure deployments.
https://medium.com/@dotdc/creating-terraflow-a-ci-cd-orchestrator-to-scale-terraform-3965b3f8931f
👍1
This analysis provides a deep dive into writing policies for Kubernetes clusters using OPA Gatekeeper. The Permify Tech Blog explains how to enforce custom rules and maintain security and compliance in a cloud-native environment.
https://medium.com/permify-tech-blog/opa-gatekeeper-how-to-write-policies-for-kubernetes-clusters-bb660666eb19
1👍1
AWS just released their postmortem (link in comment) for the October DynamoDB outage. It's thorough, technically detailed, and explains exactly what broke and how they'll "prevent" it from happening again. But this PR-approved, sanitized narrative tells us only what happened to the technology, nothing else.

https://aws.amazon.com/message/101925/
2👍2
Marc Christian P. Gregorio offers a practical commentary on automating centralized NAT Gateways in AWS across multiple VPCs and regions using Terraform. The solution aims to optimize costs and simplify network management for large-scale deployments.
https://medium.com/@marcchristianp.gregorio/automating-centralized-nat-gateways-in-aws-vpcs-and-region-with-terraform-69a6f90d60da
👍31
Elliot Graebert proposes an impact-based leveling system for engineering organizations as an alternative to traditional career ladders. This treatise discusses how focusing on impact can foster a more motivated and effective engineering culture.
https://medium.com/@elliotgraebert/an-impact-based-level-system-for-engineering-organizations-2e0f9bee20e6
👍21
This article from JP Gouin provides a deep dive into implementing GitOps at scale, with a specific focus on the cluster bootstrapping process. It covers the challenges and solutions for managing numerous Kubernetes clusters efficiently and declaratively.
https://medium.com/@jp-gouin/gitops-at-scale-clusters-bootstrapping-f36695d4340d
2
This edition of the Scalable Thread newsletter breaks down effective strategies for handling sudden and unexpected bursts of traffic to your systems. It explores architectural patterns and techniques to ensure reliability and prevent service degradation during traffic spikes.
https://newsletter.scalablethread.com/p/how-to-handle-sudden-bursts-of-traffic
👍3
This write-up from Prezi Engineering explains how multi-AZ deployments can lead to surprisingly high data transfer costs. It documents their journey of migrating from a costly self-hosted Prometheus setup to a more efficient monitoring solution to save on their cloud budget.
https://engineering.prezi.com/how-using-availability-zones-can-eat-up-your-budget-our-journey-from-prometheus-to-be8a816f7efe
👍1
Author Yasin Taha Erol provides a practical guide on migrating from Kubernetes' native Horizontal Pod Autoscaler (HPA) to KEDA. The text highlights the benefits of KEDA's event-driven scaling and walks through the steps for a smooth transition.
https://yasintahaerol.medium.com/migrating-hpa-to-keda-13e946ee29ee
👍2