DevOps & SRE notes – Telegram
Helpful articles and tools for DevOps & SRE

WhatsApp: https://whatsapp.com/channel/0029Vb79nmmHVvTUnc4tfp2F

For paid consultation (RU/EN), contact: @tutunak


All ways to support the channel: https://telegra.ph/How-support-the-channel-02-19
Automating secrets rotation is crucial for maintaining security in dynamic environments. This article by Canela Severo explains how to achieve true secrets auto-rotation using External Secrets Operator (ESO) and HashiCorp Vault. Discover the steps to set up and integrate these tools to ensure your secrets are rotated securely and efficiently, enhancing your overall security posture.

https://dev.to/canelasevero/true-secrets-auto-rotation-with-eso-and-vault-1g4o
Upgrading Kubernetes clusters can be a complex and costly endeavor, especially at scale. This article from Qovery explores the challenges and expenses involved in upgrading hundreds of Kubernetes clusters. Learn about the strategies, tools, and best practices to manage and minimize the costs while ensuring smooth and efficient upgrades.

https://www.qovery.com/blog/the-cost-of-upgrading-hundreds-of-kubernetes-clusters/
Advanced secret management is essential for securing sensitive data in Kubernetes environments. This article from Ediri.io explores the integration of Pulumi with the Secrets Store CSI Driver to enhance secret management on Kubernetes. Learn how to securely manage and automate secrets, leveraging Pulumi's infrastructure as code capabilities along with Kubernetes-native tools.


https://blog.ediri.io/advanced-secret-management-on-kubernetes-with-pulumi-secrets-store-csi-driver
Over two decades of Site Reliability Engineering (SRE) practice have provided valuable insights into maintaining reliable and scalable systems. This article from Google delves into the lessons learned from 20 years of SRE, sharing key practices, challenges, and strategies that have shaped the discipline. Discover the principles that continue to drive successful SRE teams and how they can be applied to modern engineering practices.

https://sre.google/resources/practices-and-processes/twenty-years-of-sre-lessons-learned/
Handling kernel panics in Kubernetes environments is critical for maintaining system stability. This article from the Netflix Tech Blog explores the challenges and solutions related to kernel panics in Kubernetes clusters. Learn about the techniques and tools Netflix uses to diagnose, mitigate, and prevent kernel panics, ensuring the resilience and reliability of their infrastructure.

https://netflixtechblog.com/kubernetes-and-kernel-panics-ed620b9c6225
Effective data caching is essential for improving application performance and scalability. This article from RevenueCat delves into their approach to data caching, exploring the strategies and technologies used to optimize data retrieval and reduce latency. Learn how RevenueCat enhances user experience and system efficiency through thoughtful caching practices.

https://www.revenuecat.com/blog/engineering/data-caching-revenuecat/
Supporting over a million users online at once in a single Discord server (a community, not a single machine) required Discord's engineering team to push past the platform's previous limits. This article from the Discord blog details the challenges, innovations, and technical strategies involved, offering insights into the architecture and optimizations that keep communication working at that scale.

https://discord.com/blog/maxjourney-pushing-discords-limits-with-a-million-plus-online-users-in-a-single-server
Implementing request-level authentication and authorization is vital for securing microservices in Kubernetes. This article from InfraCloud discusses how to integrate Istio with Keycloak to achieve fine-grained access control. Learn how these tools work together to enhance the security of your Kubernetes environment by managing authentication and authorization at the request level.

https://www.infracloud.io/blogs/request-level-authentication-authorization-istio-keycloak/
Optimizing Terraform workflows is essential for efficient infrastructure management. This article by Bejarano.io introduces "Terraform Plan Light," a tool designed to speed up the Terraform plan process by reducing unnecessary overhead. Learn how to integrate this tool into your workflow to improve performance and streamline your Terraform operations.

https://www.bejarano.io/terraform-plan-light
Mastering Terraform modules is key to managing complex Azure infrastructure efficiently. In this final part of the series, IAMACHS delves into advanced techniques for using Terraform modules on Azure. Explore best practices, real-world examples, and tips for organizing and reusing infrastructure code, ensuring scalable and maintainable deployments.

https://www.iamachs.com/p/azure-terraform/part-7-modules-grand-finale/