DevOps & SRE notes – Telegram
DevOps & SRE notes
12K subscribers
38 photos
19 files
2.5K links
Helpfull articles and tools for DevOps&SRE

WhatsApp: https://whatsapp.com/channel/0029Vb79nmmHVvTUnc4tfp2F

For paid consultation (RU/EN), contact: @tutunak


All ways to support https://telegra.ph/How-support-the-channel-02-19
Download Telegram
This exploration delves into managing AWS Systems Manager (SSM) Parameters using Terraform while accommodating external updates. It presents a solution to the challenge of maintaining SSM Parameters that are modified by other processes, ensuring that Terraform-managed infrastructure remains tracked and in sync with real-world changes

https://www.proactiveops.io/archive/managing-aws-ssm-parameters-with-terraform-with/
💯2
This comprehensive article delves into five distinct solutions for cross-cluster communication in Kubernetes multi-cluster environments, comparing their principles, advantages, and limitations. It covers underlay networks, overlay CNIs, Submariner, Skupper, and KubeSlice, providing insights into their unique approaches to addressing the challenges of inter-cluster connectivity and communication

https://oilbeater.com/en/2024/05/24/five-kubernetes-multicluster-network/
This comprehensive guide of LocalStack and Terraform for efficient local development of AWS-based applications. It demonstrates how to set up a local environment that emulates AWS services, enabling developers to work on Spring Boot and React applications without relying on actual AWS resources during initial stages

https://www.srvrlss.io/blog/localstack-terraform/
👍1
The blogpost presents a solution for managing microservices in Kubernetes environments using a unified Helm chart approach. It addresses the challenges of maintaining consistent configurations across multiple services by centralizing templates and allowing for customization through dependencies and overrides

https://dev.to/calinflorescu/streamlining-microservices-management-a-unified-helm-chart-approach-59g7
👍4
This author is using the AWS Cloud Control (AWSCC) provider in Terraform to manage AWS resources that may not be available in the standard AWS provider. It likely explains how AWSCC can help fill gaps in Terraform's AWS resource coverage, potentially improving infrastructure management for AWS users.


https://cloudonaut.io/are-you-missing-an-aws-resource-in-terraform-try-awscc-provider
Importance of integrating action items into incident reviews, arguing that learning and identifying improvements are inseparable aspects of the process. The author emphasizes that discussing potential actions during reviews can enhance understanding and lead to more effective system improvements, challenging the notion that focusing on actions detracts from learning.

https://incident.io/blog/why-i-like-discussing-actions-items-in-incident-reviews
This blogpost discusses strategies for building resilient applications on Kubernetes, emphasizing the importance of proper configuration to harness the platform's dynamic nature. It covers key topics such as configuring health probes, handling pod termination gracefully, and implementing pod distribution strategies to improve application stability and reduce downtime.

https://jaadds.medium.com/building-resilient-applications-on-kubernetes-9e9e4edb4d33
👍3
This blog post argues against discussing action items during incident reviews, emphasizing that these meetings should focus on learning and understanding system behavior. The author contends that dedicating time to action items reduces opportunities for valuable insights, as incident reviews offer a unique chance for diverse team members to explore the intricacies of complex socio-technical systems

https://surfingcomplexity.blog/2024/09/28/why-i-dont-like-discussing-action-items-during-incident-reviews/
👍3
This guide provides instructions on constructing an AI agent for Site Reliability Engineering (SRE). It offers insights into leveraging artificial intelligence to enhance operational efficiency and reliability in software systems.

https://www.aptible.ai/guides/how-to-build-an-ai-agent-for-sre
2👍2
This blog post discusses Gitpod's decision to move away from Kubernetes for hosting cloud development environments. It details the challenges they faced in using Kubernetes for this purpose, including resource management, security, and operational complexities

https://www.gitpod.io/blog/we-are-leaving-kubernetes
👍8