DevOps & SRE notes – Telegram
DevOps & SRE notes
12K subscribers
40 photos
19 files
2.5K links
Helpful articles and tools for DevOps&SRE

WhatsApp: https://whatsapp.com/channel/0029Vb79nmmHVvTUnc4tfp2F

For paid consultation (RU/EN), contact: @tutunak


All ways to support https://telegra.ph/How-support-the-channel-02-19
Download Telegram
The article delves into creating a serverless newsletter system using Amazon Web Services (AWS) and Amazon Simple Email Service (SES). The guide covers the entire process, from setting up SES to handle emails, ensuring email authenticity with DKIM, and managing bounces and complaints, to constructing a serverless infrastructure that includes AWS Lambda and DynamoDB for efficient email operations. The author emphasizes scalability, cost-effectiveness, and the technical challenges encountered during the setup.
https://cloudnature.net/blog/building-a-serverless-newsletter-your-guide-to-aws-and-amazon-ses
👍4
The blog post guides you through using the Grafana Operator for managing Grafana Cloud stacks within Kubernetes. It covers the setup process using Helm for quick installation, and details how to connect to Grafana Cloud, configure resources like folders, dashboards, and alert rules. The post emphasizes the benefits of integrating Grafana Cloud with Kubernetes, making configuration management streamlined and efficient.
https://grafana.com/blog/2024/04/24/how-to-use-the-grafana-operator-managing-a-grafana-cloud-stack-in-kubernetes/
👍52
In this blog post, the engineering team at Slack shares an insightful narrative about their extensive migration from AWS's IMDSv1 to IMDSv2, detailing both the challenges and solutions they encountered along the way. The migration aimed to enhance security by leveraging IMDSv2's session authentication for managing about 60,000 EC2 instances. The post elaborates on the technical strategies, tools, and collaborative efforts that ensured a smooth transition, reflecting Slack's commitment to maintaining robust security standards in their cloud infrastructure.
https://slack.engineering/our-journey-migrating-to-aws-imdsv2/
👍4
Discover the innovative strategies to implement A/B testing across a platform serving over 50 million users, using AWS Lambda@Edge. This post dives deep into the technical complexities and solutions involved in executing split A/B tests efficiently, ensuring minimal impact on performance while enhancing user experience. Learn about the challenges and insights from a large-scale deployment that combines technology with practical business applications.
https://rahulsuresh.net/blog/ab-testing-millions-of-users-using-aws-lambda-edge
👍2
Explore the journey of Canva's engineering team as they detail their experiences and solutions in scaling a content usage counting service to handle billions of transactions. This post delves into the evolution of their architecture from MySQL to modern OLAP databases, emphasizing improvements in scalability, accuracy, and operability. Discover the technical hurdles and innovative strategies that helped Canva support its growing needs and ensure accurate content creator compensation.
https://www.canva.dev/blog/engineering/scaling-to-count-billions/
👍31
In the world of infrastructure as code (IaC), ensuring the security and efficiency of your configurations is paramount. This blog post on provides a comprehensive overview of Terraform static code analysis tools, exploring their essential features and offering detailed comparisons. Whether you're a seasoned DevOps professional or just starting out, this deep dive will equip you with the knowledge needed to choose the right tools for maintaining high-quality IaC environments. Dive into the intricacies of these tools and learn how to enhance your Terraform workflows by visiting our full article.
https://devdosvid.blog/2024/04/16/a-deep-dive-into-terraform-static-code-analysis-tools-features-and-comparisons/
👍4
The article from Cloudflare's blog discusses the importance of alert observability in reducing on-call burnout. It highlights how Cloudflare has developed various dashboards to monitor alerts effectively, which helps in managing on-call responsibilities more efficiently and minimizes interruptions. The use of dashboards allows teams to quickly analyze and respond to alerts, identify frequently firing alerts, and make necessary adjustments. This approach not only streamlines operations but also significantly enhances the experience for on-call personnel by providing clear visibility and tools to manage alerts proactively.
https://blog.cloudflare.com/alerts-observability
👍3
The article on "What Is Amazon Resource Name (ARN)?" from EverythingDevOps discusses the significance and structure of ARNs in AWS. ARNs are unique identifiers used to specify resources across AWS, ensuring clear identification irrespective of the region or account. The article covers the structure of ARNs, their importance in resource-level permissions, cross-service interactions, and API operations. It also highlights how ARNs facilitate auditing, logging, and resource tagging. Finally, it provides examples of ARNs and discusses best practices for their usage.
https://everythingdevops.dev/what-is-amazon-resource-name-arn/
👍3
The article provides a comprehensive guide on Kubernetes sidecar containers, explaining their function, use cases, and implementation. Sidecar containers run alongside main application containers within a pod, sharing the same lifecycle and resources, enhancing the main container's capabilities in areas like logging, monitoring, and network communications. The guide details practical examples, discusses native sidecar support in Kubernetes v1.28, and offers best practices, such as applying the single responsibility principle and managing resource limits effectively.
https://komodor.com/learn/kubernetes-sidecar-containers-practical-guide-with-examples/
👍4