DevOps & SRE notes – Telegram
Helpful articles and tools for DevOps & SRE

WhatsApp: https://whatsapp.com/channel/0029Vb79nmmHVvTUnc4tfp2F

For paid consultation (RU/EN), contact: @tutunak


All ways to support https://telegra.ph/How-support-the-channel-02-19
https://blog.palark.com/prometheus-architecture-tsdb/

Prometheus is a powerful monitoring tool, and understanding its architecture, particularly the Time Series Database (TSDB), is key to effectively managing and querying metrics. This article from Palark delves into the architecture of Prometheus with a focus on TSDB, explaining how it stores and retrieves time-series data. Learn about the inner workings of Prometheus to optimize your monitoring setup and gain deeper insights into your system’s performance.
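As a rough illustration of the storage idea mentioned above, here is a toy sketch, not Prometheus's actual TSDB: it only assumes the well-known data model in which a series is identified by its label set and holds timestamped float samples. The class name `ToySeriesStore` and its methods are hypothetical.

```python
from collections import defaultdict


class ToySeriesStore:
    """Toy in-memory store: one list of (timestamp_ms, value) samples per label set."""

    def __init__(self):
        # Maps a frozen label set to its samples, in append order.
        self._series = defaultdict(list)

    def append(self, labels, timestamp_ms, value):
        # A series is identified by its full label set, metric name included.
        key = frozenset(labels.items())
        self._series[key].append((timestamp_ms, float(value)))

    def select(self, matchers, start_ms, end_ms):
        # Return samples from every series whose labels contain all
        # equality matchers, limited to the inclusive time range.
        wanted = set(matchers.items())
        result = {}
        for key, samples in self._series.items():
            if wanted <= key:
                in_range = [(t, v) for t, v in samples if start_ms <= t <= end_ms]
                if in_range:
                    result[key] = in_range
        return result


store = ToySeriesStore()
store.append({"__name__": "http_requests_total", "job": "api"}, 1000, 1)
store.append({"__name__": "http_requests_total", "job": "api"}, 2000, 3)
store.append({"__name__": "http_requests_total", "job": "web"}, 1500, 7)
api_samples = store.select({"job": "api"}, 0, 1500)
```

The real TSDB adds an index, compression, a write-ahead log, and on-disk blocks; the article covers those details.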
Returning to Terraform after a break can be both refreshing and challenging. This article from Ordina JWorks revisits Terraform, exploring the latest features, best practices, and updates in the infrastructure as code landscape. Whether you're reacquainting yourself with Terraform or looking to enhance your existing knowledge, this guide provides valuable insights to help you get back on track with efficient and scalable infrastructure management.

https://blog.ordina-jworks.io/cloud/2023/06/05/back-to-terraform.html
Out-Of-Memory (OOM) issues can be a common challenge in Kubernetes environments, and understanding the OOM Killer is crucial for maintaining system stability. This article from Cloud Native Daily demystifies the OOM Killer in Kubernetes, explaining how it operates and how to track down memory issues. Learn the best practices for diagnosing and preventing memory-related problems in your Kubernetes clusters.

https://medium.com/cloud-native-daily/noscript-demystifying-oom-killer-in-kubernetes-tracking-down-memory-issues-b5a4973fbd56
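One small aid when tracking down memory kills: a container stopped by the OOM Killer commonly reports exit code 137, which is 128 plus signal 9 (SIGKILL). The sketch below is not taken from the article, and the helper name `describe_exit_code` is hypothetical. It decodes an exit code into the signal behind it. Exit code 137 alone does not prove an OOM kill, since any SIGKILL produces it, so check the container's termination reason as well.

```python
import signal


def describe_exit_code(exit_code: int) -> str:
    """Explain a container exit code using the 128 + signal-number convention."""
    if exit_code > 128:
        signum = exit_code - 128
        try:
            name = signal.Signals(signum).name
        except ValueError:
            # Not a signal number known on this platform.
            name = f"signal {signum}"
        return f"terminated by {name}"
    return f"exited with status {exit_code}"


print(describe_exit_code(137))
print(describe_exit_code(0))
```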
Monitoring AWS Lambda functions is essential for ensuring their performance and reliability. This comprehensive guide from AWS Plain English covers everything you need to know about AWS Lambda monitoring. Explore the tools, metrics, and best practices for tracking the health and efficiency of your serverless applications, enabling you to detect issues early and optimize performance.

https://aws.plainenglish.io/aws-lambda-monitoring-a-full-guide-3cc68c6052fd
Securing Kubernetes workloads requires a comprehensive approach, and using signed and encrypted container images is an essential part of that. This article from ITNEXT provides a practical guide on how to implement signed and encrypted container images to enhance security in Kubernetes environments. Learn how to protect your workloads by ensuring image authenticity, preventing tampering, and encrypting sensitive data within your container images.

https://itnext.io/securing-kubernetes-workloads-a-practical-approach-to-signed-and-encrypted-container-images-ff6e98b65bcd
Running Prometheus at scale presents unique challenges, especially in large and complex environments. This article from Cloudflare explains how they manage Prometheus at scale, covering the architecture, optimizations, and strategies that enable efficient monitoring across their global infrastructure. Discover the techniques Cloudflare uses to ensure reliable, high-performance metrics collection and monitoring in their vast network.

https://blog.cloudflare.com/how-cloudflare-runs-prometheus-at-scale/
Securing Kubernetes communications with SSL/TLS is essential for protecting data and ensuring encrypted connections. This article from Semaphore CI explores the process of configuring SSL/TLS in Kubernetes, detailing the steps for setting up secure communication between services. Learn best practices for managing certificates, configuring Ingress, and ensuring your Kubernetes cluster is secure.

https://semaphoreci.com/blog/kubernetes-ssl-tls
Managing multi-account deployments with Terraform can be complex, but following the right patterns can simplify the process. This article from AWS Tip outlines effective strategies and patterns for handling Terraform deployments across multiple AWS accounts. Learn how to structure your Terraform configurations, manage shared resources, and ensure secure and scalable deployments in multi-account environments.

https://awstip.com/patterns-for-terraform-multi-account-deployments-f47d77d6f250
Managing Prometheus alerts in Kubernetes at scale can be challenging, but using GitOps practices can streamline this process. This article from Faun explores how to manage Prometheus alerts in large-scale Kubernetes environments using a GitOps approach. Learn how to automate, version control, and maintain consistency in your alerting configurations, ensuring efficient and reliable monitoring across your clusters.

https://faun.pub/managing-prometheus-alerts-in-kubernetes-at-scale-using-gitops-25d0ab4a2e2d
Managing authentication across multiple AWS accounts in Terraform can be complex, but understanding the right techniques is crucial for secure and efficient operations. This article by Hector Reyes Alemán provides a comprehensive guide on using the Terraform AWS provider for multi-account authentication. Learn about the best practices, tools, and configurations needed to manage authentication seamlessly across different AWS accounts in your Terraform projects.

https://hector-reyesaleman.medium.com/terraform-aws-provider-everything-you-need-to-know-about-multi-account-authentication-and-f2343a4afd4b