DevOps & SRE notes – Telegram
DevOps & SRE notes
12K subscribers
38 photos
19 files
2.5K links
Helpfull articles and tools for DevOps&SRE

WhatsApp: https://whatsapp.com/channel/0029Vb79nmmHVvTUnc4tfp2F

For paid consultation (RU/EN), contact: @tutunak


All ways to support https://telegra.ph/How-support-the-channel-02-19
Download Telegram
A Service Level Agreement (SLA) is a foundational element in service-based industries, defining the performance standards and reliability expectations between a service provider and its clients. This article breaks down the essentials of SLAs, explaining their purpose, key components, and how they help manage customer expectations. By understanding SLAs, both providers and clients can foster transparency and accountability in service delivery.

https://uptimerobot.com/blog/what-is-an-sla/
👍5
Turning incidents into learning opportunities is a powerful way to build resilient systems and teams. This article explores three strategies for leveraging incidents as a catalyst for improvement, focusing on reflection, analysis, and proactive change. By fostering a culture of learning, teams can transform setbacks into valuable insights, enhancing both technical capabilities and team collaboration.

https://thenewstack.io/3-strategies-to-turn-incidents-into-learning-opportunities/
👍4
Certificate pinning was once a standard security measure to prevent man-in-the-middle attacks by associating a host with a specific TLS certificate or public key.However, the practice has become outdated and can lead to significant issues, such as service outages during certificate rotations or renewals.Modern security standards, including shorter certificate lifetimes, regular rotations of intermediate certificates, and increased use of certificate transparency logs, have addressed the original concerns that certificate pinning aimed to mitigate.These advancements offer enhanced security without the complexities and risks associated with pinning.Organizations are encouraged to adopt these contemporary practices to maintain robust security while ensuring operational stability.

https://blog.cloudflare.com/why-certificate-pinning-is-outdated/
👍3
As APIs become integral to modern applications, managing their consumption strategically has become essential. This article examines why traditional monitoring alone is no longer sufficient, advocating for a proactive approach to API consumption management. By focusing on usage patterns, rate limits, and dependency mapping, it highlights ways to ensure API reliability and optimize performance, helping organizations better control their API ecosystems.

https://thenewstack.io/beyond-monitoring-the-urgent-need-for-strategic-api-consumption-management/
👍71
The Kubernetes scheduler plays a crucial role in ensuring that workloads are placed on the appropriate nodes for optimal performance. This article from Awelm dives into the inner workings of the Kubernetes scheduler, explaining how it decides where to run pods and how to optimize scheduling for better resource utilization. Learn about the different scheduling algorithms, configurations, and strategies to improve the efficiency and performance of your Kubernetes clusters.

https://www.awelm.com/posts/kube-scheduler/
👍7
The blogpost explores the complex decision-making process behind adopting a microservices architecture, a monolithic service architecture, or a hybrid approach in engineering organizations. It delves into the challenges and considerations faced by a theoretical company, Theoretical Compliance Company, as it navigates the evolving landscape of service architecture strategies in the context of changing industry trends and business constraints

https://lethain.com/decompose-monolith-strategy/
👍3
Helm Distribution plugin is is a set of utilities and Helm Plugin for making offline work with Helm Charts easier. It is meant to be used for creating reproducible and relocatable packages for Helm Charts that can be moved around registries without hassles. This is particularly useful for distributing Helm Charts into airgapped environments.

https://github.com/vmware-labs/distribution-tooling-for-helm
👍5
Explore how to enhance AWS RDS security with fine-grained access control using IAM authentication, Terraform, and serverless solutions.

https://alantai.hashnode.dev/securing-aws-rds-with-fine-grained-access-control-using-iam-authentication-terraform-and-serverless
👍1
Learn strategies to handle Out-Of-Memory (OOM) issues in Kubernetes for Java applications to ensure smoother operations.
https://yonahdissen.medium.com/surviving-oom-in-kubernetes-java-applications-fd1fb1a65f02
👍6
Discover how Monzo manages seamless migrations across 2,800 microservices to maintain efficiency and stability.
https://monzo.com/blog/how-we-run-migrations-across-2800-microservices
👍7