DevOps & SRE notes – Telegram
DevOps & SRE notes
12K subscribers
40 photos
19 files
2.5K links
Helpful articles and tools for DevOps&SRE

WhatsApp: https://whatsapp.com/channel/0029Vb79nmmHVvTUnc4tfp2F

For paid consultation (RU/EN), contact: @tutunak


All ways to support https://telegra.ph/How-support-the-channel-02-19
Download Telegram
In the article "How Containers Work," the author provides an in-depth exploration of the fundamental concepts and mechanisms behind container technology. Containers have transformed the software development and deployment landscape by offering a lightweight, portable, and efficient way to run applications. This article breaks down the core components of containers, such as namespaces, cgroups, and the container runtime, explaining how they interact to isolate and manage application processes. By understanding the inner workings of containers, readers can gain insights into their advantages, how they differ from traditional virtualization, and best practices for leveraging them in modern software development workflows.

https://ikouchiha47.github.io/2024/02/05/how-containers-work.html
This article delves into effective strategies for enhancing Continuous Integration (CI) build speeds and reducing operational costs. It provides a comprehensive guide on optimizing CI processes by implementing parallel test execution, caching build artifacts, and reorganizing workflows into distinct jobs to improve efficiency and decrease resource usage.

https://owaiskhan.me/post/improve-ci-build-time-and-reduce-cost
Looking for a hosting platform to practice with Linux, Kubernetes, etc.? Register using my referral link on DigitalOcean and get $200 in credit for 60 days. By registering through my referral link, you also support this Telegram channel.

👉 Register
👍7
Cleaner is a Kubernetes controller that identifies unused or unhealthy resources, helping you maintain a streamlined and efficient Kubernetes cluster. It provides flexible scheduling, label filtering, Lua-based selection criteria, resource removal or update and notifications via Slack, Webex and Discord. it can also automate clusters operations.

https://github.com/gianlucam76/k8s-cleaner
👍6
This article offers practical advice on essential tools to have installed on Linux servers to quickly address performance issues during a crisis. It outlines a recommended toolkit for immediate diagnostics and troubleshooting, highlighting the importance of having these tools pre-installed to avoid delays in resolving urgent problems. The discussion includes detailed scenarios illustrating potential challenges and solutions when dealing with system outages, underscoring the utility of preparedness in crisis management.

https://www.brendangregg.com/blog/2024-03-24/linux-crisis-tools.html
🔥3
The blog post challenges the notion that cybersecurity is uniquely complex within the software industry. She argues that many of the issues faced by cybersecurity are similar to those encountered by other engineering disciplines. Through critical analysis and comparison, Shortridge advocates for integrating cybersecurity with other engineering efforts to improve overall system resilience and reduce inefficiencies caused by treating cybersecurity as a separate, esoteric field.
https://kellyshortridge.com/blog/posts/cybersecurity-isnt-special/
👍5
The article discusses how DoorDash utilizes eBPF (extended Berkeley Packet Filter) technology for advanced monitoring capabilities to handle network traffic across their services efficiently. The post details the development and operation of BPFAgent, a tool built using eBPF to enhance observability within their Kubernetes clusters by tracking and analyzing network interactions and system calls without additional code instrumentation.
https://doordash.engineering/2023/08/15/bpfagent-ebpf-for-monitoring-at-doordash/
👍6
In the blog post the author explores the key performance indicators used to measure the efficiency and effectiveness of software delivery. Known as DORA metrics, these indicators include deployment frequency, lead time for changes, mean time to recovery, and change failure rate. By analyzing these metrics, organizations can gain valuable insights into their development processes and identify areas for improvement. This article provides an in-depth look at each metric, offering practical advice on how to leverage them to optimize software delivery and achieve DevOps excellence.
https://www.datadoghq.com/blog/dora-metrics-software-delivery/
👍3
The article explains how to use AWS CodeBuild's Lambda Compute to make Terraform deployments faster and more cost-efficient. The post covers recent updates to CodeBuild that integrate Lambda for building and testing, and introduces the use of custom images for builds. Practical steps and code snippets are included to help readers implement these improvements in their own infrastructure projects.
https://dev.to/aws-builders/accelerate-and-save-cost-for-terraform-deployments-with-aws-codebuilds-lambda-compute-5814
👍2
The blog post delves into the challenges and techniques of creating Infrastructure as Code (IaC) using existing cloud configurations. The author, Cory O'Daniel, critiques the limitations of current tools like Terraformer and Terracognita, which often fail to simplify the process effectively. He emphasizes the complexity and manual effort still required to convert existing resources into well-organized IaC, highlighting the pitfalls and inefficiencies in these tools.
https://www.massdriver.cloud/blogs/generating-infrastructure-as-code-from-existing-cloud-resources