
DevOps & SRE notes

The write-up by Overcast explores whether Kubernetes pods are truly evicted because of CPU pressure or if other scheduler nuances are at play. By dissecting eviction events and kubelet metrics, the author equips operators with actionable tips to diagnose and prevent unexpected pod terminations.
https://overcast.blog/do-pods-really-get-evicted-due-to-cpu-pressure-2b27274a670c

Medium

Do Kubernetes Pods Really Get Evicted Due to CPU Pressure?

As Kubernetes administrators and developers, we’ve all heard the notion that pods can get evicted due to high CPU pressure on a node. But…

👍4

14.3K views tutunak, 15:02
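As a rough illustration of the question the post raises: the kubelet's documented node-pressure eviction signals cover memory, disk space, inodes, and PIDs, and CPU is not among them, because CPU is a compressible resource that gets throttled instead. The sketch below encodes that list. The helper names and the `cpu.available` key are hypothetical and exist only for this example, not in Kubernetes.

```python
# Signals the kubelet documents for node-pressure eviction.
# CPU is absent: it is compressible, so pods are throttled, not evicted.
EVICTION_SIGNALS = frozenset({
    "memory.available",
    "nodefs.available",
    "nodefs.inodesFree",
    "imagefs.available",
    "imagefs.inodesFree",
    "pid.available",
})


def can_trigger_eviction(signal):
    """Return True if the named signal is a node-pressure eviction signal."""
    return signal in EVICTION_SIGNALS


def breached_signals(observed, thresholds):
    """Return the sorted eviction signals whose observed value is below its threshold.

    Keys that are not eviction signals (for example a made-up
    "cpu.available") are ignored, mirroring the fact that CPU pressure
    alone does not start an eviction.
    """
    return sorted(
        name
        for name, minimum in thresholds.items()
        if name in EVICTION_SIGNALS
        and observed.get(name, float("inf")) < minimum
    )
```

Passing a node snapshot with low memory and saturated CPU to `breached_signals` reports only the memory signal, which is the distinction worth checking in eviction events.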

DevOps & SRE notes

This piece argues that GitHub’s network effects, developer experience, and strategic acquisitions ultimately cemented its dominance in the code-hosting world. It contrasts competing platforms and offers lessons for toolmakers seeking to build thriving ecosystems.
https://blog.gitbutler.com/why-github-actually-won/

Butler's Log

Why GitHub Actually Won

How GitHub _actually_ became the dominant force it is today, from one of its cofounders.

❤1🔥1

13.7K views tutunak, 08:02

DevOps & SRE notes

Laminar - open-source all-in-one platform for engineering AI products. Create a data flywheel for your AI app. Traces, Evals, Datasets, Labels. YC S24.

https://github.com/lmnr-ai/lmnr

GitHub

GitHub - lmnr-ai/lmnr: Laminar - open-source observability platform purpose-built for AI agents. YC S24.


👍4❤1

14.3K views tutunak, 14:04

DevOps & SRE notes

vCluster - Create fully functional virtual Kubernetes clusters - Each vcluster runs inside a namespace of the underlying k8s cluster. It's cheaper than creating separate full-blown clusters and it offers better multi-tenancy and isolation than regular namespaces.

https://github.com/loft-sh/vcluster?tab=readme-ov-file

GitHub

GitHub - loft-sh/vcluster: vCluster - Create fully functional virtual Kubernetes clusters - Each vcluster runs inside a namespace…


👌3

13.2K views tutunak, 08:01

DevOps & SRE notes

The post from Massdriver demonstrates how to generate clean Infrastructure-as-Code configurations from resources that already exist in your cloud accounts. Through step-by-step examples, it shows teams how to regain version control and security compliance without rebuilding everything from scratch.
https://www.massdriver.cloud/blogs/generating-infrastructure-as-code-from-existing-cloud-resources

Generating Infrastructure-as-Code From Existing Cloud Resources | Massdriver Blog

Generating Infrastructure-as-Code From Existing Cloud Resources

Generating OpenTofu or Terraform from existing cloud resources is a tedious process fraught with gotchas. It's not a silver bullet for click-ops debt. This post will discuss caveats of tools and different approaches to generating IaC.

14.2K views tutunak, 14:00

DevOps & SRE notes

In this overview, Martin Heinz walks through recent, little-known features that BuildKit, the improved builder backend for Docker, has added. The practical focus encourages experimentation in everyday development workflows.
https://martinheinz.dev/blog/111

martinheinz.dev

Recent Docker BuildKit Features You're Missing Out On

With the introduction of BuildKit - the improved builder backend for Docker - many new features have been added to Docker, many of which are little known. S...

13.3K views tutunak, 08:00

DevOps & SRE notes

Pretty fancy and modern terminal file manager

https://github.com/yorukot/superfile

GitHub

GitHub - yorukot/superfile: Pretty fancy and modern terminal file manager


👍3

13.9K views tutunak, 14:04

DevOps & SRE notes

Real-time monitoring of Linux process network traffic, including the client IP, port, and transfer speed for each connection.

https://github.com/CDWEN0526/tmd-top

GitHub

GitHub - CDWEN0526/tmd-top: Real-time monitoring of Linux process network traffic, including the client IP, port, and transfer…


👍4

12.9K views tutunak, 08:04

DevOps & SRE notes

This story dives into structuring Argo CD repositories with ApplicationSets so that teams can manage dozens of Kubernetes environments from a single source of truth. Clear diagrams and YAML examples make the pattern easy to adopt for both greenfield and legacy clusters.
https://medium.com/containers-101/how-to-structure-your-argo-cd-repositories-using-application-sets-1150e75d05b3

Medium

How to Structure Your Argo CD Repositories Using Application Sets

In the previous article of the series we explained how to model GitOps environments and promote an application between them. That article…

❤3👍2

13.7K views tutunak, 14:04

DevOps & SRE notes

The publication on Random Tinkering details how to schedule lightweight CronJobs that scrape node-level metrics with Node Exporter and ship them to Prometheus. It balances operational guidance with security considerations, ensuring observability without overloading the control plane.
https://randomtinkering.hashnode.dev/how-to-collect-kubernetes-node-metrics-with-node-exporter-using-cronjobs

👍2

12.6K views tutunak, 08:00

DevOps & SRE notes

Distributed, in-memory key/value store and cache. It can be used as an embedded Go library and a language-independent service.

https://github.com/olric-data/olric

GitHub

GitHub - olric-data/olric: Distributed, in-memory key/value store and cache. It can be used as an embedded Go library and a language…


👍1

13K views tutunak, 14:04

DevOps & SRE notes

Kubernetes Control Plane Virtual IP and Load-Balancer

https://github.com/kube-vip/kube-vip

GitHub

GitHub - kube-vip/kube-vip: Kubernetes Control Plane Virtual IP and Load-Balancer


👍3

12K views tutunak, 08:03

DevOps & SRE notes

This entry introduces Talos, a minimal Linux distribution purpose-built for Kubernetes, and explains how its immutable design reduces drift and patching headaches. It walks through the installation flow, cluster bootstrap, and day-two operations from a practitioner’s viewpoint.
https://a-cup-of.coffee/blog/talos/

a-cup-of.coffee

Talos - An Immutable OS for Kubernetes

Talos is an operating system for Kubernetes. It is designed to be lightweight, secure, and easy to use. In this article, I will introduce Talos and its features.

👍5

12.6K views tutunak, 14:03

DevOps & SRE notes

In this post, Ashish B. explains why he finds it hard to recommend Google Cloud, citing the challenges of migrating from Google Domains and Container Registry. It weighs the platform's strengths against its pitfalls and is a useful read for engineers choosing between cloud providers.
https://ashishb.net/programming/google-cloud/

ashishb.net

It is hard to recommend Google Cloud

Discover the challenges of migrating from Google Domains and Container Registry. Explore Google Cloud's strengths and pitfalls, plus read why strategy is key.

👍2

11.4K views tutunak, 08:04

DevOps & SRE notes

Get notified when actions are taken in the AWS Console.

https://github.com/cloudandthings/terraform-aws-clickops-notifier

GitHub

GitHub - cloudandthings/terraform-aws-clickops-notifier: Get notified when actions are taken in the AWS Console.


👍4❤1

12.1K views tutunak, 14:04

DevOps & SRE notes

A hybrid command-line/UI development experience for cloud-native development

https://github.com/kubernetes-sigs/kui?tab=readme-ov-file

GitHub

GitHub - kubernetes-retired/kui: A hybrid command-line/UI development experience for cloud-native development


👍2

10.5K views tutunak, 08:00

DevOps & SRE notes

The Architecture Weekly article “Making your system observability predictable” outlines practical techniques for evolving from scattered logs to coherent observability across services. Readers will discover why a holistic signals-first mindset matters more than bolting on dashboards late in the game.
https://www.architecture-weekly.com/p/making-your-system-observability

Architecture-Weekly

Making your system observability predictable

Everyone claims that observability is the key for production readiness. Yet, most of us just add auto-instrumentation right before going to production and call it a day. That's fine, but not enough. Inspired by Martin Thwaites' take, I showed how to add predictable…

❤3

11.5K views tutunak, 14:00

DevOps & SRE notes

This blogpost by Yandex SRE Dmitry Ziablov recounts a late-night incident that turned a harmless retry loop into a production outage. He dissects the cascade of failures and offers a framework for spotting bad retry patterns before they bite.
https://medium.com/yandex/good-retry-bad-retry-an-incident-story-648072d3cee6

Medium

Good Retry, Bad Retry: An Incident Story

Sometimes, a seemingly simple and obvious solution can lead to a series of problems later on. This is especially true when adding retries.

💩2

9.43K views tutunak, 08:00
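To make the "good retry" idea concrete, here is a minimal sketch of one widely used pattern: a bounded number of attempts with capped exponential backoff and full jitter, retrying only errors the caller marks as transient. It is not taken from the article; `backoff_delays`, `call_with_retries`, and all parameter names are hypothetical and chosen for this example.

```python
import random
import time


def backoff_delays(base=0.1, cap=5.0, attempts=5, rng=random.random):
    """Yield one sleep duration per retry.

    The ceiling doubles each time up to `cap`; the actual delay is a
    random fraction of that ceiling ("full jitter"), which spreads
    retries from many clients apart in time.
    """
    for attempt in range(attempts):
        ceiling = min(cap, base * (2 ** attempt))
        yield rng() * ceiling


def call_with_retries(operation, is_retryable, attempts=5,
                      sleep=time.sleep, **backoff):
    """Call operation() up to `attempts` times in total.

    Only exceptions accepted by is_retryable() are retried; anything
    else, or running out of attempts, re-raises the last exception.
    """
    delays = backoff_delays(attempts=attempts - 1, **backoff)
    while True:
        try:
            return operation()
        except Exception as exc:
            delay = next(delays, None)
            if delay is None or not is_retryable(exc):
                raise
            sleep(delay)
```

The hard cap on attempts is the part that keeps a retry loop from amplifying load during an outage. A per-client retry budget or a circuit breaker would be natural additions, but they are left out of this sketch.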

DevOps & SRE notes

The piece argues that traces beat metrics when you need to pinpoint latency spikes and hidden dependencies. It walks through three concrete debugging scenarios that show why span data can surface root causes in seconds.
https://jaywhy13.hashnode.dev/3-reasons-traces-better-than-metrics-for-debugging-your-application

❤1👍1

10.3K views tutunak, 14:03

DevOps & SRE notes

In Slack’s detailed write-up, engineers share how Unified Grid re-architected Slack for its largest customers. The narrative explains how the software had drifted from its core assumptions as new code and use cases accumulated, and why revisiting those assumptions was necessary.
https://slack.engineering/unified-grid-how-we-re-architected-slack-for-our-largest-customers/

slack.engineering

Unified Grid: How We Re-Architected Slack for Our Largest Customers

All software is built atop a core set of assumptions. As new code is added and new use-cases emerge, software can become unmoored from those assumptions. When this happens, a fundamental tension arises between revisiting those foundational assumptions—which…

❤1

8.43K views tutunak, 08:05

DevOps & SRE notes

This post explains how Sharyash Agrawal tamed CPU throttling in Go services running under Kubernetes limits. From choosing the right GC knob to tuning Go’s runtime scheduler, the guide helps teams avoid sudden latency spikes.
https://medium.com/@sharyash81/solving-cpu-throttling-issue-in-golang-applications-before-hitting-the-cpu-limit-in-kubernetes-7d8f40da6477

Medium

Solving CPU throttling issue in Golang applications before hitting the CPU limit in Kubernetes.

We faced an issue within our Kubernetes cluster wherein certain multi-threaded Golang applications, for which CPU limit has been set, are…

👍3❤1

10K views tutunak, 15:02
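A commonly cited cause of throttling below the CPU limit is a multi-threaded runtime sizing itself from the host's core count instead of the container's quota. Whether the post takes exactly this route is an assumption. The sketch below derives a thread count from the contents of a cgroup v2 `cpu.max` file (`"<quota> <period>"` or `"max <period>"`). The function names are hypothetical, and rounding down with a floor of 1 is a choice made for this example.

```python
import math


def cpu_limit_from_cpu_max(cpu_max):
    """Parse cgroup v2 cpu.max contents into a CPU count, or None if unlimited."""
    quota, period = cpu_max.split()
    if quota == "max":
        return None  # no CPU limit is set for this cgroup
    return int(quota) / int(period)


def suggested_gomaxprocs(cpu_max, host_cpus):
    """Suggest a GOMAXPROCS value that does not exceed the container's CPU limit.

    Assumption for this sketch: round the fractional limit down, but
    never below 1 and never above the number of host CPUs.
    """
    limit = cpu_limit_from_cpu_max(cpu_max)
    if limit is None:
        return host_cpus
    return max(1, min(host_cpus, math.floor(limit)))
```

For a pod limited to two CPUs on a 16-core node, `suggested_gomaxprocs("200000 100000", 16)` yields 2, so the runtime would schedule onto two threads and avoid burning the quota across sixteen.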
