NEW BOT Телеграм, страница

The Real Failure Rate of EBS

PlanetScale has deployed millions of Amazon Elastic Block Store (EBS) volumes across the world. We create and destroy tens of thousands of them every day as we stand up databases for customers, take backups, and test our systems end-to-end. Through this experience, we have an unique viewpoint into the failure rate and mechanisms of EBS, and have spent a lot of time working on how to mitigate them.

https://planetscale.com/blog/the-real-fail-rate-of-ebs

3.79K views15:09

DevOps&SRE Library

ttyd

ttyd is a simple command-line tool for sharing terminal over the web.

https://github.com/tsl0922/ttyd

3.93K views07:06

DevOps&SRE Library

Kubernetes at Mercado Libre

At Mercado Libre, managing over 30,000 microservices and supporting 16,000 developers requires a robust and scalable solution. To address these challenges, we adopted Kubernetes (K8s) as the core engine of our internal platform, Fury. This article explores how Kubernetes streamlined our infrastructure management, accelerated software delivery, and improved cost efficiency, enabling our developers to focus on innovation and product creation.

https://medium.com/mercadolibre-tech/kubernetes-at-mercado-libre-ec331bea1866

3.81K views15:03

DevOps&SRE Library

From Autopilot to Standard GKE: The Key to 15x Cheaper Istio

TL;DR Istio proxy costs for 10-nodes GKE cluster reduced from $3065 to $185 per month.

https://medium.com/google-cloud/from-autopilot-to-standard-gke-the-key-to-15x-cheaper-istio-d755c1d3866d

3.73K views07:06

DevOps&SRE Library

All My DevOps Pipelines from GitLab Commit to ArgoCD got beaten by FTP

https://medium.com/@davidpech_39825/all-my-devops-pipelines-from-gitlab-commit-to-argocd-got-beaten-by-ftp-205b34f61441

3.92K views15:08

DevOps&SRE Library

When is read-only not read-only?

Kubernetes RBAC has some tricky areas where the behaviour you get might not be exactly what you expect, and sometimes as in this case, those unexpected behaviours are not very apparent!

https://raesene.github.io/blog/2024/11/11/When-Is-Read-Only-Not-Read-Only

4.55K views07:07

DevOps&SRE Library

Managing Kubernetes CEPH cluster monitor: easier than it looks

In this article I will explain how you can safely fail over a rook-ceph Monitor to a new Kubernetes node.

I hope I will save you some time and if you find the content useful please leave a clap.Also let me know in the comments of the article if you need professional help with a common topic.

https://blog.devgenius.io/managing-kubernetes-ceph-cluster-monitor-easier-than-it-looks-56ee000e68ff

4.89K views15:06

DevOps&SRE Library

How to Troubleshoot OOM Issues in Go Applications Running on Kubernetes

https://medium.com/@csepulvedab/how-to-troubleshoot-oom-issues-in-go-applications-running-on-kubernetes-149e8bb104ac

6.25K views07:05

DevOps&SRE Library

Securing Argo CD in a Multi-Tenant Environment with Application Projects

https://medium.com/containers-101/securing-argo-cd-in-a-multi-tenant-environment-with-application-projects-39ad30893fd5

4.99K views15:04

DevOps&SRE Library

How to mount secrets as files or environment variables in Kubernetes

https://itnext.io/how-to-mount-secrets-as-files-or-environment-variables-in-kubernetes-f03d545dcd89

4.68K views07:07

DevOps&SRE Library

Connecting K8S/Cilium cluster and K8S/Calico cluster using Submariner

https://medium.com/@yboaron/connecting-k8s-cilium-cluster-and-k8s-calico-cluster-using-submariner-d56d7c39f0cb

4.49K views15:06

DevOps&SRE Library

container-startup-autoscaler

container-startup-autoscaler (CSA) is a Kubernetes controller that modifies the CPU and/or memory resources of containers depending on whether they're starting up, according to the startup/post-startup settings you supply. CSA works at the pod level and is agnostic to how the pod is managed; it works with deployments, statefulsets, daemonsets and other workload management APIs.

https://github.com/ExpediaGroup/container-startup-autoscaler

4.66K views07:03

DevOps&SRE Library

kubectl.nvim

Processes kubectl outputs to enable vim-like navigation in a buffer for your cluster.

https://github.com/Ramilito/kubectl.nvim

5.19K views15:08

DevOps&SRE Library

falco

Falco is a cloud native runtime security tool for Linux operating systems. It is designed to detect and alert on abnormal behavior and potential security threats in real-time.

https://github.com/falcosecurity/falco

4.3K views07:03

DevOps&SRE Library

dice

DiceDB is an open-source, fast, reactive, in-memory database optimized for modern hardware. Commonly used as a cache, it offers a familiar interface while enabling real-time data updates through query subnoscriptions. It delivers higher throughput and lower median latencies, making it ideal for modern workloads.

https://github.com/dicedb/dice

3.98K views15:07

DevOps&SRE Library

stu

STU is the TUI explorer application for Amazon S3 (AWS S3) written in Rust using ratatui.

https://github.com/lusingander/stu

3.92K views07:04

DevOps&SRE Library

xan

xan is a command line tool that can be used to process CSV files directly from the shell.

https://github.com/medialab/xan

4.13K views15:04

DevOps&SRE Library

openproject

OpenProject is the leading open source project management software.

https://github.com/opf/openproject

3.78K views07:04

DevOps&SRE Library

Beyond “5 Whys”: A Better Way to Learn from Incidents

We all can agree that the most important purpose of a post-incident review (or post-mortem) is to learn from incidents. Implied in this learning is improving the system (people, processes, technology, and their interactions). All my reflections on the “5 Whys” technique refer back to how the technique enhances our learning (or not) from incidents.

https://uptimelabs.io/beyond-5-whys-a-better-way-to-learn-from-incidents

3.79K views15:04

DevOps&SRE Library

Anatomy of a Throttler

Part 1: https://planetscale.com/blog/anatomy-of-a-throttler-part-1

Part 2: https://planetscale.com/blog/anatomy-of-a-throttler-part-2

Part 3: https://planetscale.com/blog/anatomy-of-a-throttler-part-3

4.2K views07:04

DevOps&SRE Library

Systematically Terraforming a Brownfield of Cloud Infrastructure

Some thinking, trade-offs, theory building, and method-making one might ended up doing, in the course of bringing Infrastructure as Code (IaC) discipline to brownfield (and greenfield) services, at a small regulated fintech company, having a smaller engineering team that serves several business units, including one of India's largest national tax gateways. Only somewhat easier than reading a long compound sentence without pausing for breath. Phew.

https://www.evalapply.org/posts/systems-approach-to-infrastructure-as-code

4.35K views15:05

About

Blog

Apps

Platform