DevOps & SRE notes

The article "Autoscaling with Keda and Prometheus Using Custom Metrics in Go" on *Medium* is a step-by-step guide to autoscaling in Kubernetes with KEDA and Prometheus. It creates custom Prometheus metrics in a Go application, deploys the application on Kubernetes, and configures Prometheus to scrape those metrics. It then integrates KEDA with Prometheus so that pods scale on custom metrics, such as the number of HTTP requests or product orders, as traffic changes.
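A minimal sketch of the two moving parts described in this post, written in Python with only the standard library rather than the article's Go. `render_counter` emits a counter in the Prometheus text exposition format, and `desired_replicas` approximates the ceil(metric / threshold) arithmetic that the Horizontal Pod Autoscaler applies to an average-value metric target. The metric name `http_requests_total`, the threshold, and the replica bounds are illustrative assumptions, not values taken from the article.

```python
import math


def render_counter(name: str, help_text: str, value: float) -> str:
    """Render one counter in the Prometheus text exposition format."""
    return (
        f"# HELP {name} {help_text}\n"
        f"# TYPE {name} counter\n"
        f"{name} {value}\n"
    )


def desired_replicas(metric_value: float, threshold: float,
                     min_replicas: int = 1, max_replicas: int = 10) -> int:
    """Approximate the replica count for an average-value metric target.

    Assumption: replicas = ceil(metric / threshold), clamped to the bounds.
    Real autoscalers also apply tolerances and stabilization windows.
    """
    if threshold <= 0:
        raise ValueError("threshold must be positive")
    wanted = math.ceil(metric_value / threshold)
    return max(min_replicas, min(max_replicas, wanted))


if __name__ == "__main__":
    # Hypothetical metric name and values, for illustration only.
    print(render_counter("http_requests_total", "Total HTTP requests.", 42), end="")
    print(desired_replicas(metric_value=450, threshold=100))
```

With a query result of 450 and a threshold of 100, the sketch asks for 5 replicas. The clamp keeps the result inside the configured minimum and maximum.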

https://medium.com/vakifbank-teknoloji/autoscaling-with-keda-and-prometheus-using-custom-metrics-in-go-558a64668fc4

Medium

Autoscaling with Keda and Prometheus Using Custom Metrics in Go

Goals

👍3

2.08K views tutunak, 17:01

DevOps & SRE notes

The blog post highlights security risks in automating Terraform lifecycle management. It describes how malicious actors can exploit Terraform automation platforms, such as HashiCorp Cloud and Atlantis, by creating custom providers or using data sources to execute malicious code during the terraform plan phase. This can expose sensitive cloud credentials and compromise entire cloud environments. The article emphasizes the need for secure defaults and validation mechanisms in these platforms to mitigate such risks.

https://snyk.io/blog/gitflops-dangers-of-terraform-automation-platforms/

Snyk Labs

GitFlops: The Dangers of Terraform Automation Platforms | Snyk Labs

Terraform automation platforms streamline infrastructure management but also introduce security vulnerabilities when speculative plans are executed. Read how attackers can exploit Terraform lifecycle automation to gain unauthorized cloud access, compromising…

👍2

2.14K views tutunak, 08:00

DevOps & SRE notes

An operator to manage ephemeral Kubernetes resources 🐝

https://github.com/NCCloud/mayfly

GitHub

GitHub - NCCloud/mayfly: An operator to manage ephemeral Kubernetes resources :honeybee:

An operator to manage ephemeral Kubernetes resources :honeybee: - NCCloud/mayfly

👍4❤1

2.24K views tutunak, 16:01

DevOps & SRE notes

OpenTofu / Terraform / Terragrunt and Atmos version manager

https://github.com/tofuutils/tenv

GitHub

GitHub - tofuutils/tenv: OpenTofu / Terraform / Terragrunt / Terramate and Atmos version manager

OpenTofu / Terraform / Terragrunt / Terramate and Atmos version manager - tofuutils/tenv

👍4❤2

2.16K views tutunak, 08:03

DevOps & SRE notes

In his article "TTR: the out-of-control metric," Lorin Hochstein critiques the application of the Time-to-Resolve (TTR) metric in incident management. He argues that since incidents represent periods when systems are out of control, applying statistical analyses to TTR is ineffective and does not lead to meaningful improvements.

https://surfingcomplexity.blog/2024/11/23/ttr-the-out-of-control-metric/

Surfing Complexity

TTR: the out-of-control metric

I’m currently reading The Machine That Changed The World. This is a book written back in 1990 comparing Toyota’s approach to automobile manufacturing to the approach used by American ca…

👍2

2.21K views tutunak, 16:04

DevOps & SRE notes

Richard Artoul explores the distinctions between "shared nothing" and "shared storage" architectures, particularly within data streaming contexts. He highlights how shared storage systems, by decoupling data from metadata, offer enhanced flexibility and scalability compared to traditional shared-nothing models.

https://www.warpstream.com/blog/the-case-for-shared-storage

Warpstream

The Case for Shared Storage

In this post, I’ll start off with a brief overview of “shared nothing” vs. “shared storage” architectures in general. This discussion will be a bit abstract and high-level, but the goal is to share with you some of the guiding philosophy that ultimately led…

👍3

2.19K views tutunak, 08:01

DevOps & SRE notes

The mighty, self-hostable Git server for the command line🍦

https://github.com/charmbracelet/soft-serve

GitHub

GitHub - charmbracelet/soft-serve: The mighty, self-hostable Git server for the command line🍦

The mighty, self-hostable Git server for the command line🍦 - charmbracelet/soft-serve

🔥3💯2❤1

2.43K views tutunak, 16:04

DevOps & SRE notes

Runbook automation platform with deep observability integrations for SRE & On-Call Teams

https://github.com/DrDroidLab/playbooks

GitHub

GitHub - DrDroidLab/PlayBooks: Runbook automation platform with deep observability integrations for SRE & On-Call Teams

Runbook automation platform with deep observability integrations for SRE & On-Call Teams - DrDroidLab/PlayBooks

👍4

2.21K views tutunak, 08:02

DevOps & SRE notes

The blog post examines how increasing CPU utilization can lead to higher latency, affecting overall system performance. Through various experiments, GitHub's Performance Engineering team observed that latency increases as CPU usage rises, highlighting the importance of optimizing system efficiency to maintain performance under varying loads.
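One simple way to see why latency climbs with utilization is the textbook M/M/1 queueing model, in which the mean time a request spends in the system is the service time divided by (1 - utilization). This is an assumed illustrative model, not the methodology of the GitHub post, which reports measurements from real hardware. The 10 ms service time is a made-up figure.

```python
def mm1_latency(service_time_ms: float, utilization: float) -> float:
    """Mean time in system for an M/M/1 queue, in milliseconds.

    Assumption: a single server with random arrivals and service times.
    Real CPUs have many cores, frequency scaling, and other effects.
    """
    if not 0 <= utilization < 1:
        raise ValueError("utilization must be in [0, 1)")
    return service_time_ms / (1.0 - utilization)


if __name__ == "__main__":
    # Hypothetical 10 ms service time at several utilization levels.
    for u in (0.1, 0.5, 0.8, 0.9, 0.95):
        print(f"{u:.0%} utilization: {mm1_latency(10.0, u):.1f} ms")
```

In this model the curve is nonlinear: the step from 80% to 90% utilization adds far more latency than the step from 10% to 50%.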

https://github.blog/engineering/architecture-optimization/breaking-down-cpu-speed-how-utilization-impacts-performance/

The GitHub Blog

Breaking down CPU speed: How utilization impacts performance

The Performance Engineering team at GitHub assessed how CPU performance degrades as utilization increases and how this relates to capacity.

👍2

2.38K views tutunak, 15:20

DevOps & SRE notes

In the article "Solutions to the Lost Update Problem," Sönke Ruempler addresses lost updates caused by concurrent transactions in database systems.
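A minimal sketch of one widely used remedy, optimistic locking with a version column, using Python's built-in `sqlite3` module. The table `accounts` and the helper names are hypothetical and chosen for illustration; this is not presented as the article's own code. A write succeeds only if the row still carries the version that the writer read.

```python
import sqlite3


def make_db() -> sqlite3.Connection:
    """Create an in-memory database with one account at version 0."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE accounts (id INTEGER PRIMARY KEY, balance INTEGER, version INTEGER)"
    )
    conn.execute("INSERT INTO accounts VALUES (1, 100, 0)")
    conn.commit()
    return conn


def read_account(conn: sqlite3.Connection, account_id: int):
    """Return (balance, version) for the given account."""
    return conn.execute(
        "SELECT balance, version FROM accounts WHERE id = ?", (account_id,)
    ).fetchone()


def write_if_unchanged(conn: sqlite3.Connection, account_id: int,
                       new_balance: int, seen_version: int) -> bool:
    """Write only if nobody else has bumped the version since we read it."""
    cur = conn.execute(
        "UPDATE accounts SET balance = ?, version = version + 1 "
        "WHERE id = ? AND version = ?",
        (new_balance, account_id, seen_version),
    )
    conn.commit()
    return cur.rowcount == 1


if __name__ == "__main__":
    conn = make_db()
    # Two writers read the same row before either one writes.
    balance_a, version_a = read_account(conn, 1)
    balance_b, version_b = read_account(conn, 1)
    print(write_if_unchanged(conn, 1, balance_a + 10, version_a))  # first write wins
    print(write_if_unchanged(conn, 1, balance_b + 20, version_b))  # stale write is rejected
    print(read_account(conn, 1))
```

The rejected writer is expected to re-read the row and retry, so its change is applied on top of the first one instead of silently overwriting it.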

https://ruempler.eu/2024/11/25/solutions-to-the-lost-update-problem/

Sönke Ruempler

Solutions to the Lost Update Problem

The Lost Update Problem is a common issue in concurrent systems, where two transactions read the same data, modify it, and write it back to the database. The second transaction will overwrite the chan

2.1K views tutunak, 08:03

DevOps & SRE notes

Forwarded from Best Channels for Tech guys

🐍 Python Notes 🐍

Stay on top of your Python skills with concise notes, tips, and tricks for every level of developer! Whether you're a beginner or advanced, these notes cover everything from basic syntax to advanced libraries and real-world applications.

📘 Comprehensive Python Guides
⚙️ Practical Coding Tips & Tricks
🚀 Master Python, Step by Step

✨ Subscribe now and boost your Python knowledge! 📲

🔥4

2.14K views 11:21

DevOps & SRE notes

Define sleep & wake up cycles for your Kubernetes resources. Automatically schedule the shutdown of Deployments, CronJobs, StatefulSets and HorizontalPodAutoscalers that occupy resources in your cluster and wake them up only when you need them, thereby reducing overall power consumption.

https://github.com/rekuberate-io/sleepcycles

GitHub

GitHub - rekuberate-io/sleepcycles: Define sleep & wake up cycles for your Kubernetes resources. Automatically schedule to shutdown…

Define sleep & wake up cycles for your Kubernetes resources. Automatically schedule to shutdown Deployments, CronJobs, StatefulSets and HorizontalPodAutoscalers that occupy resources in you...

❤‍🔥4

2.56K views tutunak, 16:05

DevOps & SRE notes

Chaos snake for Kubernetes. Chaos engineering combined with a classic snake game.

https://github.com/deggja/chaossnake

GitHub

GitHub - deggja/chaossnake: Chaos snake for Kubernetes. Chaos engineering combined with a classic snake game.

Chaos snake for Kubernetes. Chaos engineering combined with a classic snake game. - deggja/chaossnake

👍3

2.41K views tutunak, 08:02

DevOps & SRE notes

Lawrence Jones discusses the challenges of ensuring that an incident management system can alert its own team during outages.
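The general "dead man's switch" pattern can be sketched as a heartbeat monitor: the watched system checks in regularly, and an independent watcher raises the alarm when the check-ins stop. The class below is a hypothetical illustration in Python, not incident.io's implementation. The 30-second timeout and the injectable clock are assumptions made so the example is deterministic.

```python
import time


class DeadMansSwitch:
    """Reports expiry when no heartbeat arrives within the timeout."""

    def __init__(self, timeout_s: float, clock=time.monotonic):
        self._timeout_s = timeout_s
        self._clock = clock  # injectable so the demo can control time
        self._last_beat = clock()

    def heartbeat(self) -> None:
        """Called by the monitored system while it is healthy."""
        self._last_beat = self._clock()

    def expired(self) -> bool:
        """Called by the independent watcher; True means page a human."""
        return self._clock() - self._last_beat > self._timeout_s


if __name__ == "__main__":
    now = [0.0]  # fake clock, in seconds
    switch = DeadMansSwitch(timeout_s=30, clock=lambda: now[0])
    now[0] = 20.0
    print(switch.expired())  # heartbeat is still recent
    now[0] = 45.0
    print(switch.expired())  # no heartbeat for 45 s
    switch.heartbeat()
    print(switch.expired())  # recovered after a new heartbeat
```

The important design point is that the watcher runs outside the system it monitors, so silence itself becomes the signal.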

https://incident.io/hubs/building-on-call/who-watches-the-watchers

incident.io

How we page ourselves if incident.io goes down

Learn how we tackle the ultimate paradox: ensuring our alerting system pages us, even when it’s the one failing.

It's a common question - let's dive into detail on our "dead man's switch", how we stress-test our systems, and why we care so much about our…

🔥3❤1

2.41K views tutunak, 16:04

DevOps & SRE notes

The article discusses the challenges of maintaining Service Level Objectives (SLOs) in a microservices environment. The Mercari team redefined their Critical User Journeys (CUJs) and implemented end-to-end (E2E) testing to automate SLO maintenance, resulting in a 99% reduction in maintenance time and immediate impact assessment during incidents.

https://engineering.mercari.com/en/blog/entry/20241204-keeping-user-journey-slos-up-to-date-with-e2e-testing-in-a-microservices-architecture/

Mercari

Keeping User Journey SLOs Up-to-Date with E2E Testing in a Microservices Architecture

This post is for Day 3 of Mercari Advent Calendar 2024, brought to you by @yakenji from the Mercari Site Reliability Eng

👍6

2.62K views tutunak, 08:03

DevOps & SRE notes

Local development for AWS Lambda's and Terraform

https://github.com/uatec/teleform

GitHub

GitHub - uatec/teleform: Local development for AWS Lambda's and Terraform

Local development for AWS Lambda's and Terraform. Contribute to uatec/teleform development by creating an account on GitHub.

2.72K views tutunak, 16:02

DevOps & SRE notes

NVIDIA device plugin for Kubernetes

https://github.com/NVIDIA/k8s-device-plugin

GitHub

GitHub - NVIDIA/k8s-device-plugin: NVIDIA device plugin for Kubernetes

NVIDIA device plugin for Kubernetes. Contribute to NVIDIA/k8s-device-plugin development by creating an account on GitHub.

👍4

2.69K views tutunak, 08:04

DevOps & SRE notes

In December 2024, AWS introduced a visual deployment timeline feature for CloudFormation, enhancing the infrastructure-as-code service with real-time visualization of resource provisioning sequences. This timeline offers a graphical representation of the order and duration of resource deployments, providing insights into dependencies and potential bottlenecks.

https://www.infoq.com/news/2024/12/cloudformation-visual-deployment/

InfoQ

AWS Introduces Visual Deployment Timeline to Aid CloudFormation Debugging

Amazon Web Services (AWS) has enhanced its CloudFormation service by adding a new deployment timeline view. The new view gives developers and cloud engineers a more intuitive way to track and understand the infrastructure deployment process, including new…

👍4

2.91K views tutunak, 13:40

DevOps & SRE notes

In the article "The Karpenter Transformation," Nadav Buchman from Fiverr Engineering discusses the company's migration of their Kubernetes compute nodes to Karpenter, an open-source Kubernetes node lifecycle manager developed by AWS.

https://medium.com/fiverr-engineering/the-karpenter-transformation-1c278294bd9b

Medium

Welcome to Fiverr’s Infrastructure Renaissance: First Up — the Karpenter transformation

How we migrated our Kubernetes compute nodes to Karpenter, and what have we gained from it.

👍4❤1

2.82K views tutunak, 07:36

DevOps & SRE notes

Intelligence for Kubernetes. World's most promising Kubernetes Visualization Tool for Developer and Platform Engineering teams.

https://github.com/KusionStack/karpor

GitHub

GitHub - KusionStack/karpor: Intelligence for Kubernetes. World's most promising Kubernetes Visualization Tool for Developer and…

Intelligence for Kubernetes. World's most promising Kubernetes Visualization Tool for Developer and Platform Engineering teams. - GitHub - KusionStack/karpor: Intelligence for Kubernetes. ...

👍5

3.02K views tutunak, 16:04

DevOps & SRE notes

Automate Kubernetes Configuration Editing

https://github.com/kptdev/kpt

GitHub

GitHub - kptdev/kpt: Automate Kubernetes Configuration Editing

Automate Kubernetes Configuration Editing. Contribute to kptdev/kpt development by creating an account on GitHub.

3.11K views tutunak, 08:02
