The article "Autoscaling with Keda and Prometheus Using Custom Metrics in Go" on *Medium* provides a detailed guide on how to implement autoscaling in Kubernetes using Keda and Prometheus. It demonstrates creating custom Prometheus metrics in a Go application, deploying it on Kubernetes, and configuring Prometheus to scrape these metrics. The article then shows how to integrate Keda with Prometheus to scale pods based on custom metrics, such as the number of HTTP requests or product orders, ensuring dynamic resource allocation during varying traffic conditions.
https://medium.com/vakifbank-teknoloji/autoscaling-with-keda-and-prometheus-using-custom-metrics-in-go-558a64668fc4
https://medium.com/vakifbank-teknoloji/autoscaling-with-keda-and-prometheus-using-custom-metrics-in-go-558a64668fc4
Medium
Autoscaling with Keda and Prometheus Using Custom Metrics in Go
Goals
👍3
The blogpost highlights potential security risks associated with automating Terraform lifecycle management. It discusses how malicious actors can exploit vulnerabilities in Terraform automation platforms, such as Hashicorp Cloud and Atlantis, by creating custom providers or using data sources to execute malicious code during the
https://snyk.io/blog/gitflops-dangers-of-terraform-automation-platforms/
terraform plan phase. This can lead to unauthorized access to sensitive cloud credentials, compromising entire cloud environments. The article emphasizes the need for secure defaults and validation mechanisms in these platforms to mitigate such riskshttps://snyk.io/blog/gitflops-dangers-of-terraform-automation-platforms/
Snyk Labs
GitFlops: The Dangers of Terraform Automation Platforms | Snyk Labs
Terraform automation platforms streamline infrastructure management but also introduce security vulnerabilities when speculative plans are executed. Read how attackers can exploit Terraform lifecycle automation to gain unauthorized cloud access, compromising…
👍2
In his article "TTR: the out-of-control metric," Lorin Hochstein critiques the application of the Time-to-Resolve (TTR) metric in incident management. He argues that since incidents represent periods when systems are out of control, applying statistical analyses to TTR is ineffective and does not lead to meaningful improvements.
https://surfingcomplexity.blog/2024/11/23/ttr-the-out-of-control-metric/
https://surfingcomplexity.blog/2024/11/23/ttr-the-out-of-control-metric/
Surfing Complexity
TTR: the out-of-control metric
I’m currently reading The Machine That Changed The World. This is a book written back in 1990 comparing Toyota’s approach to automobile manufacturing to the approach used by American ca…
👍2
Richard Artoul explores the distinctions between "shared nothing" and "shared storage" architectures, particularly within data streaming contexts. He highlights how shared storage systems, by decoupling data from metadata, offer enhanced flexibility and scalability compared to traditional shared-nothing models. citeturn0search0
https://www.warpstream.com/blog/the-case-for-shared-storage
https://www.warpstream.com/blog/the-case-for-shared-storage
Warpstream
The Case for Shared Storage
In this post, I’ll start off with a brief overview of “shared nothing” vs. “shared storage” architectures in general. This discussion will be a bit abstract and high-level, but the goal is to share with you some of the guiding philosophy that ultimately led…
👍3
The mighty, self-hostable Git server for the command line🍦
https://github.com/charmbracelet/soft-serve
https://github.com/charmbracelet/soft-serve
GitHub
GitHub - charmbracelet/soft-serve: The mighty, self-hostable Git server for the command line🍦
The mighty, self-hostable Git server for the command line🍦 - charmbracelet/soft-serve
🔥3💯2❤1
Runbook automation platform with deep observability integrations for SRE & On-Call Teams
https://github.com/DrDroidLab/playbooks
https://github.com/DrDroidLab/playbooks
GitHub
GitHub - DrDroidLab/PlayBooks: Runbook automation platform with deep observability integrations for SRE & On-Call Teams
Runbook automation platform with deep observability integrations for SRE & On-Call Teams - DrDroidLab/PlayBooks
👍4
In the blogpost examined how increasing CPU utilization can lead to higher latency, affecting overall system performance. Through various experiments, they observed that as CPU usage rises, latency increases, highlighting the importance of optimizing system efficiency to maintain performance under varying loads. citeturn0search0
https://github.blog/engineering/architecture-optimization/breaking-down-cpu-speed-how-utilization-impacts-performance/
https://github.blog/engineering/architecture-optimization/breaking-down-cpu-speed-how-utilization-impacts-performance/
The GitHub Blog
Breaking down CPU speed: How utilization impacts performance
The Performance Engineering team at GitHub assessed how CPU performance degrades as utilization increases and how this relates to capacity.
👍2
In this article "Solutions to the Lost Update Problem," Sönke Ruempler addresses the challenges of concurrent transactions leading to lost updates in database systems.
https://ruempler.eu/2024/11/25/solutions-to-the-lost-update-problem/
https://ruempler.eu/2024/11/25/solutions-to-the-lost-update-problem/
Sönke Ruempler
Solutions to the Lost Update Problem
The Lost Update Problem is a common issue in concurrent systems, where two transactions read the same data, modify it, and write it back to the database. The second transaction will overwrite the chan
Forwarded from Best Channels for Tech guys
🐍 Python Notes 🐍
Stay on top of your Python skills with concise notes, tips, and tricks for every level of developer! Whether you're a beginner or advanced, these notes cover everything from basic syntax to advanced libraries and real-world applications.
📘 Comprehensive Python Guides
⚙️ Practical Coding Tips & Tricks
🚀 Master Python, Step by Step
✨ Subscribe now and boost your Python knowledge! 📲
Stay on top of your Python skills with concise notes, tips, and tricks for every level of developer! Whether you're a beginner or advanced, these notes cover everything from basic syntax to advanced libraries and real-world applications.
📘 Comprehensive Python Guides
⚙️ Practical Coding Tips & Tricks
🚀 Master Python, Step by Step
✨ Subscribe now and boost your Python knowledge! 📲
🔥4
Define sleep & wake up cycles for your Kubernetes resources. Automatically schedule to shutdown Deployments, CronJobs, StatefulSets and HorizontalPodAutoscalers that occupy resources in your cluster and wake them up only when you need them, reducing that way the overall power consumption.
https://github.com/rekuberate-io/sleepcycles
https://github.com/rekuberate-io/sleepcycles
GitHub
GitHub - rekuberate-io/sleepcycles: Define sleep & wake up cycles for your Kubernetes resources. Automatically schedule to shutdown…
Define sleep & wake up cycles for your Kubernetes resources. Automatically schedule to shutdown Deployments, CronJobs, StatefulSets and HorizontalPodAutoscalers that occupy resources in you...
❤🔥4
Chaos snake for Kubernetes. Chaos engineering combined with a classic snake game.
https://github.com/deggja/chaossnake
https://github.com/deggja/chaossnake
GitHub
GitHub - deggja/chaossnake: Chaos snake for Kubernetes. Chaos engineering combined with a classic snake game.
Chaos snake for Kubernetes. Chaos engineering combined with a classic snake game. - deggja/chaossnake
👍3
Lawrence Jones discusses the challenges of ensuring that an incident management system can alert its own team during outages.
https://incident.io/hubs/building-on-call/who-watches-the-watchers
https://incident.io/hubs/building-on-call/who-watches-the-watchers
incident.io
How we page ourselves if incident.io goes down
Learn how we tackle the ultimate paradox: ensuring our alerting system pages us, even when it’s the one failing.
It's a common question - let's dive into detail on our "dead man's switch", how we stress-test our systems, and why we care so much about our…
It's a common question - let's dive into detail on our "dead man's switch", how we stress-test our systems, and why we care so much about our…
🔥3❤1
In the article discusses the challenges of maintaining Service Level Objectives (SLOs) in a microservices environment. The team redefined their Critical User Journeys (CUJs) and implemented end-to-end (E2E) testing to automate SLO maintenance, resulting in a 99% reduction in maintenance time and immediate impact assessment during incidents.
https://engineering.mercari.com/en/blog/entry/20241204-keeping-user-journey-slos-up-to-date-with-e2e-testing-in-a-microservices-architecture/
https://engineering.mercari.com/en/blog/entry/20241204-keeping-user-journey-slos-up-to-date-with-e2e-testing-in-a-microservices-architecture/
Mercari
Keeping User Journey SLOs Up-to-Date with E2E Testing in a Microservices Architecture
This post is for Day 3 of Mercari Advent Calendar 2024, brought to you by @yakenji from the Mercari Site Reliability Eng
👍6
In December 2024, AWS introduced a visual deployment timeline feature for CloudFormation, enhancing the infrastructure-as-code service with real-time visualization of resource provisioning sequences. This timeline offers a graphical representation of the order and duration of resource deployments, providing insights into dependencies and potential bottlenecks.
https://www.infoq.com/news/2024/12/cloudformation-visual-deployment/
https://www.infoq.com/news/2024/12/cloudformation-visual-deployment/
InfoQ
AWS Introduces Visual Deployment Timeline to Aid CloudFormation Debugging
Amazon Web Services (AWS) has enhanced its CloudFormation service by adding a new deployment timeline view. The new view gives developers and cloud engineers a more intuitive way to track and understand the infrastructure deployment process, including new…
👍4
In the article "The Karpenter Transformation," Nadav Buchman from Fiverr Engineering discusses the company's migration of their Kubernetes compute nodes to Karpenter, an open-source Kubernetes node lifecycle manager developed by AWS.
https://medium.com/fiverr-engineering/the-karpenter-transformation-1c278294bd9b
https://medium.com/fiverr-engineering/the-karpenter-transformation-1c278294bd9b
Medium
Welcome to Fiverr’s Infrastructure Renaissance: First Up — the Karpenter transformation
How we migrated our Kubernetes compute nodes to Karpenter, and what have we gained from it.
👍4❤1
Intelligence for Kubernetes. World's most promising Kubernetes Visualization Tool for Developer and Platform Engineering teams.
https://github.com/KusionStack/karpor
https://github.com/KusionStack/karpor
GitHub
GitHub - KusionStack/karpor: Intelligence for Kubernetes. World's most promising Kubernetes Visualization Tool for Developer and…
Intelligence for Kubernetes. World's most promising Kubernetes Visualization Tool for Developer and Platform Engineering teams. - GitHub - KusionStack/karpor: Intelligence for Kubernetes. ...
👍5