autotune
https://github.com/kruize/autotune
Kruize Autotune is an Autonomous Performance Tuning Tool for Kubernetes. Autotune accepts a user provided "slo" goal to optimize application performance. It uses Prometheus to identify "layers" of an application that it is monitoring and matches tunables from those layers to the user provided slo. It then runs experiments with the help of a hyperparameter optimization framework to arrive at the most optimal values for the identified set of tunables to get a better result for the user provided slo.
Autotune can take an arbitrarily large set of tunables and run experiments to continually optimize the user provided slo in incremental steps. For this reason, it does not necessarily have a "best" value for a set of tunables, only a "better" one than what is currently deployed.
https://github.com/kruize/autotune
kubeip
https://github.com/doitintl/kubeip
Kubernetes' nodes don't necessarily need their own public IP addresses to communicate. However, there are certain situations where it's beneficial for nodes in a node pool to have their own unique public IP addresses.
For instance, in gaming applications, a console might need to establish a direct connection with a cloud virtual machine to reduce the number of hops.
Similarly, if you have multiple agents running on Kubernetes that need a direct server connection, and the server needs to whitelist all agent IPs, having dedicated public IPs can be useful. These scenarios, among others, can be handled on a cloud-managed Kubernetes cluster using Node Public IP.
KubeIP is a utility that assigns a static public IP to each node it manages. The IP is allocated to the node's primary network interface, chosen from a pool of reserved static IPs using platform-supported filtering and ordering.
If there are no static public IPs left, KubeIP will hold on until one becomes available. When a node is removed, KubeIP releases the static public IP back into the pool of reserved static IPs.
https://github.com/doitintl/kubeip
The case of the vanishing CPU: A Linux kernel debugging story
https://clickhouse.com/blog/a-case-of-the-vanishing-cpu-a-linux-kernel-debugging-story
A mysterious CPU spike in ClickHouse Cloud on GCP led to months of debugging, revealing a deeper issue within the Linux kernel’s memory management. What started as random performance degradation turned into a deep dive into kernel internals, where engineer Sergei Trifonov uncovered a hidden livelock. His journey through eBPF tracing, perf analysis, and a reproducible test case ultimately led to a surprising fix - only for another kernel bug to surface right after. Curious, read on…
https://clickhouse.com/blog/a-case-of-the-vanishing-cpu-a-linux-kernel-debugging-story
pgrouting
https://github.com/pgRouting/pgrouting
pgRouting extends the PostGIS/PostgreSQL geospatial database to provide geospatial routing and other network analysis functionality.
https://github.com/pgRouting/pgrouting
rsql
https://github.com/theseus-rs/rsql
rsql is a command line SQL interface for data. rsql is a modern, feature-rich, and user-friendly client, that has been designed to be easy to use, and to provide a consistent experience across all supported data sources.
https://github.com/theseus-rs/rsql
postgresql-embedded
https://github.com/theseus-rs/postgresql-embedded
Install and run a PostgreSQL database locally on Linux, MacOS or Windows. PostgreSQL can be bundled with your application, or downloaded on demand.
https://github.com/theseus-rs/postgresql-embedded
wait4x
https://github.com/atkrad/wait4x
Wait4X is a powerful, zero-dependency tool that waits for services to be ready before continuing.
https://github.com/atkrad/wait4x
pg_mooncake
https://github.com/Mooncake-Labs/pg_mooncake
pg_mooncake is a Postgres extension that adds columnar storage and vectorized execution (DuckDB) for fast analytics within Postgres. Postgres + pg_mooncake ranks among the top 10 fastest in ClickBench.
https://github.com/Mooncake-Labs/pg_mooncake
Why I'm No Longer Talking to Architects About Microservices
https://blog.container-solutions.com/why-im-no-longer-talking-to-architects-about-microservices
https://blog.container-solutions.com/why-im-no-longer-talking-to-architects-about-microservices
Terraform Infrastructure as Code: Essential Tools for Clean, Maintainable Production Environments
https://awsmorocco.com/terraform-infrastructure-as-code-essential-tools-for-clean-maintainable-production-environments-e223bedac905
In this article, I’ll cover a few important tools that can help you keep your Terraform code in good shape. I’ll cover areas such as static analysis, documentation, security checks and ways to improve your workflow. These tools can alert you early on to potential problems, help you adhere to best practices, stimulate teamwork and reduce operational risks.
https://awsmorocco.com/terraform-infrastructure-as-code-essential-tools-for-clean-maintainable-production-environments-e223bedac905
tenv
https://github.com/tofuutils/tenv
Welcome to tenv, a versatile version manager for OpenTofu, Terraform, Terragrunt and Atmos, written in Go. Our tool simplifies the complexity of handling different versions of these powerful tools, ensuring developers and DevOps professionals can focus on what matters most - building and deploying efficiently.
https://github.com/tofuutils/tenv
Auto-scaling with KEDA Using Custom RED Metrics from Prometheus
https://medium.com/@emreblblvv/auto-scaling-with-keda-using-custom-red-metrics-from-prometheus-76d50785e442
At the beginning of the year, we made a strategic decision to rewrite and optimize our legacy services that experience high load during peak times. Recognizing that these services don’t consistently operate under the same load levels, we opted for an auto-scaling approach. This decision was driven by two primary goals: enhancing reliability and evolving towards a more cloud-native architecture.
https://medium.com/@emreblblvv/auto-scaling-with-keda-using-custom-red-metrics-from-prometheus-76d50785e442
The Power of Preview Deployments: Catching Bugs Before They Bite using Argo CD
https://medium.com/hostspaceng/the-power-of-preview-deployments-catching-bugs-before-they-bite-using-argo-cd-c21c069b04c5
A Step-by-Step Guide to Configuring Argo CD Pull Request Generator for Preview Deployments.
https://medium.com/hostspaceng/the-power-of-preview-deployments-catching-bugs-before-they-bite-using-argo-cd-c21c069b04c5
Overview of Kubernetes GPU Scheduling, Device Plugin, CDI, NFD, and GPU Operator
https://medium.com/@rifewang/overview-of-kubernetes-gpu-scheduling-device-plugin-cdi-nfd-and-gpu-operator-48a7c4213b28
https://medium.com/@rifewang/overview-of-kubernetes-gpu-scheduling-device-plugin-cdi-nfd-and-gpu-operator-48a7c4213b28
Image Management on Kubernetes Node
https://medium.com/@rifewang/image-management-on-kubernetes-node-18649e16bc26
https://medium.com/@rifewang/image-management-on-kubernetes-node-18649e16bc26
Learn Kustomize for Kubernetes — mini course
https://medium.com/@josip.cloud/learn-kustomize-for-kubernetes-course-part-1-6cc4ab945f9d
https://medium.com/@josip.cloud/learn-kustomize-for-kubernetes-course-part-1-6cc4ab945f9d
Automatic testing and rollback of your GitOps with Argo Rollouts
https://jason-umiker.medium.com/automatic-rollback-of-your-gitops-with-argo-rollouts-04f94baa2d03
https://jason-umiker.medium.com/automatic-rollback-of-your-gitops-with-argo-rollouts-04f94baa2d03
Longhorn Backup and Restore
https://reddec.net/posts/longhorn-backup-and-restore
Article breaks down using Longhorn (version 1.7) for backups, self-hosting, and recovery. It’s for seasoned DevOps and SysAdmin folks who like doing things themselves. You’ll find a step-by-step guide on setting up and restoring backups with Longhorn, Velero, and GitOps. The goal is to show you a tried-and-true setup for getting your backups and restores working smoothly.
https://reddec.net/posts/longhorn-backup-and-restore
liqo
https://github.com/liqotech/liqo
Liqo is an open-source project that enables dynamic and seamless Kubernetes multi-cluster topologies, supporting heterogeneous on-premise, cloud and edge infrastructures.
https://github.com/liqotech/liqo
Certified Kubernetes Administrator (CKA) Notes
https://github.com/ismet55555/Certified-Kubernetes-Administrator-Notes
https://github.com/ismet55555/Certified-Kubernetes-Administrator-Notes