How We Saved 10s of Thousands of Dollars Deploying Low Cost Open Source AI Technologies At Scale with Kubernetes
https://dev.to/opensauced/how-we-saved-10s-of-thousands-of-dollars-deploying-low-cost-open-source-ai-technologies-at-scale-with-kubernetes-57j8
When you first start building AI applications with generative AI, you'll likely end up using OpenAI's API at some point in your project's journey. And for good reason! Their API is well-structured, fast, and supported by great libraries. At a small scale or when you’re just getting started, using OpenAI can be relatively economical. There’s also a huge amount of really great educational material out there that walks you through the process of building AI applications and understanding complex techniques using OpenAI’s API.
Kubectl Scale Deployment to 0
https://stormforge.io/kubernetes-autoscaling/kubectl-scale-deployment-to-0
Scaling in Kubernetes refers to adjusting the number of pod replicas or modifying the resource allocations of pods to meet application demand, ensuring optimal performance, availability, and resource utilization in a cluster. Kubernetes supports both horizontal and vertical scaling: Horizontal scaling increases or decreases the number of pod replicas while vertical scaling adjusts the resource requests (CPU and memory) of existing pods.
In this article, you’ll learn how to use the kubectl command with the scale subcommand to perform horizontal scaling manually. You’ll also see how and when to use kubectl to scale a deployment to 0, which means reducing the number of running pods of an application or workload to zero, and the limitations of this action. Additionally, you’ll learn how Kubernetes Event-Driven Autoscaling (KEDA) can fill in the gaps left by the Kubernetes Horizontal Pod Autoscaler (HPA). Finally, we highlight StormForge’s features for efficient horizontal and vertical scaling.
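As a rough illustration of the manual scaling described above, the sketch below only builds the `kubectl scale` argument list and prints it; it does not contact a cluster. The deployment name `web` and the namespace `staging` are hypothetical placeholders, not taken from the article.

```python
import shlex
from typing import List


def scale_command(deployment: str, replicas: int, namespace: str = "default") -> List[str]:
    """Build the argv for `kubectl scale` without executing it."""
    if replicas < 0:
        raise ValueError("replicas must be >= 0")
    return [
        "kubectl", "scale", f"deployment/{deployment}",
        f"--replicas={replicas}",
        "--namespace", namespace,
    ]


if __name__ == "__main__":
    # Hypothetical deployment and namespace, for illustration only.
    # Scaling to 0 stops every pod of the deployment.
    print(shlex.join(scale_command("web", 0, namespace="staging")))
```

To actually run the command, the returned list could be passed to `subprocess.run(..., check=True)` on a machine where `kubectl` is configured for the target cluster.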
Recent Docker BuildKit Features You're Missing Out On
https://martinheinz.dev/blog/111
With the introduction of BuildKit, the improved builder backend for Docker, many new features have been added to Docker, many of which are little known. So, here's a rundown of the ones you definitely need to know about and should start using to make better use of Docker.
How to Structure Your Argo CD Repositories Using Application Sets
https://medium.com/containers-101/how-to-structure-your-argo-cd-repositories-using-application-sets-1150e75d05b3
Solving CPU throttling issue in Golang applications before hitting the CPU limit in Kubernetes
https://medium.com/@sharyash81/solving-cpu-throttling-issue-in-golang-applications-before-hitting-the-cpu-limit-in-kubernetes-7d8f40da6477
We faced an issue in our Kubernetes cluster in which certain multi-threaded Golang applications with a CPU limit set were being throttled before reaching their designated limits.
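The blurb does not say what caused the throttling. One commonly cited explanation for this symptom, offered here only as an assumption, is that a process runs more busy threads than its CPU quota allows, so the quota for a scheduling period is spent in a short burst and the process then waits for the next period. The sketch below works through that arithmetic, assuming the default 100 ms CFS period; the function names are hypothetical and the thread cap is a simple heuristic, not the article's stated fix.

```python
import math

DEFAULT_PERIOD_US = 100_000  # default CFS period: 100 ms


def suggested_thread_cap(quota_us: int, period_us: int = DEFAULT_PERIOD_US) -> int:
    """Heuristic cap on parallel threads: whole CPUs in the quota, at least 1."""
    return max(1, math.floor(quota_us / period_us))


def wall_ms_before_throttle(quota_us: int, busy_threads: int) -> float:
    """Wall-clock milliseconds until the quota is spent when all threads stay busy."""
    return quota_us / busy_threads / 1000


if __name__ == "__main__":
    quota_us = 200_000  # a 2-CPU limit expressed as quota per 100 ms period
    # Eight busy threads spend 200 ms of CPU time in 25 ms of wall time,
    # then sit throttled for the remaining 75 ms of the period.
    print(wall_ms_before_throttle(quota_us, busy_threads=8))
    print(suggested_thread_cap(quota_us))
```

Under this assumption, capping parallelism near the quota (for a Go program, via `GOMAXPROCS`) spreads the same CPU time across the whole period instead of a short burst.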
Service Meshes Decoded Part One: A performance comparison of Istio vs Linkerd vs Cilium
https://livewyer.io/blog/2024/05/08/comparison-of-service-meshes
A service mesh is a dedicated infrastructure layer that facilitates service-to-service communications between services or microservices using a proxy.
how istio dns proxy improve dns performance, capabilities to resolve dns inter mesh cluster or outside mesh cluster, and distinguish multiple TCP services on the same port
https://medium.com/@espinaladrinaldi/how-istio-dns-proxy-improve-dns-performance-capabilities-to-resolve-dns-inter-mesh-cluster-or-546e03a44610
alpine, distroless or scratch?
https://medium.com/google-cloud/alpine-distroless-or-scratch-caac35250e0b
I recently migrated the 4 Golang apps of the Online Boutique sample apps from alpine to scratch and its .NET app from alpine to chiseled. Here are some of the things I have learned while doing that.
Two-node HA Kubernetes for edge computing cost savings
https://itnext.io/two-node-ha-kubernetes-for-edge-computing-cost-savings-9a009eb076ac
Our 2-node HA architecture uses Spectro Cloud’s existing, battle-tested edge solution, which builds upon open source components including kairos, k3s, kube-vip, harbor, and system-upgrade-controller.
Talos - An Immutable OS for Kubernetes
https://a-cup-of.coffee/blog/talos
For some time now, I have been interested in Talos, an operating system for Kubernetes. I installed my first Talos cluster in November 2023, and my “production” (composed of 3 Raspberry Pi) is now running on this OS.
Automating Deployments with FluxCD in AKS
https://gagovictor.medium.com/automating-deployments-with-fluxcd-in-aks-60c3814502bf
multus-cni
https://github.com/k8snetworkplumbingwg/multus-cni
Multus CNI enables attaching multiple network interfaces to pods in Kubernetes.
kube-startup-cpu-boost
https://github.com/google/kube-startup-cpu-boost
Kube Startup CPU Boost is a controller that increases CPU resource requests and limits during Kubernetes workload startup time. Once the workload is up and running, the resources are set back to their original values.
rbac-wizard
https://github.com/pehlicd/rbac-wizard
RBAC Wizard is a tool that helps you visualize and analyze the RBAC configurations of your Kubernetes cluster. It provides a graphical representation of the Kubernetes RBAC objects.
cluster-template
https://github.com/onedr0p/cluster-template
A template for deploying a Talos Kubernetes cluster including Flux for GitOps
Hot Take: Don't provide incident resolution estimates
https://firehydrant.com/blog/hot-take-dont-provide-incident-resolution-estimates
Continuous reinvention: A brief history of block storage at AWS
https://www.allthingsdistributed.com/2024/08/continuous-reinvention-a-brief-history-of-block-storage-at-aws.html
Marc Olson has been part of the team shaping Elastic Block Store (EBS) for over a decade. In that time, he’s helped to drive the dramatic evolution of EBS from a simple block storage service relying on shared drives to a massive network storage system that delivers over 140 trillion daily operations.
In this post, Marc provides a fascinating insider’s perspective on the journey of EBS. He shares hard-won lessons in areas such as queueing theory, the importance of comprehensive instrumentation, and the value of incrementalism versus radical changes. Most importantly, he emphasizes how constraints can often breed creative solutions. It’s an insightful look at how one of AWS’s foundational services has evolved to meet the needs of our customers (and the pace at which they’re innovating).
Why I don’t like discussing action items during incident reviews
https://surfingcomplexity.blog/2024/09/28/why-i-dont-like-discussing-action-items-during-incident-reviews
Syncing PagerDuty Schedules to Slack Groups
https://www.honeycomb.io/blog/syncing-pagerduty-schedules-slack-groups