NEW BOT Телеграм, страница

How We Saved 10s of Thousands of Dollars Deploying Low Cost Open Source AI Technologies At Scale with Kubernetes

When you first start building AI applications with generative AI, you'll likely end up using OpenAI's API at some point in your project's journey. And for good reason! Their API is well-structured, fast, and supported by great libraries. At a small scale or when you’re just getting started, using OpenAI can be relatively economical. There’s also a huge amount of really great educational material out there that walks you through the process of building AI applications and understanding complex techniques using OpenAI’s API.

https://dev.to/opensauced/how-we-saved-10s-of-thousands-of-dollars-deploying-low-cost-open-source-ai-technologies-at-scale-with-kubernetes-57j8

4.33K views15:00

DevOps&SRE Library

Kubectl Scale Deployment to 0

Scaling in Kubernetes refers to adjusting the number of pod replicas or modifying the resource allocations of pods to meet application demand, ensuring optimal performance, availability, and resource utilization in a cluster. Kubernetes supports both horizontal and vertical scaling: Horizontal scaling increases or decreases the number of pod replicas while vertical scaling adjusts the resource requests (CPU and memory) of existing pods.

In this article, you’ll learn how to use and apply the kubectl command with the scale subcommand to manually execute horizontal scaling. You’ll also see how and when to use kubectl to scale deployment to 0, which means reducing the number of running pods of an application or a workload to zero, and the limitations of this action. Additionally, you’ll learn how Kubernetes Event-Driven Autoscaling (KEDA) can fill in the gaps left by the Kubernetes Horizontal Pod Autoscaler (HPA). Finally, we highlight StormForge’s features for efficient horizontal and vertical scaling.

https://stormforge.io/kubernetes-autoscaling/kubectl-scale-deployment-to-0

4.03K views07:02

DevOps&SRE Library

Recent Docker BuildKit Features You're Missing Out On

With introduction of BuildKit - the improved builder backend for Docker - many new features has been added to Docker, many of which are little known. So, here's a rundown of the ones you definitely need to know about and should start using to make better use of Docker.

https://martinheinz.dev/blog/111

4.24K views15:02

DevOps&SRE Library

How to Structure Your Argo CD Repositories Using Application Sets

https://medium.com/containers-101/how-to-structure-your-argo-cd-repositories-using-application-sets-1150e75d05b3

3.96K views07:02

DevOps&SRE Library

Solving CPU throttling issue in Golang applications before hitting the CPU limit in Kubernetes

We faced an issue within our Kubernetes cluster wherein certain multi-threaded Golang applications, for which CPU limit has been set, are experiencing throttling before reaching their designated limits.

https://medium.com/@sharyash81/solving-cpu-throttling-issue-in-golang-applications-before-hitting-the-cpu-limit-in-kubernetes-7d8f40da6477

3.71K views15:01

DevOps&SRE Library

Service Meshes Decoded Part One: A performance comparison of Istio vs Linkerd vs Cilium

A service mesh is a dedicated infrastructure layer that facilitates service-to-service communications between services or microservices using a proxy.

https://livewyer.io/blog/2024/05/08/comparison-of-service-meshes

3.83K views07:01

DevOps&SRE Library

how istio dns proxy improve dns performance, capabilities to resolve dns inter mesh cluster or outside mesh cluster, and distinguish multiple TCP services on the same port

https://medium.com/@espinaladrinaldi/how-istio-dns-proxy-improve-dns-performance-capabilities-to-resolve-dns-inter-mesh-cluster-or-546e03a44610

3.77K views15:01

DevOps&SRE Library

alpine, distroless or scratch?

I recently migrated the 4 Golang apps of the Online Boutique sample apps from alpine to scratch and its .NET app from alpine to chiseled. Here are some the things I have learned while doing that.

https://medium.com/google-cloud/alpine-distroless-or-scratch-caac35250e0b

3.83K views07:02

DevOps&SRE Library

Two-node HA Kubernetes for edge computing cost savings

Our 2-node HA architecture uses Spectro Cloud’s existing, battle-tested edge solution, which builds upon open source components including kairos, k3s, kube-vip, harbor, and system-upgrader-controller.

https://itnext.io/two-node-ha-kubernetes-for-edge-computing-cost-savings-9a009eb076ac

3.79K views15:02

DevOps&SRE Library

Talos - An Immutable OS for Kubernetes

For some time now, I have been interested in Talos, an operating system for Kubernetes. I installed my first Talos cluster in November 2023, and my “production” (composed of 3 Raspberry Pi) is now running on this OS.

https://a-cup-of.coffee/blog/talos

3.9K views07:01

DevOps&SRE Library

Automating Deployments with FluxCD in AKS

https://gagovictor.medium.com/automating-deployments-with-fluxcd-in-aks-60c3814502bf

3.79K views15:00

DevOps&SRE Library

multus-cni

Multus CNI enables attaching multiple network interfaces to pods in Kubernetes.

https://github.com/k8snetworkplumbingwg/multus-cni

3.83K views07:00

DevOps&SRE Library

kube-startup-cpu-boost

Kube Startup CPU Boost is a controller that increases CPU resource requests and limits during Kubernetes workload startup time. Once the workload is up and running, the resources are set back to their original values.

https://github.com/google/kube-startup-cpu-boost

5.58K views15:02

DevOps&SRE Library

rbac-wizard

RBAC Wizard is a tool that helps you visualize and analyze the RBAC configurations of your Kubernetes cluster. It provides a graphical representation of the Kubernetes RBAC objects.

https://github.com/pehlicd/rbac-wizard

4.4K views07:02

DevOps&SRE Library

kubediff

Source VS Deployed

https://github.com/Ramilito/kubediff

3.97K views15:02

DevOps&SRE Library

cluster-template

A template for deploying a Talos Kubernetes cluster including Flux for GitOps

https://github.com/onedr0p/cluster-template

3.69K views07:01

DevOps&SRE Library

Hot Take: Don't provide incident resolution estimates

https://firehydrant.com/blog/hot-take-dont-provide-incident-resolution-estimates

3.48K views15:01

DevOps&SRE Library

Continuous reinvention: A brief history of block storage at AWS

Marc Olson has been part of the team shaping Elastic Block Store (EBS) for over a decade. In that time, he’s helped to drive the dramatic evolution of EBS from a simple block storage service relying on shared drives to a massive network storage system that delivers over 140 trillion daily operations.

In this post, Marc provides a fascinating insider’s perspective on the journey of EBS. He shares hard-won lessons in areas such as queueing theory, the importance of comprehensive instrumentation, and the value of incrementalism versus radical changes. Most importantly, he emphasizes how constraints can often breed creative solutions. It’s an insightful look at how one of AWS’s foundational services has evolved to meet the needs of our customers (and the pace at which they’re innovating).

https://www.allthingsdistributed.com/2024/08/continuous-reinvention-a-brief-history-of-block-storage-at-aws.html

3.59K views07:02

DevOps&SRE Library

Why I don’t like discussing action items during incident reviews

https://surfingcomplexity.blog/2024/09/28/why-i-dont-like-discussing-action-items-during-incident-reviews

3.9K views15:01

DevOps&SRE Library

Syncing PagerDuty Schedules to Slack Groups

https://www.honeycomb.io/blog/syncing-pagerduty-schedules-slack-groups

3.56K views07:01

DevOps&SRE Library

docmost

Open-source collaborative wiki and documentation software.

https://github.com/docmost/docmost

3.77K views15:01

About

Blog

Apps

Platform