Forwarded from KubeFM
Sai Vennam, Principal Specialist Solution Architect for Containers at AWS, shares his predictions for Kubernetes over the next decade.
He discusses how cloud providers are shifting focus from control plane optimization to the data plane, where workloads actually run. He highlights AWS's recent support for 100,000-node EKS clusters for massive training jobs, a 20x increase over upstream Kubernetes' recommended 5,000 nodes.
Watch the full interview: https://ku.bz/yXYpPR_48
This project implements a bare-metal autoscaler that scales down by running kubectl drain + poweroff on underutilized nodes and scales up by powering nodes on via Wake-on-LAN / IPMI.
It identifies node types by labels instead of predefined “node groups.”
More: https://ku.bz/jXnFwTRbR
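For readers curious about the Wake-on-LAN half of that approach, here is a minimal sketch of building and broadcasting a magic packet. It is not taken from the project: the function names, the broadcast address, and the choice of UDP port 9 are illustrative assumptions, and the project itself may do this differently.

```python
import socket


def build_magic_packet(mac: str) -> bytes:
    """Return the 102-byte Wake-on-LAN magic packet for a MAC address."""
    hex_digits = mac.replace(":", "").replace("-", "")
    if len(hex_digits) != 12:
        raise ValueError(f"expected 6 octets in MAC address, got {mac!r}")
    mac_bytes = bytes.fromhex(hex_digits)  # raises ValueError on non-hex input
    # Magic packet layout: 6 bytes of 0xFF, then the MAC repeated 16 times.
    return b"\xff" * 6 + mac_bytes * 16


def wake(mac: str, broadcast: str = "255.255.255.255", port: int = 9) -> None:
    """Broadcast the magic packet over UDP (address and port are assumptions)."""
    packet = build_magic_packet(mac)
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.setsockopt(socket.SOL_SOCKET, socket.SO_BROADCAST, 1)
        sock.sendto(packet, (broadcast, port))
```

Calling wake("aa:bb:cc:dd:ee:ff") with a placeholder MAC would send the packet on the local broadcast domain. The target NIC must have Wake-on-LAN enabled in firmware for this to have any effect.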
Forwarded from LearnKube news
This week on Learn Kubernetes Weekly 152:
🌀 A Journey Through Kafkian SplitDNS in a Multitenant Kubernetes Offering
⚙️ Under the hood: Amazon EKS Auto Mode
👩💻 Most Cloud-Native Roles are Software Engineers
🚀 Start Sidecar First: How To Avoid Snags
📈 Enhancing Kubernetes Event Management with Custom Aggregation
⚡ Non-HA Kubernetes Gotchas: Downtime and Autoscaling Pitfalls with Single Replica Workloads
Read it now: https://kube.today/issues/152
⭐️ This newsletter is brought to you by AWS — Fully automate your Kubernetes clusters with Amazon EKS Auto Mode https://ku.bz/xZWD-2-Rk
Cluster Template is an opinionated and extensible template for deploying a Talos Kubernetes cluster, including Flux for GitOps.
More: https://ku.bz/29N8gDrqP
This tutorial shows how to implement Crossplane on AWS EKS for infrastructure as code, covering setup, custom API design, and governance strategies.
More: https://ku.bz/6SCqq3slb
HwameiStor is a cloud-native local storage system designed for stateful workloads, offering features like auto expansion, backup & restore, high availability, disk health management, and control/data plane separation.
More: https://ku.bz/ppcB_9NLk
Forwarded from KubeFM
From hitting the "scaling wall" to achieving operational excellence—this is how two global enterprises transformed their Kubernetes operations.
In Episode 3 of The Making of Flux, our KubeFM original series, Philippe Ensarguet from Orange and Arnab Chatterjee from Nomura share their GitOps journey with Flux, from initial challenges to production victories at massive scale.
You will learn:
- How Orange uses Flux to manage bare-metal Kubernetes through its SYLVR project.
- Why Nomura relies on GitOps to balance agility with governance in financial services.
- How Flux helps enterprises achieve resilience, compliance, and repeatability at scale.
Watch (or listen to) it here: https://ku.bz/tWcHlJm7M
🌟 Join the Flux maintainers and community at FluxCon, November 11th in Salt Lake City— https://ku.bz/L843kg0CK
With @Birthmarkb
Forwarded from Kube Careers
How much does a Kubernetes engineer earn in Q3 2025?
Is Platform Engineering really eating DevOps' lunch?
We analyzed 509 Kubernetes job descriptions and discovered:
💰 North American salaries average $177,983 (€92,113 in Europe)
🚀 Platform Engineer roles jumped to 9% of positions (vs 4-7% last year)
👨💻 43% of jobs are for Software Engineers, but DevOps roles offer the best remote flexibility (56%)
🏠 Remote work paradox: 67% allow remote, but only 0.29% are truly location-independent
Dive into the complete State of Kubernetes Job Market Q3 2025 report: https://kube.careers/state-of-kubernetes-jobs-2025-q3
⭐️ This report is brought to you by LearnKube — get started on your Kubernetes journey through comprehensive online, in-person, or remote training. https://learnkube.com/training
This article explains how to deploy and use Kube-State-Metrics to monitor Kubernetes object states via Prometheus for cluster observability.
More: https://ku.bz/3nNd4bDkK
Forwarded from KubeFM
Niels Claeys shares how his team built a data platform processing up to 1.5 million core hours monthly. He explains the specific optimizations they discovered through production experience, from scheduler changes to achieving 97% spot instance usage without reliability issues.
You will learn:
- How to achieve 97% spot instance adoption through strategic instance type diversification, region selection, and Spark-specific techniques
- Node pool design principles that balance Kubernetes overhead with workload efficiency
- Platform-specific gotchas like AWS cross-AZ data transfer costs that can spike bills unexpectedly
Watch (or listen to) it here: https://ku.bz/hGRfkzDJW
🌟 This episode is brought to you by Testkube—the ultimate Continuous Testing Platform for Cloud Native applications. Scale fast, test continuously, and ship confidently https://ku.bz/lnxYK3s0L
With @Birthmarkb "Almost 40" Farrell
Project Quay runs as a service inside or outside Kubernetes, storing images in S3 or local storage.
It scans images for vulnerabilities with Clair, supports image signing, and enforces repository access and security policies via webhooks and RBAC.
More: https://ku.bz/mXXL2JPl4
Forwarded from LearnKube news
This week on Learn Kubernetes Weekly 153:
🌍 Why Environments Beat Clusters for Developer Experience
🧩 Image Compatibility in Cloud Native Environments
🔁 From Terraform to Crossplane: Real-World IaC in Kubernetes for AWS
📊 Why Kube-State-Metrics Matters for Kubernetes Observability
⚙️ Optimising Kubernetes Deployment with Local Continuous Development Tooling
Read it now: https://kube.today/issues/153
⭐️ This newsletter is brought to you by Testkube - your app is Kubernetes-native, your testing should be too. Run any kind of test automation with the help of the platform built for it https://ku.bz/Zfrty_fcC
This case study shows how to implement a multi-cluster reconciler to manage Kubernetes resources across sharded clusters for fault tolerance.
It covers sharding stateless workloads across 3 clusters to limit the impact of infrastructure failures.
More: https://ku.bz/1HTWb0GLC
This project allows you to deploy and manage Kubernetes clusters directly on Proxmox VE to create a private Kubernetes cloud platform.
More: https://ku.bz/3DC5Dtmzj
This tutorial explains how to expose Kubernetes services without relying on cloud LoadBalancer support, using MetalLB + NGINX Ingress to provide stable IPs and path-based routing on bare-metal/air-gapped clusters.
More: https://ku.bz/CDWB9HJg7
Forwarded from Kubesploit
cnquery is a command-line tool that lets you inspect and query your cloud, Kubernetes, and servers from one place.
More: https://ku.bz/Jml2KcQ-N
Forwarded from KubeFM
🎥 The Making of Flux finale: From GitOps tool to platform backbone
Episode 4 brings together the platform builders—GitLab, Microsoft, and Mirantis—who are embedding Flux at the heart of their enterprise offerings.
Bryan Ross (GitLab), Jane Yan (Microsoft), Sean O'Meara, and William Rizzo (Mirantis) reveal how GitOps has evolved from experiment to essential infrastructure.
Key insights:
- Why Microsoft chose Flux for Azure Arc's managed GitOps service
- How GitLab bridges the CI/CD to infrastructure gap with Flux
- Mirantis's vision for multi-cluster platform engineering with Cordant
Plus: Bryan's take on how AI will transform GitOps workflows (spoiler: less YAML, more architecture thinking).
Watch the series finale: https://ku.bz/tVqKwNYQH
🌟 Join the Flux maintainers and community at FluxCon, November 11th in Atlanta—register here
With @Birthmarkb
Kraken is a P2P-powered Docker registry that focuses on scalability and availability.
It is designed for Docker image management, replication, and distribution in a hybrid cloud environment.
More: https://ku.bz/Hvt7Zs8wg
Forwarded from KubeFM
Mai Nishitani, Director of Enterprise Architecture at NTT Data and AWS Community Builder, demonstrates how Model Context Protocol (MCP) enables Claude to directly interact with Kubernetes clusters through natural language commands.
You will learn:
- How MCP servers work and why they're significant for standardizing AI integration with DevOps tools, moving beyond custom integrations to a universal protocol
- The practical capabilities and critical limitations of AI in Kubernetes operations
- Why fundamental troubleshooting skills matter more than ever as AI abstractions can fail in unexpected ways
Watch (or listen to) it here: https://ku.bz/3hWvQjXxp
🌟 This episode is brought to you by Testkube—the ultimate Continuous Testing Platform for Cloud Native applications. Scale fast, test continuously, and ship confidently https://ku.bz/lnxYK3s0L
With @Birthmarkb "Hip hop back up dancer" Farrell
This article shows how to mimic cloud load balancer behavior in bare-metal Kubernetes using Layer 2 ARP/NDP or BGP routing (via MetalLB or Cilium) to expose LoadBalancer services.
More: https://ku.bz/D9BWpg4Sq
Forwarded from LearnKube news
This week on Learn Kubernetes Weekly 154:
🧩 Kubernetes Observability: Troubleshooting Packet Drops
⚙️ We Broke Our EKS Cluster Autoscaler and Fixed It
🌐 Managing Kubernetes Resources Across Multiple Clusters
🐝 From kube-proxy to eBPF (Cilium)
🚧 Diagnosing API Server Communication Issues
Read it now: https://kube.today/issues/154
⭐️ This newsletter is brought to you by Heroku — Discover the thriving ecosystem of contributors, companies, and career paths in the Kubernetes World book. Reserve your copy now https://ku.bz/B0nqF7jBW