NEW BOT Телеграм, страница

DevOps & SRE notes

Managing Prometheus alerts in Kubernetes at scale can be challenging, but using GitOps practices can streamline this process. This article from Faun explores how to manage Prometheus alerts in large-scale Kubernetes environments using a GitOps approach. Learn how to automate, version control, and maintain consistency in your alerting configurations, ensuring efficient and reliable monitoring across your clusters.

https://faun.pub/managing-prometheus-alerts-in-kubernetes-at-scale-using-gitops-25d0ab4a2e2d

Medium

Managing Prometheus alerts in Kubernetes at scale using GitOps

Prometheus is a popular open-source monitoring and alerting solution. It is widely used in the Kubernetes ecosystem and is a part of the…

1.6K viewstutunak, edited 12:59

DevOps & SRE notes

Managing authentication across multiple AWS accounts in Terraform can be complex, but understanding the right techniques is crucial for secure and efficient operations. This article by Hector Reyes Alemán provides a comprehensive guide on using the Terraform AWS provider for multi-account authentication. Learn about the best practices, tools, and configurations needed to manage authentication seamlessly across different AWS accounts in your Terraform projects.

https://hector-reyesaleman.medium.com/terraform-aws-provider-everything-you-need-to-know-about-multi-account-authentication-and-f2343a4afd4b

Medium

Terraform AWS Provider — Everything you need to know about Multi-Account Authentication and…

There are multiple options available to configure the authentication between Terraform and AWS. But the two entry points are IAM users and…

👍2

1.61K viewstutunak, edited 05:59

DevOps & SRE notes

Move Kubernetes PVCs between Storage Classes and Namespaces

https://github.com/BeryJu/korb

GitHub

GitHub - BeryJu/korb: Move Kubernetes PVCs between Storage Classes and Namespaces

Move Kubernetes PVCs between Storage Classes and Namespaces - BeryJu/korb

1.64K viewstutunak, edited 12:59

DevOps & SRE notes

Run Kubernetes on MySQL, Postgres, sqlite, dqlite, not etcd.

https://github.com/k3s-io/kine

GitHub

GitHub - k3s-io/kine: Run Kubernetes on MySQL, Postgres, sqlite, not etcd.

Run Kubernetes on MySQL, Postgres, sqlite, not etcd. - k3s-io/kine

1.59K viewstutunak, edited 05:59

DevOps & SRE notes

Effective alerting is a cornerstone of observability, but it requires careful planning and execution. This article by Let Athena Sleep discusses the dos and don'ts of creating an effective alerting strategy. Learn about the best practices for setting up alerts that are actionable, minimize noise, and enhance your overall observability, ensuring that you stay informed and responsive to critical issues.

https://medium.com/@letathenasleep/alerting-the-dos-and-don-ts-for-effective-observability-139db9fb49d1

Medium

Sofia’s Observability Odyssey: The Do’s and Don’ts for Effective Observability

Before we begin, meet Sofia:

👍1

1.64K viewstutunak, edited 12:59

DevOps & SRE notes

In this thought-provoking article, Justin Garrison discusses AWS services that he believes should be canceled due to redundancy, lack of relevance, or better alternatives. He explores why some services no longer serve their original purpose or have been surpassed by more efficient solutions, offering insights into the ever-evolving cloud landscape. Learn about the importance of simplifying service offerings to enhance efficiency and focus on better tools.

https://justingarrison.com/blog/2024-08-05-more-aws-services-they-should-cancel/

Justin Garrison

21 More AWS Services They Should Cancel

Please Amazon 🙏 kill these services too.

1👍3

1.56K viewstutunak, edited 08:00

DevOps & SRE notes

Full feature EKS cluster with Terragrunt/Terraform

https://github.com/particuleio/teks

GitHub

GitHub - particuleio/teks: Full feature EKS cluster with Terragrunt/Terraform

Full feature EKS cluster with Terragrunt/Terraform - particuleio/teks

1.59K viewstutunak, edited 14:59

DevOps & SRE notes

A sidecar app which clones a git repo and keeps it in sync with the upstream.

https://github.com/kubernetes/git-sync

GitHub

GitHub - kubernetes/git-sync: A sidecar app which clones a git repo and keeps it in sync with the upstream.

A sidecar app which clones a git repo and keeps it in sync with the upstream. - kubernetes/git-sync

1.65K viewstutunak, edited 16:00

DevOps & SRE notes

🚀 Effortlessly create and deploy your own one-link website on GitHub. A free Linktree alternative.

https://github.com/thevahidal/jake

GitHub

GitHub - thevahidal/jake: 🚀 Effortlessly create and deploy your own one-link website on GitHub. A free Linktree alternative.

🚀 Effortlessly create and deploy your own one-link website on GitHub. A free Linktree alternative. - thevahidal/jake

👍1

1.58K viewstutunak, edited 07:59

DevOps & SRE notes

Testing Terraform resources is essential for ensuring that your infrastructure as code is reliable and functions as expected. This article from Better Programming introduces the basics of Terraform resource testing, covering the tools, frameworks, and best practices to validate your Terraform configurations. Learn how to implement effective testing strategies to catch errors early and maintain high-quality infrastructure code.

https://betterprogramming.pub/terraform-resource-testing-101-c9da424faaf3

Medium

Terraform Resource Testing 101

Validate code quality with a Terraform plan using JSON and Pytest

👍2

1.73K viewstutunak, edited 13:01

DevOps & SRE notes

P2P Docker registry capable of distributing TBs of data in seconds

https://github.com/uber/kraken

GitHub

GitHub - uber/kraken: P2P Docker registry capable of distributing TBs of data in seconds

P2P Docker registry capable of distributing TBs of data in seconds - uber/kraken

👍1

1.85K viewstutunak, edited 16:01

DevOps & SRE notes

Managing infrastructure as code with Terraform provides significant benefits, especially at scale. This article from Cloudflare details how they use Terraform to manage and automate their infrastructure. Discover the best practices, challenges, and strategies Cloudflare employs to optimize their Terraform workflows and achieve seamless, scalable infrastructure management.

https://blog.cloudflare.com/terraforming-cloudflare-at-cloudflare/

The Cloudflare Blog

How Cloudflare uses Terraform to manage Cloudflare

Cloudflare uses the Cloudflare Terraform provider extensively to make changes to our internal accounts as easy as opening a pull request.

👍1

1.64K viewstutunak, edited 08:59

DevOps & SRE notes

Tool to evaluate upgrade confidence for Kubernetes cluster addons

https://github.com/FairwindsOps/gonogo

GitHub

GitHub - FairwindsOps/gonogo: [alpha] Tool to evaluate upgrade confidence for Kubernetes cluster addons

[alpha] Tool to evaluate upgrade confidence for Kubernetes cluster addons - FairwindsOps/gonogo

1.61K viewstutunak, edited 13:01

DevOps & SRE notes

Calico and Kubernetes work together to provide a powerful solution for implementing robust network policies in cloud-native environments. This article from Faun discusses how Calico enhances Kubernetes' native networking capabilities, offering greater control, security, and flexibility. Learn how to configure and use Calico for managing network policies, improving both security and performance in your Kubernetes clusters.

https://faun.pub/calico-and-kubernetes-a-perfect-pair-for-robust-network-policy-2b91eb4eec44

Medium

Calico and Kubernetes: A Perfect Pair for Robust Network Policy

In K8s, a network policy which defines how pods are allowed to communicate with each other.

👍2❤1

1.78K viewstutunak, edited 16:00

DevOps & SRE notes

Preflight Checks and Support Bundles Framework for Kubernetes Applications

https://github.com/replicatedhq/troubleshoot

GitHub

GitHub - replicatedhq/troubleshoot: Preflight Checks and Support Bundles Framework for Kubernetes Applications

Preflight Checks and Support Bundles Framework for Kubernetes Applications - replicatedhq/troubleshoot

1👍3

1.68K viewstutunak, edited 09:01

DevOps & SRE notes

Understanding the Kubernetes kubeconfig file is crucial for managing access to your Kubernetes clusters. This article from DevOpsCube provides a comprehensive guide on the structure, usage, and best practices of the kubeconfig file. Learn how to configure, manage, and secure your kubeconfig to ensure efficient and secure interactions with your Kubernetes clusters.

https://devopscube.com/kubernetes-kubeconfig-file/

DevOpsCube – Easy DevOps, SRE Guides & Reviews

Kubeconfig File Explained (Examples, Usage and Configuration)

In this blog, you’ll learn what a Kubeconfig file is and how to create and use one to connect to a Kubernetes cluster with hands-on examples.

❤2

1.76K viewstutunak, edited 16:00

DevOps & SRE notes

Managing Grafana as code allows for consistent, version-controlled monitoring setups across environments. This comprehensive guide from Grafana covers tools, tips, and best practices for managing Grafana dashboards and configurations as code. Learn how to automate and streamline your Grafana deployments using various tools to enhance observability and maintain monitoring consistency.

https://grafana.com/blog/2022/12/06/a-complete-guide-to-managing-grafana-as-code-tools-tips-and-tricks/

Grafana Labs

Grafana as code: A complete guide to tools, tips, and tricks

Learn about some as code tools you can use to declaratively manage Grafana resources and find out how to incorporate them into your own use cases.

1.77K viewstutunak, edited 09:01

DevOps & SRE notes

K8s controller to manage the aws-auth configmap

https://github.com/maruina/aws-auth-manager

GitHub

GitHub - maruina/aws-auth-manager: K8s controller to manage the aws-auth configmap

K8s controller to manage the aws-auth configmap. Contribute to maruina/aws-auth-manager development by creating an account on GitHub.

1👍3

1.69K viewstutunak, edited 16:01

DevOps & SRE notes

A cat(1) clone with wings.

https://github.com/sharkdp/bat

GitHub

GitHub - sharkdp/bat: A cat(1) clone with wings.

A cat(1) clone with wings. Contribute to sharkdp/bat development by creating an account on GitHub.

❤3👍1

1.63K viewstutunak, edited 09:02

DevOps & SRE notes

Restricting cluster admin permissions in Kubernetes is essential for maintaining security and preventing unauthorized access. This article by Marcus Noble provides insights into best practices for limiting cluster admin privileges. Learn how to effectively manage roles and permissions to enhance the security of your Kubernetes environment and protect your infrastructure from potential threats.

https://marcusnoble.co.uk/2022-01-20-restricting-cluster-admin-permissions/

Marcus Noble

Restricting cluster-admin Permissions

Generally, and by default, operators of the cluster are assigned to the cluster-admin ClusterRole. This gives the user access and permission to do all operations on all resources in the cluster. There's very good reason for this, an admin generally needs…

👍2👎1

1.61K viewstutunak, edited 16:01

DevOps & SRE notes

Scaling Site Reliability Engineering (SRE) teams is crucial for maintaining high availability and performance as organizations grow. This article from DZone explores strategies for expanding SRE teams, including building scalable processes, leveraging automation, and fostering a culture of collaboration. Learn how to effectively scale your SRE practices to support the evolving needs of your organization.

https://dzone.com/articles/scaling-sre-teams

DZone

Scaling SRE Teams

Scaling teams of site reliability engineers comes with many challenges. Here, explore the challenges of scaling and review a successful scaling framework.

👍1

1.68K viewstutunak, edited 09:01

About

Blog

Apps

Platform