DevOps & SRE notes – Telegram
Helpful articles and tools for DevOps&SRE

WhatsApp: https://whatsapp.com/channel/0029Vb79nmmHVvTUnc4tfp2F

For paid consultation (RU/EN), contact: @tutunak


All ways to support https://telegra.ph/How-support-the-channel-02-19
The post emphasizes that while Kubernetes is a leading container management system, it’s not exempt from malicious attacks. Therefore, understanding Kubernetes’ environment and the different vulnerabilities one can be exposed to while creating, deploying, or running applications in clusters is crucial.
https://goteleport.com/blog/how-to-hack-kubernetes
The article is a comprehensive guide to recreating the Kubernetes RBAC authorization model from scratch and practicing the relationships between Roles, ClusterRoles, ServiceAccounts, RoleBindings, and ClusterRoleBindings.

https://learnk8s.io/rbac-kubernetes
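To make the relationships between roles, bindings, and subjects concrete, here is a minimal toy model of an RBAC check in Python. It is not the article's code and not the Kubernetes implementation: the `Rule`, `Role`, `Binding`, and `can` names are made up for illustration, API groups and resource names are ignored, and a binding with `namespace=None` stands in for a ClusterRoleBinding.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class Rule:
    resources: Tuple[str, ...]  # e.g. ("pods",), "*" matches any resource
    verbs: Tuple[str, ...]      # e.g. ("get", "list"), "*" matches any verb


@dataclass(frozen=True)
class Role:
    name: str
    rules: Tuple[Rule, ...]


@dataclass(frozen=True)
class Binding:
    subject: str              # e.g. a ServiceAccount name
    role: Role
    namespace: Optional[str]  # None = cluster-wide (assumption of this toy model)


def can(bindings, subject, verb, resource, namespace):
    """Return True if any binding grants `subject` the verb on the resource."""
    for b in bindings:
        if b.subject != subject:
            continue
        # A namespaced binding only grants access inside its own namespace.
        if b.namespace is not None and b.namespace != namespace:
            continue
        for rule in b.role.rules:
            resource_ok = resource in rule.resources or "*" in rule.resources
            verb_ok = verb in rule.verbs or "*" in rule.verbs
            if resource_ok and verb_ok:
                return True
    # RBAC is purely additive: with no matching rule, access is denied.
    return False
```

For example, binding a hypothetical `pod-reader` role (get and list on pods) to a ServiceAccount in the `dev` namespace lets `can(...)` return True for listing pods in `dev`, and False for deleting them or for listing pods in `prod`.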
The blog post discusses the complexities of achieving compliance in a dynamic, ephemeral environment such as Kubernetes, and offers insights and guidance on maintaining a secure and compliant cloud environment.


https://www.armosec.io/blog/kubernetes-compliance-challenges
The article provides a detailed account of an outage experienced by Cloudflare on November 2, 2023, highlighting the causes and resolution. It discusses the unintended power failure at a data center, its impact on Cloudflare's control plane and analytics systems, and the measures taken to restore services and prevent such incidents in the future.
https://blog.cloudflare.com/post-mortem-on-cloudflare-control-plane-and-analytics-outage/
The author provides insights into automating Helm dependency updates, minimizing version gaps, and simplifying updates with Helm. The author shares a Bash script that works in tandem with the artifacthub.io API to identify and update Helm dependencies whenever changes are detected.
https://blog.devops.dev/charting-the-course-helm-dependencies-updates-made-easy-%EF%B8%8F-48656bfc59c
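The core of such an update check is comparing the version pinned for each dependency with the newest published one. Below is a rough sketch of that comparison in Python rather than the article's Bash. It makes no network calls: the `latest` mapping is assumed to have been filled in beforehand (for instance from the Artifact Hub API), the function names are hypothetical, and only exact `MAJOR.MINOR.PATCH` pins are handled, not version ranges or pre-release ordering.

```python
def parse_version(version):
    """Turn '1.2.3' or 'v1.2.3-rc1' into a comparable tuple such as (1, 2, 3)."""
    core = version.lstrip("v").split("-")[0].split("+")[0]
    return tuple(int(part) for part in core.split("."))


def outdated(dependencies, latest):
    """List (name, pinned, newest) for every dependency with a newer release.

    dependencies: list of dicts with 'name' and 'version' keys, mirroring the
                  entries under `dependencies:` in a Chart.yaml.
    latest:       dict mapping chart name to its newest known version
                  (assumed to be fetched elsewhere).
    """
    updates = []
    for dep in dependencies:
        newest = latest.get(dep["name"])
        if newest and parse_version(newest) > parse_version(dep["version"]):
            updates.append((dep["name"], dep["version"], newest))
    return updates
```

A wrapper script could then rewrite the pinned versions and run `helm dependency update` only when `outdated(...)` returns a non-empty list.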
A comprehensive guide to PagerDuty’s role in SRE practices, including setting up schedules, integrating with existing tools, automating routine tasks, and managing incidents. The guide is a must-read for anyone starting their journey in SRE or looking to sharpen their existing skills.

https://blog.devgenius.io/pagerduty-101-the-ultimate-guide-for-first-time-site-reliability-engineers-c8864dceebf0
The article provides insights into maintaining highly available applications in a Kubernetes cluster, including dealing with both voluntary and involuntary disruptions, and using a Pod Disruption Budget (PDB) to keep applications available during voluntary disruptions.

https://dev.to/oshi36/pod-disruption-budget-in-kubernetes-6kg
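The arithmetic behind a PDB can be sketched in a few lines of Python. This is a simplified illustration, not the article's code or the controller's: the function names are made up, and it assumes the documented behaviour that percentage values for `minAvailable` and `maxUnavailable` are rounded up to the nearest integer.

```python
import math


def scaled(value, total):
    """Resolve an int or a percentage string such as '50%' against total, rounding up."""
    if isinstance(value, str) and value.endswith("%"):
        return math.ceil(int(value[:-1]) * total / 100)
    return int(value)


def allowed_disruptions(expected, healthy, min_available=None, max_unavailable=None):
    """Number of pods that may be voluntarily evicted right now.

    expected: desired replica count of the workload
    healthy:  pods currently ready
    Exactly one of min_available / max_unavailable must be given.
    """
    if (min_available is None) == (max_unavailable is None):
        raise ValueError("set exactly one of min_available or max_unavailable")
    if min_available is not None:
        desired_healthy = scaled(min_available, expected)
    else:
        desired_healthy = expected - scaled(max_unavailable, expected)
    # Evictions are refused once the healthy count reaches the required floor.
    return max(0, healthy - desired_healthy)
```

With 7 replicas, all healthy, and `min_available="50%"`, the floor is 4 pods, so 3 evictions are allowed. With 5 replicas, one already unhealthy, and `max_unavailable=1`, no further voluntary eviction is allowed.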