Reddit DevOps – Telegram
How's Debian for enterprise workflows in the cloud?

I’ve been curious about how people approach Debian in enterprise or team setups, especially when running it on cloud platforms like AWS, Azure, or GCP.

For those who’ve tried Debian in cloud environments:

Do you find a desktop interface actually useful for productivity or do you prefer going full CLI?

Any must-have tools you pre-install for dev or IT workflows?

How does Debian compare to Ubuntu, AlmaLinux or others in terms of stability and updates for enterprise workloads?

Do you run it as a daily driver in the cloud or more for testing and prototyping?

Would love to hear about real experiences, what worked, what didn’t, and any tips or gotchas for others considering Debian in enterprise cloud ops.

https://redd.it/1nowuvi
@r_devops
Looking for some advice on a deployment as a Jr

Hey folks,

I’m a software dev by trade, not a DevOps engineer, but I’ve landed in the deep end. My company is tiny staff-wise (it’s just me and one other guy), but we run a huge infrastructure — we’re basically our own ISP.

I’ve been tasked with rolling out a network monitoring system (NMS) for everything, and it needs to be highly available. After a lot of research, here’s the plan I came up with:

• Infra: vSphere / VMware, spread across 3 datacenters (no cloud).

• Cluster: Kubernetes with Talos, 5 control planes (2-2-1 across the DCs for quorum).

• CNI: Cilium.
• CSI: Mayastor.
• Monitoring: Zabbix via Helm chart.

I’ve spent hundreds of hours digging into this (Kubernetes, HA design, storage, CNIs, etc.), and I’ve definitely learned a ton. But I’m still not sure if I’m on the right track:

• Will this actually work the way I think it will?
• Is this anywhere close to “best practice”?
• Or… did I just massively overengineer this when there might be a simpler HA setup?

Constraints:

• No cloud — fully self-hosted.
• Storage available: NFS / TrueNAS / ZFS.
• Needs to handle large-scale infra, but the ops team is literally 2 people.

Ask: If you’ve deployed HA Zabbix (or any big NMS) — does this setup make sense? Should I stick with the K8s + Talos route, or would you recommend something more straightforward?

Any advice, feedback, or gotchas would mean a lot.

https://redd.it/1notg9a
@r_devops
Interview Test Prep suggestions for Oracle SRE-DevOps position?

I have a technical interview scheduled for a DevOps position at Oracle (the new health division) and there will be a noscripting test as part of it. It could either be Python or PowerShell, I'll probably do Python since I've worked with it more than PowerShell recently. I'd rank myself as intermediate with Python... I can get the job done but don't have much memorized. I didn't get to use Python in my last DevOps position because so I'm not even familiar with what people build in it.

Any suggestions on prepping? The phone screen interviewer didn't provide any direction to narrow it down from "Python" and I'm wondering what to expect or what will likely be in the test. She said they use Hackerrank and I got on there and started going through challenges but I can't imagine a lot of what I've done so far is what's going to be expected. I also have 3 or 4 different languages rolling around in my head and I know I'll get tripped up on syntax.

Any help is appreciated!

https://redd.it/1np0hdl
@r_devops
New hires, what helped you land the job??

4 years DevOps and overall 10 years IT experience. I’ve been looking since January (remove & even Raleigh, NC). Thousands of applications and the only 10 interviews I’ve gotten, I’ve been passed by other candidates and unsure why.

I’ve tried the LinkedIn Ai to tweak my resume, jobhire.ai to mass apply, endless resume ATS checkers, I’m honestly too burnt out to keep applying. Even putting freelance work on my resume

Has anything specific worked for yall? Any new tech I should be specifically looking at like azure, kubernetes, or terraform?

https://redd.it/1np08hp
@r_devops
Need guidance for Platform Engineer interview prep (Istio, K8s, AWS, Terraform, CI/CD)

Hi everyone,
I’ve got a technical interview coming up for a Platform role at a foreign MNC (payment domain). The JD mentions 3–5 years of experience, but I’ve only got about 2 years. Somehow my resume matched and I got the call.

The role mainly requires Istio, Kubernetes, AWS, Terraform, and CI/CD. I’ve worked with these technologies before, but I don’t feel super confident about how deep I should go or what to focus on for interview prep. I worked in startup so I kept hands on all most all the tools they required but I am afraid what if loose this opportunity, I am being preparing since last 2-3 days with some chatgpt mock interview and practicing python noscripting.

The interviewer will be from Brazil (I’m based in India), and I’m not sure what kind of questions to expect.

Can anyone suggest how I should prepare, especially for interviews at this level? Maybe some resources, topics to prioritize, or typical questions asked in such roles?

Thank you in advance

https://redd.it/1np26t4
@r_devops
Path to AWS devOps for very beginner

Hi everyone, I’m 30 and lately I’ve been thinking about learning AWS to land a job in 2026. Back in my 20s I went to IT school, so I’m somewhat familiar with technologies, but I haven’t really done anything hands-on in a long time since I was focused on other things.

I’d love your honest opinion — is it too late for me to start now?

Also, if anyone can recommend some good beginner-friendly courses, I’d really appreciate it


https://redd.it/1np3ubo
@r_devops
[FREE] AI-Powered Veo 3 Script Writer – Looking for Beta Testers! 🎬🤖

Hey r/devops 👋

I’ve built a free web tool called Veo 3 Script Writer that helps creators turn plain text into production-ready Veo 3 video noscripts.
It’s live now and I’d love some early feedback from the Reddit community.

https://preview.redd.it/fyhb1j8cr1rf1.png?width=1276&format=png&auto=webp&s=b951c7cf4046ef8c35b4ad8432acb1806b2f2b6a

https://preview.redd.it/p71p6dwgr1rf1.png?width=1546&format=png&auto=webp&s=154ad9f2a696036dfcbbbfd1645541979a112655

# What it Does

* Intelligent dialogue detection – automatically finds every line of spoken text.
* Visual prompt generation – creates scene cards and cinematic prompts ready for Veo 3.
* 95-character dialogue limit – auto-splits long lines so they’re Veo-friendly.
* Character & environment settings – keep characters and scenes visually consistent.

# 🛠 How to Use

1. Paste any noscript with dialogue.
2. Click “Generate Script.”
3. Get a full Veo 3-optimized noscript with scene prompts and dialogues you can copy or download.

# Why Test It?

I’m looking for real-world feedback from video creators:

* Does the dialogue detection work for your noscripts?
* Are the generated scene prompts clear enough?
* Any features you’d love to see added?

It’s 100% free to try—no signup needed.

👉 Give it a spin here: [https://www.avioncitojuego.com/](https://www.avioncitojuego.com/)

Thanks in advance for any thoughts, bug reports, or feature ideas! Your input will help make this a go-to noscript generator for Veo 3 and other AI video platforms.

— RAOGY

https://redd.it/1np3sto
@r_devops
Engineering intelligence - worth the hype?

So I keep hearing about these platforms that say they can tell you how your team is performing without asking you to track everything manually.

Cool in theory, but does anyone actually use them day-to-day? Or is it just another dashboard graveyard?

https://redd.it/1np5gbd
@r_devops
Best Path Forward?

Hey everyone,

I’m trying to figure out the best way to connect with an existing firm or shop that might need extra hands when they’ve got more work than they can handle. My background is pretty deep in Linux, with solid experience in AWS and GCP. I’m US-based and comfortable jumping into contract roles if it helps take some of the load off.

Has anyone here gone this route before? How did you find firms willing to subcontract out work? Any tips on where to start looking or how to approach the conversation would be appreciated.

https://redd.it/1np6lbh
@r_devops
Want to stand out in tech? Master the stuff most people ignore....

When I first started in tech, I thought the people who stood out had 10+ years of experience.
But over time, I noticed something different: the people who grow the fastest aren’t the ones who know every new tool they’re the ones who never skipped the fundamentals.

The truth is, most beginners rush past the basics. They chase frameworks, languages, and “hot skills,” but can’t explain how files move, how code is tracked, or how networks actually work. That gap shows up quickly in real projects and interviews.

If you want to level up your career faster, focus here first:

* Command Line Basics → navigating, managing files, running noscripts. It makes you way faster than click-hunting through GUIs.
* Git & Version Control → not optional. Every serious project lives on GitHub. Your repos are proof you can build.
* Networking 101 → IPs, DNS, ports, firewalls. Whether it’s AWS, Python, or DevOps, everything depends on it.
* Databases → CRUD, joins, indexes. Even a little SQL knowledge puts you ahead of “tutorial coders.”
* APIs → apps talk to each other through APIs. Learn how to send/receive data. It unlocks everything from web apps to automation.
* Cloud Essentials → EC2, S3, IAM, VPC. Even beginner-level cloud knowledge gives you an edge.
* Problem-Solving Mindset → syntax is easy. What makes you valuable is breaking down problems and figuring things out.

Frameworks and tools will keep changing. But fundamentals? They compound forever.

Curious which of these you’ve been focusing on lately?

https://redd.it/1np95k2
@r_devops
Keeping SPF record under the ten lookup limit

How do you keep your SPF record under the ten lookup limit when you add new vendors ?

https://redd.it/1np9ce5
@r_devops
Backstage Scaffolder

Hey everyone,

I'm working with Scaffolder templates and specifically trying to streamline the experience for creating new repositories (e.g., in GitLab).

The Challenge: The RepoUrlPicker field is fantastic for importing existing repositories, as it allows users to pick from a list of what's already there. However, for templates that are solely designed to create a brand new repository, this feature becomes problematic:

1. User Confusion: Users might accidentally select an existing repo, leading to template execution failures (as the publish action tries to create something that already exists).
2. Unnecessary UI: The dropdown for existing repos just adds visual clutter when the template's purpose is clear: "create something new."

What I'd ideally like:

Option 1: A `RepoUrlPicker` with an option to hide existing repos. Something like `ui:options: { showExistingRepos: false }`.
Option 2: A separate, simplified "RepoGroupPicker" or similar. This would only allow selecting a group/namespace (like platform/my-team for GitLab) and then combine that with a simple text input for the new repository name. This would be combined with a simple string parameter for the new repo name in template.yaml.

The current alternative involves either using a static enum (which is not scalable) or writing a custom frontend field extension to strip out the unwanted functionality (which feels like a lot of work for a common use case).

Has anyone else felt this pain point or found a neat workaround? Is this something that could be considered for a future enhancement to the RepoUrlPicker or Scaffolder fields in general?

Any thoughts or experiences are highly appreciated!

Thanks!

https://redd.it/1npbgwj
@r_devops
IT or Computer Science

I'm 16 year old with skills of: Linux, Bash, Git, GitHub, Networking, AWS, Terraform, Ansible, Docker, and now learning Kubernetes.

I also have certs of AWS CCP and AWS SAA.

My goal is to become DevOps & Cloud. Based on me, which would u recommend, IT or Computer Science?

https://redd.it/1npd0s3
@r_devops
Hetzner doesn't offer Managed databases (PostgreSQL) on CCX23. What Can I do?

Hello everyone, I'm sorry I'm not very familiar with DevOps, so excuse me if I don't know what I'm talking about.

I need to host a Laravel app, with a PostgreSQL database, Redis, and Grafana for monitoring.

So far, I've come to understand that my low-cost robust options are limited (max 25$ per month), and it seems that if I want a good performance for my application with a low response time, I should go with CCX23 (dedicated CPU).

My understanding is that I can allocate 10-12 GB of RAM for the app, and the rest for Grafana and Redis.

But Hetzner doesn't offer managed databases with the Hetzner Cloud VPS.

Are there any better options to host this App, and its database effectively in order to avoid any resource-related issues in the first year of the application (first year most likely ending in 500 users at an RPS of 200, 70% of which are reads).

I will be implementing caching and many other strategies with OPcache, Gzip... but I just want to host this application effectively for now.

https://redd.it/1npeylo
@r_devops
The $7 Trillion Delusion: Was Sam Altman the First Real Case of ChatGPT Psychosis?

SS: Super interesting and semi-satirical article that just popped up in my feed, makes me wonder what happend to this entire 7 trillion ordeal. I think its very very relevant to ask and understand how the people in charge interact with AI. The article touches on many current issues surrounding the psychological and by extension societal impact of AI, and I think it has multiple points that will spark an interesting discussion. The article brings a new angle to this topic and connects some very interesting dots about the AI bubble and how AI delusions might be affecting decisions. https://medium.com/@adan.nygaard/the-7-trillion-delusion-was-sam-altman-the-first-real-case-of-chatgpt-psychosis-949b6d89ec55

https://redd.it/1npha6u
@r_devops
Good DevOps projects for practice?

So I'm looking for any open source DevOps project that is fully functional but lacks all DevOps tools (pipelines, K8s files, docker files, ...). I want to use the given project as a way to demonstrate my knowledge of these tools by adding them to build the app further from CI to monitoring.

https://redd.it/1npktxx
@r_devops
SQL Indexing for Real-World Performance: What Every DevOps Engineer Should Know

As DevOps engineers, we often focus on CI/CD, automation, and infrastructure — but database performance can become a hidden bottleneck in production.

I recently made a beginner-friendly breakdown of SQL indexing that keeps it simple, visual, and practical:

Heap tables – what happens when no clustered index exists

Clustered indexes – how data is physically ordered and retrieved

Non-clustered indexes – when to use them and how they reference the table

Stored Procedure Lookups – real performance examples that show why indexing matters in production


👉 The goal: make indexing easy to understand for people who don’t live inside SQL every day, but still need to keep systems running fast and reliable.

Video link here: https://youtu.be/cDiCp64V-uQ?si=qCKHn0hyGd_ID5MM

Would love to hear how you approach database optimization in your DevOps workflow (monitoring, tuning, automation, etc.)

https://redd.it/1npnqut
@r_devops
Struggling with API misconfigurations in prod

Our team keeps running into API problems once code hits production; things like:

* Incorrect auth settings
* Debug endpoints left open
* Tokens that don’t get updated

A lot of it seems to come from drift between what’s in code and what’s actually deployed.

We’ve tried linting rules and CI/CD checks, but some issues still slip through.

For those of you managing complex stacks, what approaches have helped you catch or prevent API misconfigurations without slowing down your release cycle?

https://redd.it/1npsm3j
@r_devops
Dockerhub is down

TL;DR:
Docker Hub is partially down (mainly auth + registry + web). They know the issue and are working on it. Just hang tight, avoid huge CI bursts, and retry periodically.

What’s happening:

The trouble affects Docker Hub Registry, authentication, and web services.

Their status page says they’ve identified the cause and are working on a fix. 

What you can try now:

Retry operations (pulls/login) periodically- sometimes the errors are intermittent.

Use image digests instead of tags (some users report those are still working).

If you have a mirror or pull-through cache (e.g. via a private registry), consider using it.

Monitor Docker’s status page closely for updates.

P.S If you found this update helpful, consider checking out The Dev Dex.

https://redd.it/1nptobe
@r_devops
Should backend-to-database connections use SSL if proxy already has SSL?

If my backend is running behind a reverse proxy (e.g., Traefik/Nginx) that already has SSL/TLS enabled for client traffic, do I still need to enable SSL/TLS on the database connection between the backend and the database server considering when in Docker-compose or K8s the database is running on internal network therefore not exposed to the outside traffic?

https://redd.it/1nq0ixi
@r_devops