I build a public Chatbox, do you want to try it ?
It’s still a wip, but it’s functional
You can just type messages and its in real time
There is nobody in the chat, its normal :D
for the moment, its on my pc but i think i'll put the server on a raspberry pi OR an old smartphone for the experience :) i think its very lowcost energy for the smartphone concept server
If you want to try it, it will be extremely cool :) !!
>there is no save chat historic, and its anonym
just a newID is random generate to write when you open the browser
I just create a light anti-spam
Thanks a lot in advance to anyone who try
Sometimes the simplest tests make the biggest difference
Its very experimental, if you have any ideas for the way this app can take ? :D
>The online version is on ***https://chat.glhf.be***
i have created a windows app on dotnet, but its the same on the webpage :)
Tell me what do you think !
Cheeeeers
Joseph
https://redd.it/1pbb38h
@r_devops
It’s still a wip, but it’s functional
You can just type messages and its in real time
There is nobody in the chat, its normal :D
for the moment, its on my pc but i think i'll put the server on a raspberry pi OR an old smartphone for the experience :) i think its very lowcost energy for the smartphone concept server
If you want to try it, it will be extremely cool :) !!
>there is no save chat historic, and its anonym
just a newID is random generate to write when you open the browser
I just create a light anti-spam
Thanks a lot in advance to anyone who try
Sometimes the simplest tests make the biggest difference
Its very experimental, if you have any ideas for the way this app can take ? :D
>The online version is on ***https://chat.glhf.be***
i have created a windows app on dotnet, but its the same on the webpage :)
Tell me what do you think !
Cheeeeers
Joseph
https://redd.it/1pbb38h
@r_devops
AI intelligence debug tool. Need feedback.
I built a small tool to help debug test failures automatically. It pulls in your test runs and uses AI to surface flaky tests, failure clusters, weekly dashboard and stability trends and AI powered run summaries.
If a few people can try it and tell me what sucks before I launch officially, I'd really appreciate it.
https://redd.it/1pbc7x9
@r_devops
I built a small tool to help debug test failures automatically. It pulls in your test runs and uses AI to surface flaky tests, failure clusters, weekly dashboard and stability trends and AI powered run summaries.
If a few people can try it and tell me what sucks before I launch officially, I'd really appreciate it.
https://redd.it/1pbc7x9
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
$10K logging bill from one line of code - rant about why we only find these logs when it's too late (and what we did about it)
This is more of a rant than a product announcement, but there's a small open source tool at the end because we got tired of repeating this cycle.
Every few months we have the same ritual:
\- Management looks at the cost
\- Someone asks "why are logs so expensive?"
\- Platform scrambles to:
\- tweak retention and tiers
\- turn on sampling / drop filters
And every time, the core problem is the same:
\- We only notice logging explosions after the bill shows up
\- Our tooling shows cost by index / log group / namespace, not by lines of code
\- So we end up sending vague messages like "please log less" that don't actually tell any team what to change
In one case, when we finally dug into it properly, we realised:
\- The majority of the extra cost came from one or two log statements:
\- debug logs in hot paths
\- usage for that service gradually increased (so there were no spikes in usage)
\- verbose HTTP tracing we accidentally shipped into prod
\- payload dumps in loops
What we wanted was something that could say:
src/memoryutils.py:338 Processing step: %s
315 GB | $157.50 | 1.2M calls
i.e. "this exact line of code is burning $X/month", not just "this log index is expensive."
Because the current flow is:
\- DevOps/Platform owns the bill
\- Dev teams own the code
\- But neither side has a simple, continuous way to connect "this monthly cost" → "these specific lines"
At best someone does grepping through the logs (on DevOps side) and Dev team might look at that later if chased.
———
We ended up building a tiny Python library for our own services that:
\- wraps the standard logging module and print
\- records stats per file:line:level – counts and total bytes
\- does not store any raw log payloads (just aggregations)
Then we can run a service under normal load and get a report like (also, get Slack notifications):
Provider: GCP Currency: USD
Total bytes: 900,000,000,000 Estimated cost: 450.00 USD
Top 5 cost drivers:
\- src/memory\utils.py:338 Processing step: %s... 157.5000 USD
...
The interesting part for us wasn't "save money" in the abstract, it was:
\- Stop sending generic "log less" emails
\- Start sending very specific messages to teams:
"These 3 lines in your service are responsible for \~40% of the logging cost. If you change or sample them, you’ll fix most of the problem for this app."
\- It also fixes the classic DevOps problem of "I have no idea whether this log is important or not":
Platform can show cost and frequency,
Teams who own the code decide which logs are worth paying for.
It also runs continuously, so we don’t only discover the problem once the monthly bill arrives.
———
If anyone's curious, the Python piece we use is here (MIT): https://github.com/ubermorgenland/LogCost
It currently:
works as a drop‑in for Python logging (Flask/FastAPI/Django examples, K8s sidecar, Slack notifications)
only exports aggregated stats (file:line, level, count, bytes, cost) – no raw logs
https://redd.it/1pbcuny
@r_devops
This is more of a rant than a product announcement, but there's a small open source tool at the end because we got tired of repeating this cycle.
Every few months we have the same ritual:
\- Management looks at the cost
\- Someone asks "why are logs so expensive?"
\- Platform scrambles to:
\- tweak retention and tiers
\- turn on sampling / drop filters
And every time, the core problem is the same:
\- We only notice logging explosions after the bill shows up
\- Our tooling shows cost by index / log group / namespace, not by lines of code
\- So we end up sending vague messages like "please log less" that don't actually tell any team what to change
In one case, when we finally dug into it properly, we realised:
\- The majority of the extra cost came from one or two log statements:
\- debug logs in hot paths
\- usage for that service gradually increased (so there were no spikes in usage)
\- verbose HTTP tracing we accidentally shipped into prod
\- payload dumps in loops
What we wanted was something that could say:
src/memoryutils.py:338 Processing step: %s
315 GB | $157.50 | 1.2M calls
i.e. "this exact line of code is burning $X/month", not just "this log index is expensive."
Because the current flow is:
\- DevOps/Platform owns the bill
\- Dev teams own the code
\- But neither side has a simple, continuous way to connect "this monthly cost" → "these specific lines"
At best someone does grepping through the logs (on DevOps side) and Dev team might look at that later if chased.
———
We ended up building a tiny Python library for our own services that:
\- wraps the standard logging module and print
\- records stats per file:line:level – counts and total bytes
\- does not store any raw log payloads (just aggregations)
Then we can run a service under normal load and get a report like (also, get Slack notifications):
Provider: GCP Currency: USD
Total bytes: 900,000,000,000 Estimated cost: 450.00 USD
Top 5 cost drivers:
\- src/memory\utils.py:338 Processing step: %s... 157.5000 USD
...
The interesting part for us wasn't "save money" in the abstract, it was:
\- Stop sending generic "log less" emails
\- Start sending very specific messages to teams:
"These 3 lines in your service are responsible for \~40% of the logging cost. If you change or sample them, you’ll fix most of the problem for this app."
\- It also fixes the classic DevOps problem of "I have no idea whether this log is important or not":
Platform can show cost and frequency,
Teams who own the code decide which logs are worth paying for.
It also runs continuously, so we don’t only discover the problem once the monthly bill arrives.
———
If anyone's curious, the Python piece we use is here (MIT): https://github.com/ubermorgenland/LogCost
It currently:
works as a drop‑in for Python logging (Flask/FastAPI/Django examples, K8s sidecar, Slack notifications)
only exports aggregated stats (file:line, level, count, bytes, cost) – no raw logs
https://redd.it/1pbcuny
@r_devops
GitHub
GitHub - ubermorgenland/LogCost: Find and fix expensive log statements to reduce cloud logging costs
Find and fix expensive log statements to reduce cloud logging costs - ubermorgenland/LogCost
Best container image security tool for growing company?
Mentioned it here earlier but now leading a devops team following a quick departure by the person who hired me. That person completely ignored the Bitnami change to paid and now it’s up to me to figure out what to do. Not clear if this is one of the many reasons they were dismissed.
We’re using dozens of open source images like Python, ArgoCD, and Istio, and right now using Trivy for security scans but have been a crap ton of unnecessary vulnerability alerts.
I’m looking for something that handles vulnerability fatigue, CI/CD, etc., that doesn’t piss the team off.
Are most of you just eating the cost of your base images on Bitnami and patching vulnerabilities yourself? If not, what container image tool are you using?
Dev count is \~50 and devops is 5 including myself.
https://redd.it/1pbd14u
@r_devops
Mentioned it here earlier but now leading a devops team following a quick departure by the person who hired me. That person completely ignored the Bitnami change to paid and now it’s up to me to figure out what to do. Not clear if this is one of the many reasons they were dismissed.
We’re using dozens of open source images like Python, ArgoCD, and Istio, and right now using Trivy for security scans but have been a crap ton of unnecessary vulnerability alerts.
I’m looking for something that handles vulnerability fatigue, CI/CD, etc., that doesn’t piss the team off.
Are most of you just eating the cost of your base images on Bitnami and patching vulnerabilities yourself? If not, what container image tool are you using?
Dev count is \~50 and devops is 5 including myself.
https://redd.it/1pbd14u
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Finally joined product company but in a bad team
Always worked in a mediacor company in my career, i had faced issues in my project where client was not happy but i have worked on it and got good feedback from the client instead.
I finally joined a product company which i always dreamed of. But this time I got into a very very bad team i say. Its been just few months only and am not able to adjust it. Start of the day with anxiety and ends with no energy to do anything.
I carry good amount of Cloud experience and being into devops as well.. But i feel like its very overwhelming for me.... Am getting panic attacks in every scrum, retro, refinements...
https://redd.it/1pbfifl
@r_devops
Always worked in a mediacor company in my career, i had faced issues in my project where client was not happy but i have worked on it and got good feedback from the client instead.
I finally joined a product company which i always dreamed of. But this time I got into a very very bad team i say. Its been just few months only and am not able to adjust it. Start of the day with anxiety and ends with no energy to do anything.
I carry good amount of Cloud experience and being into devops as well.. But i feel like its very overwhelming for me.... Am getting panic attacks in every scrum, retro, refinements...
https://redd.it/1pbfifl
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Is DevOps/R&D dynamics so tense for all of you?
I'm in the first year of my first devops position, and the relationship between us the developers is so tense it's ridiculous. And from my view it seems like they are just lazy and not really owning their work. They’ll pick CPU and memory requests once in dev, ship, and then never think about it again. They don't load-test or profile, and then are very surprised when latency explodes at scale. I’m getting paged for their services becuase somehow the alerts are always “ops noise” instead of, you know, their code falling over.
A lot of my energy goes into being frustrated with them and their seeming inclination to first say anything wrong has got to do with us, and them if we check it and disagree, we need to make a court-worthy case in order to roll the problem back to them so they can fix whatever it is they didn't do well in the first place. Is it like that everywhere? Or is it just shitty culture in our org?
https://redd.it/1pbczlz
@r_devops
I'm in the first year of my first devops position, and the relationship between us the developers is so tense it's ridiculous. And from my view it seems like they are just lazy and not really owning their work. They’ll pick CPU and memory requests once in dev, ship, and then never think about it again. They don't load-test or profile, and then are very surprised when latency explodes at scale. I’m getting paged for their services becuase somehow the alerts are always “ops noise” instead of, you know, their code falling over.
A lot of my energy goes into being frustrated with them and their seeming inclination to first say anything wrong has got to do with us, and them if we check it and disagree, we need to make a court-worthy case in order to roll the problem back to them so they can fix whatever it is they didn't do well in the first place. Is it like that everywhere? Or is it just shitty culture in our org?
https://redd.it/1pbczlz
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Event about shell noscripting with NuShell in Ghent
I am posting here, because I assume noscripting with Bash (or others) is an essential part of DevOps. There is this new language around the block called NuShell that promises to improve the Bash experience. I am giving a free interactive workshop in Ghent, Belgium on Wednesday. Anyone interested and in the area? You can sign up here
The content of the presentation (such as slides and exercises) is in a Git repository.
https://redd.it/1pbh9jj
@r_devops
I am posting here, because I assume noscripting with Bash (or others) is an essential part of DevOps. There is this new language around the block called NuShell that promises to improve the Bash experience. I am giving a free interactive workshop in Ghent, Belgium on Wednesday. Anyone interested and in the area? You can sign up here
The content of the presentation (such as slides and exercises) is in a Git repository.
https://redd.it/1pbh9jj
@r_devops
Meetup
NuShell: Super glue for your operating system, Wed, Dec 3, 2025, 7:00 PM | Meetup
NuShell is a fairly new type of shell focused on functional structured data processing. It fixes most problems of the traditional POSIX shellls and takes inspiration from P
The Hidden Cost of “Cold Starts”: Defeating EBS Lazy Loading in AI Pipelines
So i was working on ML Workload Optimization and faced some issues regarding Lazy Start and First Touch Latency. These are the things which we miss while doing the optimization for our high throught pipelines. A simple yet small thing can make such a big impact. Added my finding in this blog. Hope this might help you guys.
https://dcgmechanics.medium.com/the-hidden-cost-of-cold-starts-defeating-ebs-lazy-loading-in-ai-pipelines-ff784febba74
https://redd.it/1pblub0
@r_devops
So i was working on ML Workload Optimization and faced some issues regarding Lazy Start and First Touch Latency. These are the things which we miss while doing the optimization for our high throught pipelines. A simple yet small thing can make such a big impact. Added my finding in this blog. Hope this might help you guys.
https://dcgmechanics.medium.com/the-hidden-cost-of-cold-starts-defeating-ebs-lazy-loading-in-ai-pipelines-ff784febba74
https://redd.it/1pblub0
@r_devops
Medium
The Hidden Cost of “Cold Starts”: Defeating EBS Lazy Loading in AI Pipelines
In the world of MLOps, we often obsess over model inference time. We spend weeks optimizing PyTorch code to shave off 50 milliseconds per…
For those that declined a +50% raise from another company’s offer, why did you do it?
Current compensation including benefits insurance is $150k. I have an offer for $250k with room for growth, but the job would absolutely wreck me in regard to the amount of work I would be taking on. My current role is basically heaven in terms of workload. Both are fully remote.
https://redd.it/1pbowvm
@r_devops
Current compensation including benefits insurance is $150k. I have an offer for $250k with room for growth, but the job would absolutely wreck me in regard to the amount of work I would be taking on. My current role is basically heaven in terms of workload. Both are fully remote.
https://redd.it/1pbowvm
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Wanted: A simple event bus service similar to Eiffel
At my last employer we interfaced with an Eiffel service (https://e-pettersson-ericsson.github.io/eiffel-community.github.io/) to be able to gather statistics about our CI/CD pipeline runs and then use that data to measure quality metrics for projects. It was an interesting setup, which we where mostly just users of and not the maintainers.
At my current employer we are currently trying to implement something similar for our CI/CD pipeline. A couple of my colleagues started with a PoC of the simplest possible usecase (i.e something custom). But I was thinking if there was something our there that is *not* Eiffel, a bit simpler, but still open source that we could look at instead of having to build and maintain something ourselves?
I spend quite a lot of time in the self-hosting community but haven't seen anything like it yet.
https://redd.it/1pbof06
@r_devops
At my last employer we interfaced with an Eiffel service (https://e-pettersson-ericsson.github.io/eiffel-community.github.io/) to be able to gather statistics about our CI/CD pipeline runs and then use that data to measure quality metrics for projects. It was an interesting setup, which we where mostly just users of and not the maintainers.
At my current employer we are currently trying to implement something similar for our CI/CD pipeline. A couple of my colleagues started with a PoC of the simplest possible usecase (i.e something custom). But I was thinking if there was something our there that is *not* Eiffel, a bit simpler, but still open source that we could look at instead of having to build and maintain something ourselves?
I spend quite a lot of time in the self-hosting community but haven't seen anything like it yet.
https://redd.it/1pbof06
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
I had AI peer-review its own rules. All three models missed "don't push to main"
I'm a DevOps engineer and was just having some fun with the Cursor user
rules, and thought... why not build rules with another AI to make the AI better
at my job? Maybe he'll stop being annoying to work with sometimes and
will actually be helpful. Especially when it comes to troubleshooting
and DevOps architectures.
So I went back and forth between Claude desktop and my Cursor IDE and
it was working pretty well - they were improving the rules and making
them easier to handle and more focused. I was very proud and sent it
to my teammate.
After about an hour he sent me a message that the agent pushed the
code to main... Apparently they do that unless you tell them otherwise.
So I thought: what else don't they know that we think is super obvious?
Anyway, I wrote a short article about it. You're welcome to check it out:medium article
Also if you wanna checkout the rules we created and add some of yours github repo
https://redd.it/1pbr13h
@r_devops
I'm a DevOps engineer and was just having some fun with the Cursor user
rules, and thought... why not build rules with another AI to make the AI better
at my job? Maybe he'll stop being annoying to work with sometimes and
will actually be helpful. Especially when it comes to troubleshooting
and DevOps architectures.
So I went back and forth between Claude desktop and my Cursor IDE and
it was working pretty well - they were improving the rules and making
them easier to handle and more focused. I was very proud and sent it
to my teammate.
After about an hour he sent me a message that the agent pushed the
code to main... Apparently they do that unless you tell them otherwise.
So I thought: what else don't they know that we think is super obvious?
Anyway, I wrote a short article about it. You're welcome to check it out:medium article
Also if you wanna checkout the rules we created and add some of yours github repo
https://redd.it/1pbr13h
@r_devops
GitHub
GitHub - yairt2/ai-devops-rules
Contribute to yairt2/ai-devops-rules development by creating an account on GitHub.
SMBs struggling with Cloud/DevOps/SRE? Let’s collaborate.
Hey everyone- I’m looking to collaborate with SMBs that want reliable, scalable, and cost-efficient cloud infrastructure without hiring a full in-house DevOps/SRE team.
I run a Cloud consultancy helping teams fix slow deployments, outages, high cloud bills, and legacy setups.
What we handle:
- Cloud engineering (AWS/Azure/GCP)
- DevOps automation & CI/CD
- Serverless deployments
- SRE (monitoring, SLOs, resilience)
- Security & cost optimization
Proven results:
From our case studies (FinTech, Healthcare, iGaming):
- <50ms API latency & 30% cost savings
- ~60% OPEX reduction
- 400k+ concurrent users @ 15ms latency
If you’re an SMB founder/CTO wanting to scale faster, reduce outages, or cut cloud costs, I’d love to collaborate.
DM me or comment: happy to share ideas.
https://redd.it/1pbu14k
@r_devops
Hey everyone- I’m looking to collaborate with SMBs that want reliable, scalable, and cost-efficient cloud infrastructure without hiring a full in-house DevOps/SRE team.
I run a Cloud consultancy helping teams fix slow deployments, outages, high cloud bills, and legacy setups.
What we handle:
- Cloud engineering (AWS/Azure/GCP)
- DevOps automation & CI/CD
- Serverless deployments
- SRE (monitoring, SLOs, resilience)
- Security & cost optimization
Proven results:
From our case studies (FinTech, Healthcare, iGaming):
- <50ms API latency & 30% cost savings
- ~60% OPEX reduction
- 400k+ concurrent users @ 15ms latency
If you’re an SMB founder/CTO wanting to scale faster, reduce outages, or cut cloud costs, I’d love to collaborate.
DM me or comment: happy to share ideas.
https://redd.it/1pbu14k
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
SIEM exploration as DevOps?
Boss wants me to evaluate potential SIEM products for enhanced Cyber Security of our infrastructure, does this fit my role as a DevOps person? I don’t know anything about siem and haven’t done anything with it before. Is he setting me up to fail
https://redd.it/1pbw72u
@r_devops
Boss wants me to evaluate potential SIEM products for enhanced Cyber Security of our infrastructure, does this fit my role as a DevOps person? I don’t know anything about siem and haven’t done anything with it before. Is he setting me up to fail
https://redd.it/1pbw72u
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
YAML: Yet Another Misery Language
Why does no one talk about how absolutely insane it is that half this job is debugging invisible whitespace, copy-pasted YAML rituals, and "why did Kubernetes decide to ignore this value today?"
Everyone keeps saying DevOps is about "culture" and "collaboration," but from what I can tell it's mostly convincing machines to accept indentation and hoping Helm doesn't summon demons.
Is this normal? Or did I accidentally join a giant industry-wide hazing ritual?
Asking respectfully for a friend...
https://redd.it/1pbwxlu
@r_devops
Why does no one talk about how absolutely insane it is that half this job is debugging invisible whitespace, copy-pasted YAML rituals, and "why did Kubernetes decide to ignore this value today?"
Everyone keeps saying DevOps is about "culture" and "collaboration," but from what I can tell it's mostly convincing machines to accept indentation and hoping Helm doesn't summon demons.
Is this normal? Or did I accidentally join a giant industry-wide hazing ritual?
Asking respectfully for a friend...
https://redd.it/1pbwxlu
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
What level of programimming language needed in devops.
I recently interviewed for a DevOps role where the technical round focused heavily on LeetCode-style coding problems rather than typical noscripting or infrastructure tasks. Is this common practice nowadays? I’m wondering if the industry expectation has shifted towards requiring software engineering-level proficiency in languages like Python or Go for infrastructure roles.
https://redd.it/1pbw1mz
@r_devops
I recently interviewed for a DevOps role where the technical round focused heavily on LeetCode-style coding problems rather than typical noscripting or infrastructure tasks. Is this common practice nowadays? I’m wondering if the industry expectation has shifted towards requiring software engineering-level proficiency in languages like Python or Go for infrastructure roles.
https://redd.it/1pbw1mz
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
our latest version is live and working, experience it earlier
I'm thrilled to announce the release of Tackleit v2.1! We've been listening to your feedback and have rolled out a suite of features designed to make your job search even more precise and efficient.
Link is below
https://redd.it/1pc0c1y
@r_devops
I'm thrilled to announce the release of Tackleit v2.1! We've been listening to your feedback and have rolled out a suite of features designed to make your job search even more precise and efficient.
Link is below
https://redd.it/1pc0c1y
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Collaboration DevOps Engineer for a Decentralized AI Compute Network (DistriAI)
Hi everyone,
I’m building DistriAI, a decentralized AI compute network that aggregates unused CPU/GPU power from everyday devices (smartphones, laptops, desktops) into a globally distributed inference layer.
We’re entering the next stage of development, and we’re looking for someone with solid DevOps / Infrastructure experience to help shape the network’s backbone.
⸻
What DistriAI is doing
We orchestrate and validate micro-tasks across thousands of heterogeneous nodes, aiming to build a censorship-resistant, cost-efficient alternative to centralized compute providers.
Think DePIN × AI orchestration, with an emphasis on performance, security, and reliability.
⸻
What we already have:
• architecture v1 + v1.1 updates
(segmented node pipeline, scheduler, circuit breakers, adaptive rate limiting, RBAC, early fraud detection, reward-meter stub, audit logging…)
• whitepaper
• technical roadmap
• tokenomics
• presale structure
• backend + smart contract contributors
• security engineering support
• early monitoring + observability baseline
The core foundation is ready — now we want to harden and scale the infrastructure layer.
⸻
What we need from DevOps
A DevOps engineer who can collaborate on:
Infrastructure & Scaling
• containerized microservices
• orchestration (Kubernetes preferred)
• distributed task execution pipelines
• autoscaling & workload distribution
Observability
• metrics, logs, distributed tracing
• node heartbeat & uptime tracking
• anomaly + fraudulent behavior detection
Security & Reliability
• secure CI/CD
• secrets management
• vulnerability scanning
• fault-tolerant compute routing
Tooling
• local + cloud deployment tooling
• zero-downtime upgrades
• environment config management
• developer experience pipelines
⸻
Tech (flexible)
Docker, Kubernetes, NATS/Redis/Kafka, Prometheus/Grafana, Loki/ELK, Terraform, GitHub Actions.
Not required to know everything — but you must be comfortable designing systems that scale and survive.
⸻
Who we’re looking for
• someone who likes building infra from scratch
• strong reliability mindset
• experience with distributed systems or high-load environments
• ownership + clarity in communication
• ability to collaborate with backend/security contributors
⸻
If interested
Drop your GitHub, LinkedIn, or previous infra setups, or DM me directly for more details.
Happy to walk you through the architecture and where you’d plug in.
Let’s build the backbone of DistriAI together.
https://redd.it/1pc1rb2
@r_devops
Hi everyone,
I’m building DistriAI, a decentralized AI compute network that aggregates unused CPU/GPU power from everyday devices (smartphones, laptops, desktops) into a globally distributed inference layer.
We’re entering the next stage of development, and we’re looking for someone with solid DevOps / Infrastructure experience to help shape the network’s backbone.
⸻
What DistriAI is doing
We orchestrate and validate micro-tasks across thousands of heterogeneous nodes, aiming to build a censorship-resistant, cost-efficient alternative to centralized compute providers.
Think DePIN × AI orchestration, with an emphasis on performance, security, and reliability.
⸻
What we already have:
• architecture v1 + v1.1 updates
(segmented node pipeline, scheduler, circuit breakers, adaptive rate limiting, RBAC, early fraud detection, reward-meter stub, audit logging…)
• whitepaper
• technical roadmap
• tokenomics
• presale structure
• backend + smart contract contributors
• security engineering support
• early monitoring + observability baseline
The core foundation is ready — now we want to harden and scale the infrastructure layer.
⸻
What we need from DevOps
A DevOps engineer who can collaborate on:
Infrastructure & Scaling
• containerized microservices
• orchestration (Kubernetes preferred)
• distributed task execution pipelines
• autoscaling & workload distribution
Observability
• metrics, logs, distributed tracing
• node heartbeat & uptime tracking
• anomaly + fraudulent behavior detection
Security & Reliability
• secure CI/CD
• secrets management
• vulnerability scanning
• fault-tolerant compute routing
Tooling
• local + cloud deployment tooling
• zero-downtime upgrades
• environment config management
• developer experience pipelines
⸻
Tech (flexible)
Docker, Kubernetes, NATS/Redis/Kafka, Prometheus/Grafana, Loki/ELK, Terraform, GitHub Actions.
Not required to know everything — but you must be comfortable designing systems that scale and survive.
⸻
Who we’re looking for
• someone who likes building infra from scratch
• strong reliability mindset
• experience with distributed systems or high-load environments
• ownership + clarity in communication
• ability to collaborate with backend/security contributors
⸻
If interested
Drop your GitHub, LinkedIn, or previous infra setups, or DM me directly for more details.
Happy to walk you through the architecture and where you’d plug in.
Let’s build the backbone of DistriAI together.
https://redd.it/1pc1rb2
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Are there any good reasons anymore to use a Virtual Machine (leaving alone emulation needs for cross-compiling) over Docker or devcontainers when developing an application? I keep hearing that at this point there is no reason as even container security can be hardened.
Are there any good reasons anymore to use a Virtual Machine (leaving alone emulation needs for cross-compiling) over Docker or devcontainers when developing an application? I keep hearing that at this point there is no reason as even container security can be hardened.
Thanks so much and I’m sorry if this is a boring noob question.
https://redd.it/1pc2rkd
@r_devops
Are there any good reasons anymore to use a Virtual Machine (leaving alone emulation needs for cross-compiling) over Docker or devcontainers when developing an application? I keep hearing that at this point there is no reason as even container security can be hardened.
Thanks so much and I’m sorry if this is a boring noob question.
https://redd.it/1pc2rkd
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Do you guys actually use AI to code or am i just overthinking this?
I think I'm starting to realize how normal it’s becoming to lean on AI for coding. Tools like Cursor, Windsurf, Continue, all that, are just part of the workflow now. even our prof was cool with it as long as we can explain what the code is actually doing.
I tried Cursor for the first time and honestly it was nice. something that would’ve taken me hours turned into writing a few prompts and fixing tiny mistakes. But it feels like i’m skipping the “learning” part.
I've been mixing in other stuff too Aider, GPT Pilot, even Cosine here and there just so i don’t lose track of how files connect when a project gets messy.
I think i’m just trying to figure out the balance. are we supposed to fully embrace these tools or use them sparingly so we actually learn? how are you all handling this?
https://redd.it/1pc3pvg
@r_devops
I think I'm starting to realize how normal it’s becoming to lean on AI for coding. Tools like Cursor, Windsurf, Continue, all that, are just part of the workflow now. even our prof was cool with it as long as we can explain what the code is actually doing.
I tried Cursor for the first time and honestly it was nice. something that would’ve taken me hours turned into writing a few prompts and fixing tiny mistakes. But it feels like i’m skipping the “learning” part.
I've been mixing in other stuff too Aider, GPT Pilot, even Cosine here and there just so i don’t lose track of how files connect when a project gets messy.
I think i’m just trying to figure out the balance. are we supposed to fully embrace these tools or use them sparingly so we actually learn? how are you all handling this?
https://redd.it/1pc3pvg
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Opsgenie alternatives
My team is currently using Opsgenie + Prometheus as a main way to react to important accidents. However, Opsgenie will shut down in 2027. So, please share your experience with other similar tools, preferably easy to use and open source.
https://redd.it/1pc4jw4
@r_devops
My team is currently using Opsgenie + Prometheus as a main way to react to important accidents. However, Opsgenie will shut down in 2027. So, please share your experience with other similar tools, preferably easy to use and open source.
https://redd.it/1pc4jw4
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community
Moved from Service Desk to DevOps and now I feel like a complete imposter. Need advice.
Hey everyone,
I really need some advice from people who’ve been in this situation.
I’ve been working in Service Desk for about 3 years, and somehow I managed to crack a DevOps interview for a FinTech startup. It felt like a huge step forward in my career.
But now reality is hitting me hard…
The team has started giving me an overview of their tech stack, and honestly, it’s stuff I’ve only heard of in videos or blogs. Things like CI/CD, AWS services, Terraform, Docker, pipelines, monitoring, etc. I understand the concepts, but I’ve never actually worked with them in a real environment.
I’ve never SSH’d into a real server, never used a real AWS Console, nothing. And now I’m feeling very small, like I’m not supposed to be here.
They think I know a lot because I interviewed well and answered most of the questions confidently. But internally I’m panicking because I don’t want to embarrass myself or let the team down.
I’m not trying to scam anyone, I genuinely want to become good at DevOps, but the gap between theory and real-world work feels massive right now.
So my question is:
How do I prepare quickly so I don’t feel like an imposter on Day 1?
What should I practice?
What projects should I build? How do I get comfortable with AWS, Linux, and pipelines before actually joining?
Any guidance from people who made the same transition would mean a lot. 🙏
TLDR: Coming from Service Desk with no real hands-on DevOps experience (no AWS, no SSH, no pipelines). Cracked a DevOps interview but now feel like an imposter because the tech stack is way beyond what I’ve practiced. Need advice on how to prepare fast and not freeze on the job.
https://redd.it/1pc4qsa
@r_devops
Hey everyone,
I really need some advice from people who’ve been in this situation.
I’ve been working in Service Desk for about 3 years, and somehow I managed to crack a DevOps interview for a FinTech startup. It felt like a huge step forward in my career.
But now reality is hitting me hard…
The team has started giving me an overview of their tech stack, and honestly, it’s stuff I’ve only heard of in videos or blogs. Things like CI/CD, AWS services, Terraform, Docker, pipelines, monitoring, etc. I understand the concepts, but I’ve never actually worked with them in a real environment.
I’ve never SSH’d into a real server, never used a real AWS Console, nothing. And now I’m feeling very small, like I’m not supposed to be here.
They think I know a lot because I interviewed well and answered most of the questions confidently. But internally I’m panicking because I don’t want to embarrass myself or let the team down.
I’m not trying to scam anyone, I genuinely want to become good at DevOps, but the gap between theory and real-world work feels massive right now.
So my question is:
How do I prepare quickly so I don’t feel like an imposter on Day 1?
What should I practice?
What projects should I build? How do I get comfortable with AWS, Linux, and pipelines before actually joining?
Any guidance from people who made the same transition would mean a lot. 🙏
TLDR: Coming from Service Desk with no real hands-on DevOps experience (no AWS, no SSH, no pipelines). Cracked a DevOps interview but now feel like an imposter because the tech stack is way beyond what I’ve practiced. Need advice on how to prepare fast and not freeze on the job.
https://redd.it/1pc4qsa
@r_devops
Reddit
From the devops community on Reddit
Explore this post and more from the devops community