CatOps – Telegram
CatOps
5.09K subscribers
94 photos
5 videos
19 files
2.57K links
DevOps and other issues by Yurii Rochniak (@grem1in) - SRE @ Preply && Maksym Vlasov (@MaxymVlasov) - Engineer @ Star. Opinions on our own.

We do not post ads including event announcements. Please, do not bother us with such requests!
Download Telegram
​​At Google Cloud blog published nice article about SLI tunning

Article not small, so I made some highlits for you:
- Compare SLI with what affects users (compared SLI metrics with mentions in social networks, posts on forums, support tickets). According to the received picture - adjust SLI to make it more realistic.
- Have simple ToDo for founding first alarm/spike for incident
- If you have no alarm (so sad...) and get info from users - Google have a specific tag: "Customer found it first" when they're carrying out their postmortems.
- If your SLIs are unhappy but customers are fine - Split logs created by users and service errors, change levels of abstraction, etc.

And, of cource, nearly any SLI is better than no SLI.

#monitoring #sli
​​If you try adopt autoscaling in AWS, this post helps you get around the pitfalls. All of them is trivial, but only after they came across.

#aws
Статья-перекрёсток от Gruntwork.

"Перекрёсток" в смысле, что это собрание ссылок на другие их статьи и заметки. И там куча интересного как раз почитать на выходных.
​​Towards an understanding of technical debt

Recently I have big discussion with programmers about "technical dept" and part of them try to said "we need time to rewrite all". So, it's not good idea, and this article explains why. Author describe five different phenomena that people mean when they say “technical debt”, and advocates understanding the full context rather than just assuming the folks that came before were fools.

Wrote in January 2016 but still actual.

#culture
​​Cloudinfo - cloud-agnostic tool to compare and contrast instance types between providers and/or regions. Written in Go and have nice UI.

Github | UI demo

#cloud #aws #azure #gcp
Видео и слайды с моего вчерашнего доклада о сетапе Vault с S3 + DynamoDB на бэкенде со встречи HashiCorp User Group - Saint Petersburg.

Презентация: https://drive.google.com/file/d/13xqw5weCkCvQFQMTk-TLiR3hpAeOIqlm/view?usp=sharing

Видео: https://youtu.be/sUl4s6IV6wc

#slides
​​Observability?! – Where do we go from here?

Long-read about this not simple question which covers reasons, pillars of observability, requirements for engineers and many other things.

Highly recommend to read this article for anyone who is even slightly interested in ~~observerless~~ Observability

Also, it have simply definition of Devops in one sentence:
>The DevOps philosophy promotes the concept of small, agile and cross-functional teams that completely manage the life cycle of the services they own.

#culture #observability
Мой опыт показывает, что чем дальше я от баз данных, тем лучше мне, базам данных и компании в целом 🙂

Но так или иначе, с реляционками приходится встречаться и работать. Ловите бесплатную вэб-книгу об индексах в БД

Там хорошее описание, как работать с индексами в популярных движках и, что происходит под капотом

#databases
Вчера зарелизили Grafana 6

Из нововведений:
- Explore: новый способ запросов, который позволяет делать запросы на лету для траблшутинга в реальном времени
- Grafana Loki: интеграция с Loki — который как Prometheus, но для логов
- Gauge Panel: отдельная панель для круговых показателей (gauges). Планируется цифровые показатели тоже привести к виду gauges
- New Panel Editor UX: новый редактор панелей
- Google Stackdriver Datasource: новый датасорс для догадайтесь чего
- Azure Monitor Datasource: перенесён из плагина в core датасорс
- React Plugin: Графана переезжает на React, React plugin позволяет проще создавать плагины
- Named Colors: цвета по названиям, а не кодам 😁
- Removal of user session storage: теперь проще сетапить Графану без стайтфул хранилища. Вместо хранения сессий выдаются короткоживущие токены

Больше инфы тут

#monitoring #grafana
Google рассказывают о семи шагах в DevOps (методологию) в своём блоге.

1. Начните внедрение с малого проекта — остальные подтянутся
2. Используйте Open Source решения
3. Занимайтесь вопросами безопасности во время разработки, а не сбоку от неё
4. Следуйте лучшим практикам (хехехе. Тут пункт про SRE Book на самом деле)
5. Проводите внутренние тренинги внутри компании/команды и делитесь знаниями
6. No blame culture
7. Развивайте культуру: базируйте решения на данных, будьте открытыми, ставьте общие цели

Как-то так…

#culture
​​Some people predicted that 2019 will be focusing on Security and since all the holidays were left behind time, it good time too start.

Modern Defense in Depth

This free ebook helps you take your cybersecurity plan to the next level by:
- Looking at the current cyberthreat landscape
- Explaining why these digital attacks are still so prevalent
- Giving you in-depth methodologies for keeping your organization safer

#book #security
I gave a talk last week about CI/CD for newbies/students and found that I can't remember all types of deployment strategies.

So I propose to read Six Strategies for Application Deployment in which the following practices are considered:
- Recreate
- Ramped
- Blue/Green
- Canary
- A/B testing
with pros, cons and friendly gifs)

P.S. Video record of talk here (ukrainian)

#cicd
AWS App Mesh is a service mesh based on Envoy that allows you to easily monitor and control communications across microservices applications on AWS. You can use App Mesh with microservices running on ECS, EKS and Kubernetes running on Amazon EC2. And it looks like you can switch traffic from one cluster to another.

In future they will add Fargate support.
In future AWS will add in App Mech something like OpenTracing solution for get visibility. And you will can use it with Cloudwatch, X-Ray, Datadog, Prometheus and Grafana!

Also, more slides from AWS Summit Berlin you can find in @aws_ru

#aws
Forwarded from Dmytro Zavalkin