Dev0ps – Telegram

Dev0ps

40 subscribers

211 photos

3 videos

50 files

3.33K links

Download Telegram

About

Blog

Apps

Platform

https://github.com/Noovolari/leapp

GitHub - Noovolari/leapp: Leapp is the DevTool to access your cloud

Leapp is the DevTool to access your cloud. Contribute to Noovolari/leapp development by creating an account on GitHub.

13 views19:49

Forwarded from Мониторим ИТ

PostgreSQL Monitoring for App Developers: Alerts & Troubleshooting

If you choose only one thing to alert on in your PostgreSQL cluster (and as I hope this article makes clear, you should alert on multiple things), it should be availability. If your application is unable to connect or transaction with your database, you're probably in for a bad day. Читать дальше.

PostgreSQL Monitoring for App Developers: Alerts & Troubleshooting

When should you be alerted about issues in your PostgreSQL clusters? How do you troubleshoot them? What are some typical solutions?

11 views06:33

Forwarded from Мониторим ИТ

Percona представляет новый плагин для мониторинга PostgreSQL — pg_stat_monitor.

Проект на Гитхабе.

GitHub - percona/pg_stat_monitor: Query Performance Monitoring Tool for PostgreSQL

Query Performance Monitoring Tool for PostgreSQL. Contribute to percona/pg_stat_monitor development by creating an account on GitHub.

13 views06:35

https://grafana.com/blog/2020/06/23/how-to-visualize-prometheus-histograms-in-grafana/

How to visualize Prometheus histograms in Grafana | Grafana Labs

Learn how to turn a Prometheus histogram into a stat panel, bar gauge, or heat map in Grafana

16 views06:41

https://www.w3.org/TR/trace-context/

This specification defines standard HTTP headers and a value format to propagate context information that enables distributed tracing scenarios. The specification standardizes how context information is sent and modified between services. Context information…

16 views08:27

https://www.softether.org/

15 views16:09

Forwarded from Записки админа

📟 Save your engineers' sleep: best practices for on-call processes. Собственно, из названия всё понятно - полезные советы для организации on-call процесса здорового человека.

#напочитать #support #oncall

8 views21:36

Forwarded from Грефневая Кафка (pro.kafka)

Время от времени спрашивают как делать приложения, чтобы при падении Кафки приложение не падало. Мне вспомнилась статья Jakub Korab как раз где он разбирается в различных подходах к решению этой задачи.

https://www.confluent.io/blog/how-to-survive-a-kafka-outage/

Apache Kafka® Broker Failures & Other Outages

Learn common causes of Apache Kafka® broker failures, as well as how to recover from outages and ensure high availability and resilience in your Kafka cluster.

16 views21:51

Forwarded from Updates rtfm.co.ua 🇺🇦 (rtfmcoua)

Prometheus: Recording Rules и теги – разделяем алерты в Slack

С 2018 года используем Opsgenie, который получает алерты от Prometheus, CloudWatch и Uptrends, которые потом через Slack-интеграцию отправляет нам в Slack. Интеграции Slack на данный момент выглядят так: В каждой из них настроен фильтр по уровню важности, например интеграция P1, P2 > Slack #devops-alarms-warning: Но есть проблема: так как каналы получаются общие, то все алерты…

https://rtfm.co.ua/prometheus-recording-rules-i-tegi-razdelyaem-alerty-v-slack/

RTFM: Linux, DevOps и системное администрирование | DevOps-инжиниринг и системное администрирование. Случаи из практики.

Prometheus: Recording Rules и теги — разделяем алерты в Slack

Применение Prometheus Recording Rules и Tags для выбора Slack-канала, используя Opsgenie

13 views19:12

https://www.spektor.dev/how-to-stream-mongodb-changes-to-kafka/

11 views20:16

Forwarded from Eugene 🦁

https://www.youtube.com/watch?v=swQbA4zub20

Слушая и радуюсь
Выглядит круто

AWS re:Invent 2018: How AWS Minimizes the Blast Radius of Failures (ARC338)

At AWS, we obsess over operational excellence. We have a deep understanding of system availability, informed by over a decade of experience operating the cloud and our roots of operating Amazon.com for nearly a quarter-century. One thing we've learned is…

12 views05:04

Forwarded from Записки админа

🔧 Sanoid - система управления ZFS снапшотами в Linux, которая, работая вместе с KVM, позволяет развернуть снапшот и восстановить работу виртуального сервера одной командой (собственно, как и любой другой правильный подход работы со снапшотами).

https://github.com/jimsalterjrs/sanoid

#zfs #backup #напочитать

GitHub - jimsalterjrs/sanoid: These are policy-driven snapshot management and replication tools which use OpenZFS for underlying…

These are policy-driven snapshot management and replication tools which use OpenZFS for underlying next-gen storage. (Btrfs support plans are shelved unless and until btrfs becomes reliable.) - jim...

12 views05:18

https://ieftimov.com/post/deep-dive-cors-history-how-it-works-best-practices/

Ilija Eftimov 👨‍🚀

Deep dive in CORS: History, how it works, and best practices

Learn the history and evolution of same-origin policy and CORS, understand CORS and the different types of cross-origin access in depth, and learn (some) best practices.

14 views05:58

Микрооптимизация кода на Go на примере простого веб-сервиса / Хабр
https://habr.com/ru/company/kaspersky/blog/591725/

Микрооптимизация кода на Go на примере простого веб-сервиса

Привет, Хабр! Я работаю старшим Go-разработчиком в «Лаборатории Касперского». Сегодня хочу поговорить о том, как искать узкие места и оптимизировать код на Go. Разберу процесс профилирования и...

15 views19:31

https://slack.engineering/tracing-at-slack-thinking-in-causal-graphs/

Engineering at Slack

Tracing at Slack: Thinking in Causal Graphs - Engineering at Slack

“Why is it slow?” is the hardest problem to debug in a complex distributed system like Slack. To diagnose a slow-loading channel with over a hundred thousand users, we’d need to look at client-side metrics, server-side metrics, and logs. It could be a client…

14 views20:51

https://www.honeycomb.io/play/

Play - Honeycomb

13 views21:19

https://github.com/cilium/pwru

GitHub - cilium/pwru: Packet, where are you? -- eBPF-based Linux kernel networking debugger

Packet, where are you? -- eBPF-based Linux kernel networking debugger - cilium/pwru

12 views21:20

Forwarded from DevOps&SRE Library

Effective IAM for Amazon Web Services

Effective IAM for Amazon Web Services is for Cloud engineers who design, develop, and review AWS IAM security policies in their daily work.

If you're struggling to deliver effective AWS security policies, this guide will help you understand why it's hard and how both you and your organization can use IAM well.

The AWS IAM documentation tells you what you can do. This guide will show you how to scale IAM best practices to all developers.

https://www.effectiveiam.com

9 views21:29

Forwarded from DevOps&SRE Library

lake

Dev Lake brings all your DevOps data into one practical, personalized, extensible view. Ingest, analyze, and visualize data from an ever-growing list of developer tools, with our free and open source product.

Dev Lake is most exciting for leaders and managers looking to make better sense of their development data, though it's useful for any developer looking to bring a more data-driven approach to their own practices. With Dev Lake you can ask your process any question, just connect and query.

https://github.com/merico-dev/lake

10 views21:31

Forwarded from Записки админа

📝 OOPS writeups - хороший пример такого, как можно оформлять отчёты о тех или иных происшествиях. #sre #напочитать #будничное

9 views21:37