Мониторим ИТ – Telegram

Мониторим ИТ

8.07K subscribers

200 photos

2 files

1.52K links

Канал о наблюдаемости (Monitoring & Observability): логи, трейсы, метрики.

Реклама: @gals_ad_bot
Вопросы: @antoniusfirst

@usr_bin_linux — Linux, Kubernetes, Docker, Terraform, etc.

@zabbix_ru — только Zabbix

@elasticstack_ru — ElasticSearch/OpenSearch

Download Telegram

About

Blog

Apps

Platform

Мониторим ИТ

8.07K subscribers

Мониторим ИТ

Подход, который поможет снизить количество событий в системе мониторинга — использование множественных проверок и зависимых триггеров. На приложенном скриншоте пример проверок доступности Zabbix-агента. Здесь его доступность проверяется тремя способами:

⚡️ ICMP Ping

⚡️ Проверка доступности порта агента 10050

⚡️ Проверка agent.ping

В зависимости от статуса каждой из проверок, в системе мониторинга срабатывает тот или иной триггер. Кроме того, такой подход позволит сразу же назначать инцидент на правильного инженера: системного, сетевого или ответственного за мониторинг. Применение подобного подхода для других систем поможет заметно ускорить выявление истинной причины недоступности чего-либо и снизит количество шумовых событий.

1.73K views08:00

👍 12 👎👀 2

Открыть комментарии

Мониторим ИТ

New in Grafana 7.2: $__rate_interval for Prometheus rate queries that just work

What range should I use with rate()? That’s not only the title of a true classic among the many useful Robust Perception blog posts; it’s also one of the most frequently asked questions when it comes to PromQL, the Prometheus query language. Читать дальше в блоге Grafana.

1.7K views11:05

Мониторим ИТ

Forwarded from DevOps Tricks | Десять лет в IT

Иногда мы сталкиваемся с распределенной через Интернет инфраструктурой. В случае если отсутствует VPN, использование активных агентов zabbix - отличный способ настроить мониторинг серверов и рабочих станций. Но что, если мы хотим просто проверить доступность IP-камеры и других устройств, расположенных за NAT?
Конечно использовать агента!

Разработал шаблон для таких кейсов, доступно на zabbix-share

Zabbix Share - Template Windows ICMP Macro Discovery Active

Sometimes we are faced with an infrastructure distributed over the Internet. In case there is no VPN, using zabbix active agents is a great way to configure monitoring of servers and workstations.

1.68K views15:14

Мониторим ИТ

How to Setup PostgreSQL Monitoring in Kubernetes

You don't need monitoring until you need it. But if you're running anything in production, you always need it. Читать дальше.

PostgreSQL Blog | Crunchy Data

PostgreSQL experts from Crunchy Data share advice, performance tips, and guides on successfully running PostgreSQL and Kubernetes solutions

1.79K views16:17

Мониторим ИТ

Promscale: An analytical platform and long-term store for Prometheus, with the combined power of SQL and PromQL

In this post we introduce Promscale, a new open-source long-term store for Prometheus data designed for analytics. Читать дальше.

2.72K views08:03

Мониторим ИТ

vRealize Operations 8.2 is now GA!

Кстати, да.

VMware Cloud Management

Announcing GA of vRealize Operations 8.2 and vRealize Operations Cloud

vRealize Operations 8.2 is now GA! This blog was co-authored with Brandon Gordon and John Dias. It doesn’t seem that long ago that we announced the latest release of vRealize Operations and vRealize Operations Cloud. However, a lot has happened in that…

1.77K views13:12

Мониторим ИТ

Now GA: Cortex blocks storage for running Prometheus at scale with reduced operational complexity

We’ve just launched Cortex 1.4.0, one of the most significant releases of 2020. The big headline: The new blocks storage engine has exited the experimental phase and is now marked as Generally Available. Читать дальше.

2.71K views11:18

Мониторим ИТ

PostgreSQL Monitoring for Application Developers: The Vitals

My professional background has been in application development with a strong affinity for developing with PostgreSQL (which I hope comes through in previous articles). However, in many of my roles, I found myself as the "accidental" systems administrator, where I would troubleshoot issues in production and do my best to keep things running and safe. Читать дальше.

PostgreSQL Monitoring for Application Developers: The Vitals

What are some of the key stats to look at to ensure your PostgreSQL cluster is healthy? How can you use this stats to diagnose the problem?

1.76K views05:34

Мониторим ИТ

New in Grafana Tanka: Customize Helm charts without modifying them

Helm charts are great. They combine high quality, ready-made runtime configurations for a huge number of applications with an incredible getting-started experience. Читать дальше.

New in Grafana Tanka: Customize Helm charts without modifying them | Grafana Labs

Grafana Tanka now enables you to load Helm charts into Jsonnet and treat them as regular JSON objects.

1.84K views09:00

Мониторим ИТ

sysmon

Graphical system monitor for linux, including information about CPU, GPU, Memory, HDD/SDD and your network connections. Similar to windows task manager. Репозиторий.

GitHub - MatthiasSchinzel/sysmon: Graphical system monitor for linux, including information about CPU, GPU, Memory, HDD/SDD and…

Graphical system monitor for linux, including information about CPU, GPU, Memory, HDD/SDD and your network connections. Similar to windows task manager. - MatthiasSchinzel/sysmon

2.09K views16:05

Мониторим ИТ

PostgreSQL Monitoring for App Developers: Alerts & Troubleshooting

If you choose only one thing to alert on in your PostgreSQL cluster (and as I hope this article makes clear, you should alert on multiple things), it should be availability. If your application is unable to connect or transaction with your database, you're probably in for a bad day. Читать дальше.

PostgreSQL Monitoring for App Developers: Alerts & Troubleshooting

When should you be alerted about issues in your PostgreSQL clusters? How do you troubleshoot them? What are some typical solutions?

2.9K views09:00

Мониторим ИТ

5 Prometheus Exporter Best Practices

20 октября Sysdig проведёт вебинар. Регистрация.

⚡ Find the right Prometheus exporter

⚡ Understand your exporter metrics

⚡ Set alerts that matter and are actionable

⚡️ Enable your team to use your data (or not)

⚡️ Have a plan for scale

1.87K views17:19

Мониторим ИТ

Percona представляет новый плагин для мониторинга PostgreSQL — pg_stat_monitor.

Проект на Гитхабе.

GitHub - percona/pg_stat_monitor: Query Performance Monitoring Tool for PostgreSQL

Query Performance Monitoring Tool for PostgreSQL. Contribute to percona/pg_stat_monitor development by creating an account on GitHub.

3K views06:00

Мониторим ИТ

We’re making Prometheus use less memory and restart faster

A few months ago, I blogged about memory-mapping of full chunks of the head block from disk. The feature, which was introduced in Prometheus v2.19.0, brings down memory usage and restart time.

Additionally, there’s another Prometheus feature in progress that snapshots in-memory data during shutdown for faster restarts; it’s expected to cut down the restart times by a big factor. Интересно, как это.

We’re making Prometheus use less memory and restart faster | Grafana Labs

Here's a recap of the new Prometheus features that are bringing down memory usage and restart time.

2.9K views16:50

Мониторим ИТ

How we improved our Kubernetes monitoring at Smarkets, and how you could too

Monitoring Kubernetes internal endpoints and APIs can be tricky, especially when you want automated infrastructure as a service to be used in your company. At Smarkets, we are not fully there yet, but thankfully we are close. I’m hoping that our journey through the process will help you if you wish to do something similar. Читать дальше.

How we improved our Kubernetes monitoring at Smarkets, and how you could too

Monitoring Kubernetes internal endpoints and APIs can be tricky when you want automated infrastructure as a service to be used company…

1.98K views12:00

Мониторим ИТ

Зонтичная система мониторинга, ресурсно-сервисные модели ML, AI и вот это всё в DX OI от Broadcom (бывший CA).

На Хабр!

Зонтичная система мониторинга и ресурсно-сервисные модели в обновленном DX Operations Intelligence от Broadcom (ex. CA)

В этом сентябре Broadcom (бывшая CA) выпустила новую версию 20.2 своего решения DX Operations Intelligence (DX OI). На рынке этот продукт позиционируется как зонтичная система мониторинга. Система...

2.08K viewsedited 06:05

Мониторим ИТ

Мониторинг СХД IBM Storwize при помощи Zabbix

В данной статье мы немного поговорим о мониторинге СХД IBM Storwize и других СХД, поддерживающих протоколы CIM/WBEM. Необходимость такого мониторинга оставлена за скобками, будем считать это аксиомой. В качестве системы мониторинга будем использовать Zabbix. На Хабр!

2.25K views12:43

Мониторим ИТ

Добавляем CMDB и географическую карту к Zabbix

В этой статье расскажем о паре инструментов для расширения функционала Zabbix: CMDB на базе бесплатного решения iTop и карте объектов на базе OpenStreetMap (OSM). А в конце статьи ваш ждет ссылка на репозиторий с кодом фронтовой части для OSM. Читать дальше.

2.41K views15:00

Мониторим ИТ

Как устроен прикладной и бизнес-мониторинг сервисов НСПК

Zabbix, Elastic и Splunk.

Как устроен прикладной и бизнес-мониторинг сервисов НСПК

НСПК сегодня – это не просто операционно-клиринговый центр для карточных операций, но и современная технологическая платформа для продвижения и развития платёжных инструментов и сервисов, как на...

2.46K views11:11

Мониторим ИТ

A guide to setting up Kubernetes Service Level Objectives (SLOs) with Prometheus and Linkerd

In this tutorial, we’re going to see how to set up a basic success rate SLO with a rolling window for a gRPC service running on Kubernetes. Of course, the techniques we use here are just as applicable to different types of metrics and SLOs. Читать дальше.

4.39K views15:00