DevOps&SRE Library
A library of articles on DevOps and SRE.

Advertising: @ostinostin
Content: @mxssl

RKN: https://www.gosuslugi.ru/snet/67704b536aa9672b963777b3
Grafana alerts as code: Get started with Terraform and Grafana Alerting

https://grafana.com/blog/2022/09/20/grafana-alerts-as-code-get-started-with-terraform-and-grafana-alerting
pg_activity

pg_activity is a top-like application for monitoring PostgreSQL server activity.

https://github.com/dalibo/pg_activity
agnos

Obtain (wildcard) certificates from Let's Encrypt using the DNS-01 challenge, without needing API access to your DNS provider.

https://github.com/krtab/agnos
tracetest

Tracetest is an OpenTelemetry-based tool that helps you develop and test distributed applications. During development, it lets you trigger your code and see the resulting trace as you add OTel instrumentation. It also lets you create trace-based tests from the data in your OpenTelemetry trace: you can assert against both the triggering transaction's response and any information contained deep in a span of the trace.

https://github.com/kubeshop/tracetest
Observability Best Practices when running FastAPI in a Lambda

https://www.eliasbrange.dev/posts/observability-with-fastapi-aws-lambda-powertools
k8spacket

k8spacket - packet traffic visualization for Kubernetes

https://github.com/k8spacket/k8spacket
bindplane-op

BindPlane OP is an open source observability pipeline that gives you the ability to collect, refine, and ship metrics, logs, and traces to any destination. BindPlane OP provides the controls you need to reduce observability costs and simplify the deployment and management of telemetry agents at scale.

https://github.com/observIQ/bindplane-op
6 Best Practices for Effective Readiness and Liveness Probes

https://www.datree.io/resources/kubernetes-readiness-and-liveness-probes-best-practices
Optimizing TCP for high WAN throughput while preserving low latency

https://blog.cloudflare.com/optimizing-tcp-for-high-throughput-and-low-latency
Slowing Down to Speed Up – Circuit Breakers for Slack’s CI/CD

How Slack increased developer productivity and prevented cascading internal failures by implementing orchestration-level circuit breakers

https://slack.engineering/circuit-breakers
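For readers new to the pattern, here is a minimal, generic circuit breaker sketch in Python. It is not Slack's orchestration-level implementation. The class name, the thresholds, and the injectable `clock` parameter are all hypothetical and chosen only for illustration. After a configured number of consecutive failures the breaker "opens" and rejects calls until a cool-down has passed, at which point one trial call is allowed through.

```python
import time


class CircuitOpenError(RuntimeError):
    """Raised when a call is skipped because the circuit is open."""


class CircuitBreaker:
    def __init__(self, max_failures=3, reset_after=30.0, clock=time.monotonic):
        self.max_failures = max_failures  # consecutive failures before opening
        self.reset_after = reset_after    # cool-down in seconds
        self.clock = clock                # injectable for testing (assumption)
        self.failures = 0
        self.opened_at = None             # None means the circuit is closed

    def call(self, func, *args, **kwargs):
        if self.opened_at is not None:
            if self.clock() - self.opened_at < self.reset_after:
                # Still cooling down: fail fast instead of hitting the dependency.
                raise CircuitOpenError("circuit open; call skipped")
            # Cool-down elapsed: fall through and allow one trial call (half-open).
        try:
            result = func(*args, **kwargs)
        except Exception:
            self.failures += 1
            # A failed trial call, or too many failures, (re)opens the circuit.
            if self.opened_at is not None or self.failures >= self.max_failures:
                self.opened_at = self.clock()
            raise
        # Success closes the circuit and resets the failure count.
        self.failures = 0
        self.opened_at = None
        return result
```

Wrapping each downstream call in `breaker.call(...)` means a struggling dependency stops receiving traffic for a while rather than being hammered by retries, which is the cascading-failure scenario the article describes.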
gprofiler

gProfiler is a system-wide profiler that combines multiple sampling profilers to produce a unified visualization of what your CPU is spending time on.

https://github.com/Granulate/gprofiler
jc

CLI tool and Python library that converts the output of popular command-line tools, file types, and common strings to JSON, YAML, or dictionaries. This lets you pipe output to tools like jq and simplifies automation scripts.

https://github.com/kellyjonbrazil/jc
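To illustrate the general idea of turning command output into structured data, here is a small standard-library sketch. It does not use jc and is not how jc is implemented. The function names and the sample text are hypothetical, and the sketch assumes simple whitespace-delimited output with a header row.

```python
import json


def table_to_records(text):
    """Convert whitespace-delimited tabular text (header row first) to dicts."""
    lines = [line for line in text.splitlines() if line.strip()]
    if not lines:
        return []
    headers = [h.lower() for h in lines[0].split()]
    records = []
    for line in lines[1:]:
        # Split at most len(headers) - 1 times so the last column may hold spaces.
        fields = line.strip().split(None, len(headers) - 1)
        records.append(dict(zip(headers, fields)))
    return records


def to_json(text):
    """Serialize the parsed records so they can be piped to a tool such as jq."""
    return json.dumps(table_to_records(text), indent=2)


# Made-up sample resembling process-listing output.
SAMPLE = """\
PID USER COMMAND
1 root /sbin/init splash
742 postgres postgres -D /var/lib/postgresql
"""

if __name__ == "__main__":
    print(to_json(SAMPLE))
```

Real command output is far less regular than this sample, which is why a tool with dedicated per-command parsers is useful.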