DevOps&SRE Library
19K subscribers
426 photos
2 videos
2 files
5.15K links
Библиотека статей по теме DevOps и SRE.

Реклама: @ostinostin
Контент: @mxssl

РКН: https://www.gosuslugi.ru/snet/67704b536aa9672b963777b3
Download Telegram
kubernetes-best-practices

A cookbook with the best practices to working with kubernetes.

https://github.com/diegolnasc/kubernetes-best-practices
Below: a time travelling resource monitoring tool

below was designed and developed by the resource control team at Facebook to view and record historical Linux system data.

https://developers.facebook.com/blog/post/2021/09/21/below-time-travelling-resource-monitoring-tool

code: https://github.com/facebookincubator/below
SLOs and why you should care

Ever wondered what all the fuss over Service Level Objectives (SLOs) is about? Let’s find out.

https://engineering.solarisbank.com/slos-and-why-you-should-care-136f80bf686e
kubermetrics

Kubermetrics is an open-source dev tool that provides Kubernetes cluster monitoring as well as data visualization in a simple and easy to understand user interface. Kubermetrics intergrates both the Prometheus and Grafana Dashboards on one page! Allowing for custominzable dashboards and alerts.

https://github.com/oslabs-beta/kubermetrics
What is expected in the SRE role? We analyzed 30 job postings to find out

https://spike.sh/blog/sre-role-2021-analysed-30-job-postings
Making Kubernetes Operations Easy with kubectl Plugins

https://martinheinz.dev/blog/58
peirates

Peirates, a Kubernetes penetration tool, enables an attacker to escalate privilege and pivot through a Kubernetes cluster. It automates known techniques to steal and collect service accounts, obtain further code execution, and gain control of the cluster.

https://github.com/inguardians/peirates
youki

youki is an implementation of the OCI runtime-spec in Rust, similar to runc.

https://github.com/containers/youki
Reverse Proxy, HTTP Keep-Alive Timeout, and sporadic HTTP 502s

https://iximiuz.com/en/posts/reverse-proxy-http-keep-alive-and-502s
automated-cloud-advisor

Automated Cloud Advisor is an extensible tool that aims at facilitating cost optimization in AWS, by collecting data for resources that are under utilized. In addition, this is a great learning tool for new DevOps/Cloud engineers that want to start automating things in AWS.

https://github.com/disneystreaming/automated-cloud-advisor
The Speed of Time

How long does it take to read the time? How would you time time? These strange questions came to the fore back in 2014 when Netflix was switching services from CentOS Linux to Ubuntu, and I helped debug several weird performance issues including one I'll describe here.

https://www.brendangregg.com/blog/2021-09-26/the-speed-of-time.html
pgmetrics

pgmetrics is an open-source, zero-dependency, single-binary tool that can collect 350+ metrics from a running PostgreSQL server and display it in easy-to-read text format or export it as JSON and CSV for scripting.

https://pgmetrics.io