DevOps&SRE Library
18.3K subscribers
456 photos
5 videos
2 files
4.93K links
Библиотека статей по теме DevOps и SRE.

Реклама: @ostinostin
Контент: @mxssl

РКН: https://www.gosuslugi.ru/snet/67704b536aa9672b963777b3
Download Telegram
Need your own incident post-mortem template? Here’s ours

Having a dedicated incident post-mortem is just as important as having a robust incident response plan. The post-mortem is key to understanding exactly what went wrong, why it happened in the first place, and what you can do to avoid it in the future.

It’s an essential document but many organizations either haphazardly put together post-incident notes that live in disparate places or don’t know where to start in creating their own post-mortems. To help, we’re sharing the incident post-mortem template that we use internally.

This template outlines our “sensible default” for documenting any incident, technical or otherwise. We believe it strikes a healthy balance between raw data, human interpretation, and concrete actions. And we say “sensible default” because it’s rare that this will perfectly cover the specific needs of your organization, and that’s fine. Think of this as a launching off point for your own incident post-mortem document.

Within each section, we’ve outlined the background on what it’s for, why it’s important, and how we advise you to complete it.

https://incident.io/blog/incident-post-mortem-template
Site Reliability Engineer (SRE) Interview Preparation Guide

This repository is an attempt to consolidate useful resources for Site Reliability Engineer (SRE) interview preparation.

https://github.com/mxssl/sre-interview-prep-guide
Devopedia

Devopedia is an open community platform for developers by developers to explain technology in a simple, clear and unopinionated way.

https://devopedia.org
mox

Mox is a modern full-featured open source secure mail server for low-maintenance self-hosted email.

https://github.com/mjl-/mox
Email explained from first principles

This article covers all aspects of modern email.

https://explained-from-first-principles.com/email
The Internet explained from first principles

https://explained-from-first-principles.com/internet
Taking the fear out of migrations

Over the last 18 months at incident.io, we’ve done a lot of migrations. Often, a new feature requires a change to our existing data model. For us to be successful, it’s important that we can seamlessly transition from the old world to the new as quickly as we can.

https://incident.io/blog/how-we-run-migrations
SRE in the Real World

SRE outside of Google.

https://blog.relyabilit.ie/sre-in-the-real-world
Our cloud spend in 2022

Since we published why we’re leaving the cloud, we’ve received a lot of questions about our actual spending. We’re happy to share, both where we currently are and where we’re going.

https://dev.37signals.com/our-cloud-spend-in-2022
SRE Evangelist

Over the last year I rebuilt an SRE team. It made me start to think a lot about what an SRE is and, maybe more importantly, what they are at GitHub specifically.

https://hross.substack.com/p/sre-evangelist
Tricky Kubernetes memory management for Java applications

How to use the Kubernetes memory requests and limits in combination with JVM Heap and stay out of trouble.

https://danoncoding.com/tricky-kubernetes-memory-management-for-java-applications-d2f88dd4e9f6
Comparing Resource Consumption in K0s vs K3s vs Microk8s

https://www.portainer.io/blog/comparing-k0s-k3s-microk8s
argocd-lovely-plugin

A plugin to make Argo CD behave like we'd like.

https://github.com/crumbhole/argocd-lovely-plugin
Backup-and-Restore of Containers with Kubernetes Checkpointing API

https://martinheinz.dev/blog/85
kubernetes-secret-generator

This repository contains a custom Kubernetes controller that can automatically create random secret values. This may be used for auto-generating random credentials for applications run on Kubernetes.

https://github.com/mittwald/kubernetes-secret-generator
Pitfalls reloading files from Kubernetes Secret & ConfigMap volumes

https://ahmet.im/blog/kubernetes-inotify/index.html