DevOps&SRE Library
A library of articles on DevOps and SRE.

Advertising: @ostinostin
Content: @mxssl

Roskomnadzor (RKN) registration: https://www.gosuslugi.ru/snet/67704b536aa9672b963777b3
Become A DevOps Engineer in 2023: [Detailed Guide]

In this detailed guide, I share a roadmap for becoming a DevOps engineer, based on my 10 years of experience working as one in various organizations.

https://devopscube.com/become-devops-engineer
Etcd Backup and Restore on Kubernetes Cluster [Tutorial]

In this Kubernetes tutorial, you will learn how to back up and restore etcd on a Kubernetes cluster using an etcd snapshot.

https://devopscube.com/backup-etcd-restore-kubernetes
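
As a rough illustration of the snapshot step, the sketch below only builds the `etcdctl snapshot save` command line and does not run it. The endpoint and the certificate paths are assumptions (typical kubeadm defaults), and the helper name `etcd_snapshot_command` is hypothetical; adjust all of them to the cluster at hand.

```python
from typing import Dict, List, Tuple


def etcd_snapshot_command(
    snapshot_path: str,
    endpoint: str = "https://127.0.0.1:2379",      # assumed local etcd endpoint
    pki_dir: str = "/etc/kubernetes/pki/etcd",     # assumed kubeadm certificate directory
) -> Tuple[Dict[str, str], List[str]]:
    """Return the extra environment and argv for taking an etcd snapshot."""
    # etcdctl must speak the v3 API for the snapshot subcommand.
    env = {"ETCDCTL_API": "3"}
    argv = [
        "etcdctl",
        f"--endpoints={endpoint}",
        f"--cacert={pki_dir}/ca.crt",
        f"--cert={pki_dir}/server.crt",
        f"--key={pki_dir}/server.key",
        "snapshot",
        "save",
        snapshot_path,
    ]
    return env, argv


if __name__ == "__main__":
    env, argv = etcd_snapshot_command("/opt/backup/etcd.db")
    # Print a copy-pasteable command line instead of executing it.
    print(" ".join([f"{k}={v}" for k, v in env.items()] + argv))
```

To actually take the snapshot, the returned values could be passed to `subprocess.run(argv, env={**os.environ, **env}, check=True)` on a control plane node.
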
pgweb

Pgweb is a web-based database explorer for PostgreSQL, written in Go, that works on Mac, Linux and Windows machines. It is distributed as a simple binary with zero dependencies, is very easy to use, and packs just the right amount of features.

https://github.com/sosedoff/pgweb
gitlab-ci-local

Run GitLab pipelines locally with the shell executor or the Docker executor.

https://github.com/firecow/gitlab-ci-local
Redis Explained

A deep technical dive into all things Redis, covering various Redis topologies, data persistence, and process forking.

https://architecturenotes.co/redis
Alertmanager incident response automation with n8n

The Prometheus monitoring stack includes an alert-dispatching component called Alertmanager. Many integrations are available to dispatch alerts to notification channels such as pagers and Slack. But how to add automated responses easily and efficiently is the question we’ll try to answer here.

https://touilleio.medium.com/alertmanager-incident-response-automation-with-n8n-c61227e196e9
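
To make the idea of an automated response more concrete, here is a minimal sketch of what a webhook receiver might do with an Alertmanager notification. It is not the n8n workflow from the article. The payload fields used (`alerts`, `status`, `labels`) follow Alertmanager's webhook format, while the `RUNBOOK` mapping, the alert names, and the action names are hypothetical.

```python
import json
from typing import List, Tuple

# Hypothetical mapping from alert name to a remediation action.
RUNBOOK = {
    "DiskAlmostFull": "cleanup-logs",
    "PodCrashLooping": "restart-deployment",
}


def plan_actions(payload: str) -> List[Tuple[str, str]]:
    """Return (action, instance) pairs for firing alerts that have a runbook entry."""
    body = json.loads(payload)
    actions = []
    for alert in body.get("alerts", []):
        # Resolved alerts need no remediation.
        if alert.get("status") != "firing":
            continue
        labels = alert.get("labels", {})
        action = RUNBOOK.get(labels.get("alertname", ""))
        if action is not None:
            actions.append((action, labels.get("instance", "unknown")))
    return actions


if __name__ == "__main__":
    sample = json.dumps({
        "status": "firing",
        "alerts": [
            {"status": "firing",
             "labels": {"alertname": "DiskAlmostFull", "instance": "db-1"}},
            {"status": "resolved",
             "labels": {"alertname": "PodCrashLooping", "instance": "api-2"}},
        ],
    })
    print(plan_actions(sample))
```

Alerts without a matching entry are deliberately ignored, so unknown alerts still go only to the human notification channels.
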
Database Sharding Explained

As features, active users, and data grow every day, your database starts to slow your application down. Database sharding could solve the problem, but many people don't understand it. This article explains what database sharding is, its benefits, how to use it, and when not to.

https://architecturenotes.co/database-sharding-explained
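
As a small illustration of the core idea, the sketch below routes a record to a shard by hashing its key. This is only one possible scheme (hash-based sharding with a fixed shard count), chosen here for brevity rather than taken from the article; the function name `shard_for` and the example keys are hypothetical.

```python
import hashlib


def shard_for(key: str, shard_count: int) -> int:
    """Map a key to a shard index in [0, shard_count) using a stable hash."""
    if shard_count <= 0:
        raise ValueError("shard_count must be positive")
    # hashlib gives the same result across processes and restarts,
    # unlike the built-in hash(), which is randomized for strings.
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % shard_count


if __name__ == "__main__":
    for user_id in ("user-1", "user-2", "user-3"):
        print(user_id, "-> shard", shard_for(user_id, 4))
```

A plain modulo like this remaps most keys when the shard count changes, which is one reason resharding is costly and why schemes such as consistent hashing or directory-based lookup exist.
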
Building ClickHouse Cloud From Scratch in a Year

Have you ever wondered what it takes to build a serverless software as a service (SaaS) offering in under a year? In this blog post, we will describe how we built ClickHouse Cloud – a managed service on top of one of the most popular online analytical processing (OLAP) databases in the world – from the ground up. We delve into our planning process, design and architecture decisions, security and compliance considerations, how we achieved global scalability and reliability in the cloud, and some of the lessons we learned along the way.

https://clickhouse.com/blog/building-clickhouse-cloud-from-scratch-in-a-year
12 Factor App Revisited

The Twelve-Factor App is a methodology by Adam Wiggins for building software-as-a-service applications. We cover how the factors have evolved since then, what we can learn from them today, and how they changed the status quo of yesteryear.

https://architecturenotes.co/12-factor-app-revisited
Navigating the Storm: Strategies for Managing Production Incidents

Streamlining Your Team’s Incident Response

https://medium.com/saas-infra/navigating-the-storm-strategies-for-managing-production-incidents-e92ece8315c
Relational Databases Explained

How relational databases work: this post explains how indexes and transactions work inside relational databases.

https://architecturenotes.co/things-you-should-know-about-databases
goreplay

GoReplay is an open-source tool for capturing and replaying live HTTP traffic into a test environment in order to continuously test your system with real data. It can be used to increase confidence in code deployments, configuration changes and infrastructure changes.

https://github.com/buger/goreplay
Writing Terraform for unsupported resources

TerraCurl is a utility Terraform provider that allows engineers to make managed and unmanaged API calls in their Terraform code.

https://www.hashicorp.com/blog/writing-terraform-for-unsupported-resources
terraform-workspaces-terragrunt-ansible

There are multiple ways to configure environment settings in Terraform. This repo initially evaluated four of them, but it has since branched out to more possible methods, with the aim of writing DRY, easy-to-maintain code.

https://github.com/neilpricetw/terraform-workspaces-terragrunt-ansible
The yaml document from hell

For a data format, YAML is extremely complicated. It aims to be a human-friendly format, but in striving for that it introduces so much complexity that I would argue it achieves the opposite result. YAML is full of footguns and its friendliness is deceptive. In this post I want to demonstrate this through an example.

This post is a rant, and more opinionated than my usual writing.

https://ruudvanasseldonk.com/2023/01/11/the-yaml-document-from-hell
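
One widely cited footgun of this kind is implicit typing in YAML 1.1, where an unquoted `no` (for example, the country code for Norway) is read as a boolean. The sketch below is a simplified model of that resolution rule in plain Python, not a YAML parser; the function name `resolve_plain_scalar` is hypothetical, and real parsers differ in detail (the YAML 1.2 core schema, for instance, only treats `true` and `false` spellings as booleans).

```python
import re

# Plain (unquoted) scalars that the YAML 1.1 bool type resolves to booleans.
_YAML11_BOOL = re.compile(
    r"^(?:y|Y|yes|Yes|YES|n|N|no|No|NO"
    r"|true|True|TRUE|false|False|FALSE"
    r"|on|On|ON|off|Off|OFF)$"
)
_TRUTHY = {"y", "yes", "true", "on"}


def resolve_plain_scalar(text: str):
    """Return a bool if YAML 1.1 would read the unquoted scalar as one, else the string."""
    if _YAML11_BOOL.match(text):
        return text.lower() in _TRUTHY
    return text


if __name__ == "__main__":
    # Country codes written without quotes: "no" silently stops being a string.
    for code in ("se", "dk", "no"):
        print(code, "->", repr(resolve_plain_scalar(code)))
```

Quoting the value in the document (`"no"`) sidesteps the problem, since the rule applies only to plain scalars.
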