DevOps&SRE Library
19K subscribers
426 photos
2 videos
2 files
5.16K links
Библиотека статей по теме DevOps и SRE.

Реклама: @ostinostin
Контент: @mxssl

РКН: https://www.gosuslugi.ru/snet/67704b536aa9672b963777b3
Download Telegram
Load Balancers

В этой статье сделаем небольшой обзор на балансеры, как одну из важных частей в распределённых системах: какую проблему решают, как реализованы.

https://vitkarpov.me/posts/load-balancers
NetworkPolicy Editor: Create, Visualize, and Share Kubernetes NetworkPolicies

https://cilium.io/blog/2021/02/10/network-policy-editor
Knative on Kind (KonK)

Setup Knative on Kind

https://github.com/csantanapr/knative-kind
VPS Showdown - March 2021 - DigitalOcean vs. Lightsail vs. Linode vs. UpCloud vs. Vultr

https://joshtronic.com/2021/03/01/vps-showdown-digitalocean-lightsail-linode-upcloud-vultr
shell-operator & addon-operator news: hooks as admission webhooks, Helm 3, OpenAPI, Go hooks, and more!

https://blog.flant.com/shell-operator-addon-operator-v1-rc1-changes
RATE LIMITING IN CONTROLLER-RUNTIME AND CLIENT-GO

https://danielmangum.com/posts/controller-runtime-client-go-rate-limiting
Build and publish container images to any cloud with Infrastructure as Code

https://www.pulumi.com/blog/build-publish-containers-iac
Engineering dependability and fault tolerance in a distributed system

https://ably.com/blog/engineering-dependability-and-fault-tolerance-in-a-distributed-system
Migrations: the sole scalable fix to tech debt

https://lethain.com/migrations
Scaling Celery workers with RabbitMQ on Kubernetes

https://learnk8s.io/scaling-celery-rabbitmq-kubernetes
Atlas: Our journey from a Python monolith to a managed platform

In this post, we’ll explain why and how we developed and deployed Atlas, a platform which provides the majority of benefits of a Service Oriented Architecture, while minimizing the operational cost that typically comes with owning a service. 

https://dropbox.tech/infrastructure/atlas--our-journey-from-a-python-monolith-to-a-managed-platform
cloudsplaining

Cloudsplaining is an AWS IAM Security Assessment tool that identifies violations of least privilege and generates a risk-prioritized HTML report.

https://github.com/salesforce/cloudsplaining
How they SRE

A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)

https://github.com/upgundecha/howtheysre