DevOps&SRE Library
19.3K subscribers
427 photos
2 videos
2 files
5.25K links
Библиотека статей по теме DevOps и SRE.

Реклама: @ostinostin
Контент: @mxssl

РКН: https://www.gosuslugi.ru/snet/67704b536aa9672b963777b3
Download Telegram
🎥 Вебинар: «Организуем CD с помощью Ansible и GitLab CI»

О чём поговорим:
- Как организовать автоматизированный процесс деплоя с помощью GitLab CI и Ansible.
- Как использовать Ansible Playbooks и Roles для управления инфраструктурой.
- Лучшие практики по обновлению сервисов без даунтайма и обработке ошибок.

Что вы получите:
- Вы освоите автоматизацию CD-процессов с Ansible и GitLab CI.
- Сможете разрабатывать гибкие и безопасные пайплайны для деплоя в разных окружениях.
- Поймёте, как уменьшить риск ошибок при развертывании и минимизировать время простоя сервисов.
- Научитесь управлять конфигурациями инфраструктуры без лишней ручной работы.

👉 Для участия зарегистрируйтесь: https://vk.cc/cXAbo1
🎁 Все участники вебинара получат специальные условия на полное обучение курса «DevOps практики и инструменты»

Реклама. ООО «Отус онлайн-образование», ОГРН 1177746618576, www.otus.ru, erid: 2VtzqxMNiF9
waffle

Waffle is a CLI utility that automates AWS Well-Architected Framework Reviews by analyzing Terraform infrastructure using Amazon Bedrock foundation models via direct API invocation in AWS. The Well-Architected Framework questions are then being analyzed by Amazon Bedrock and answered and posted directly to the Well-Architected tool in AWS.


https://github.com/partly-notes/waffle
pgBackRest is dead. Now what?

I have been recommending pgBackRest as the best backup tool for PostgreSQL for years.


https://mydbanotebook.org/posts/pgbackrest-is-dead.-now-what
We built a self-healing registry mirror (because Docker Hub rate limits are no fun)

If you've ever stared at `ImagePullBackOff` in your cluster at 2 PM on a Tuesday, you know — Docker Hub rate limits hit, your pods can't pull, and a perfectly fine deployment is stuck.


https://tinysystems.io/blog/registry-mirror-automation
Running Temporal.io on Kubernetes in Production — What Nobody Tells You

A practical guide to GKE deployment, Cassandra backups, Istio security, and surviving your first outage


https://medium.com/@devansh2054/running-temporal-io-on-kubernetes-in-production-what-nobody-tells-you-d1f336e99306
What 6 Months of Tracking a Production OpenShift Cluster Revealed About Kubernetes Costs

Most Kubernetes teams track pod CPU and memory. Almost none track what the cluster actually costs to run.


https://blog.kubeledger.io/what-6-months-of-tracking-a-production-openshift-cluster-revealed-about-kubernetes-costs
Orchestrating Secure AI Agents on Amazon EKS

How we went from scaling video analysis on EKS to running autonomous coding agents in a custom agent harness, and why Kubernetes was the obvious choice.


https://dev.to/mattcamp/orchestrating-secure-ai-agents-on-amazon-eks-50kh
LLMs on Kubernetes: The Easy Way

Sometimes, you just want to run a Large Language Model (LLM)… no Jupyter notebook, no training pipeline, no fancy UI.


https://pittar.medium.com/llms-on-kubernetes-the-easy-way-f1ff6e0d47be
pii-shield

Zero-code log sanitization sidecar for Kubernetes. Prevents data leaks (GDPR/SOC2) by redacting PII from logs before they leave the pod.


https://github.com/aragossa/pii-shield
kubebuilder

Kubebuilder is a framework for building Kubernetes APIs using custom resource definitions (CRDs).


https://github.com/kubernetes-sigs/kubebuilder
chartpack

A single, opinionated Helm chart for deploying any Kubernetes application workload. Instead of maintaining separate charts per application, define your entire deployment through values.


https://github.com/cotzo/chartpack
hunk

Hunk is a review-first terminal diff viewer for agent-authored changesets, built on OpenTUI and Pierre diffs.


https://github.com/modem-dev/hunk
Not all index scans are equal: How we cut query latency by over 99%

https://www.datadoghq.com/blog/detect-inefficient-index-scans-with-dbm
Superficial Blamelessness

For many organizations, some form of blamelessness has become a more standard practice and blame-awareness has been gaining in popularity. However, there is an anti-pattern I have noticed as well, which I like to call superficial (or shallow) blamelessness that I think is important for people to be on the lookout for.


https://resilienceinsoftware.org/news/11502437
I Don’t Care if AI Wrote the Code. You Own It.

SREcon Chair Heinrich Hartmann on why the age of AI-assisted engineering demands a radical return to design rigor.


https://www.runllm.com/blog/i-dont-care-if-ai-wrote-the-code-you-own-it
Стартуем с Kubernetes без боли в Managed Kubernetes от MWS Cloud Platform.

27 мая в 16:00 Александр Курасов, технический владелец продукта в MWS Cloud Platform, покажет, как развернуть кластер за минуты, на вебинаре «Быстрый старт с Managed Kubernetes в облаке MWS».

Разберём архитектуру сервиса, его интеграцию с IAM, сетями и балансировщиками. Увидите, как управляемый сервис берёт на себя администрирование master-узлов и упрощает жизнь.

Будет интересно:

DevOps-инженерам, которые хотят упростить работу с Kubernetes
Backend-разработчикам, которым нужно быстро задеплоить сервис
Platform-инженерам, строящим cloud-native инфраструктуру
Техлидам и архитекторам, выбирающим Kubernetes в облаке

Зарегистрироваться
Please open Telegram to view this post
VIEW IN TELEGRAM
eraser

Eraser helps Kubernetes admins remove a list of non-running images from all Kubernetes nodes in a cluster.


https://github.com/eraser-dev/eraser
openhare

openhare is an AI-powered, cross-platform desktop SQL client with multi-database support, built for everyday development, data analysis, and DBA management workflows.


https://github.com/sjjian/openhare
codeburn

CodeBurn tracks token usage, cost, and performance across 19 AI coding tools. It breaks down spending by task type, model, tool, project, and provider so you can see exactly where your budget goes.


https://github.com/getagentseal/codeburn