DevOps&SRE Library
18.5K subscribers
459 photos
3 videos
2 files
5.03K links
Библиотека статей по теме DevOps и SRE.

Реклама: @ostinostin
Контент: @mxssl

РКН: https://www.gosuslugi.ru/snet/67704b536aa9672b963777b3
Download Telegram
Pulse

Pulse is a modern, unified dashboard for monitoring your infrastructure across Proxmox, Docker, and Kubernetes. It consolidates metrics, alerts, and AI-powered insights from all your systems into a single, beautiful interface.


https://github.com/rcourtman/Pulse
How we deploy the largest GitLab instance 12 times daily

Take a deep technical dive into GitLab.com's deployment pipeline, including progressive rollouts, Canary strategies, database migrations, and multiversion compatibility.


https://about.gitlab.com/blog/continuously-deploying-the-largest-gitlab-instance
It works on my cluster: a tale of two troubleshooters

https://octopus.com/blog/verifying-and-troubleshooting-kubernetes-deployments
Karpenter at Beekeeper by LumApps: Fun Stories

At the beginning of this year, we (Beekeeper by LumApps Engineering) decided to adopt Karpenter for our EKS (Kubernetes/K8s) workloads, replacing our previous node autoscaling setup that used cluster-autoscaler with a managed autoscaling group (ASG). We made this decision before the release and hype of EKS Auto Mode, which is why we chose to implement a self-managed Karpenter solution.


https://medium.com/beekeeper-technology-blog/karpenter-at-beekeeper-by-lumapps-fun-stories-7c55656f02b8
Extracting JVM Data from Crash-Looping Java Containers in Kubernetes

https://medium.com/@zelldon91/getting-data-out-of-burning-java-containers-6e0c8bb53eec
Intelligent Kubernetes Load Balancing at Databricks

Real-Time, Client-Side Load Balancing for Internal and Ingress Traffic in Kubernetes


https://www.databricks.com/blog/intelligent-kubernetes-load-balancing-databricks
Strengthen Kubernetes Security with Vault Agent Injector

https://hackernoon.com/strengthen-kubernetes-security-with-vault-agent-injector
build

Shipwright is an extensible framework for building container images on Kubernetes.


https://github.com/shipwright-io/build
kexa

Kexa is an open-source compliance management tool that simplifies security and compliance across multiple cloud platforms including Azure, Google Cloud, AWS, and more.


https://github.com/kexa-io/kexa
clickhouse-operator

The Altinity Kubernetes Operator for ClickHouse creates, configures and manages ClickHouse clusters running on Kubernetes.


https://github.com/Altinity/clickhouse-operator
ch-vmm

ch-vmm is a Kubernetes add-on for running Cloud Hypervisor virtual machines. By using Cloud Hypervisor as the underlying hypervisor, ch-vmm enables a lightweight and secure way to run fully virtualized workloads in a canonical Kubernetes cluster.


https://github.com/nalajala4naresh/ch-vmm
Enroll

Enroll inspects a Debian-like or RedHat-like system, harvests the state that matters, and generates Ansible roles/playbooks so you can bring snowflakes under management fast.


https://enroll.sh
PHP 8.5 benchmarks: The state of PHP performance across major CMSs and frameworks

PHP 8.5 has now been officially released, and developers naturally want to know what kind of performance improvements they can expect across popular CMSs and frameworks.

To find out, we benchmarked 13 widely used CMSs and frameworks, including WordPress, WooCommerce, Drupal, Joomla, Laravel, Symfony and CodeIgniter, on PHP 8.2, 8.3, 8.4, and 8.5 under identical conditions. WordPress was also tested on PHP 7.4, since a notable share of sites still run on that version.

Our intention is to provide a clear, practical look at how performance shifts across recent PHP releases and what you can expect when upgrading.


https://kinsta.com/blog/php-benchmarks
Finding the grain of sand in a heap of Salt

How do you find the root cause of a configuration management failure when you have a peak of hundreds of changes in 15 minutes on thousands of servers?

That was the challenge we faced as we built the infrastructure to reduce release delays due to failures of Salt, a configuration management tool. (We eventually reduced such failures on the edge by over 5%, as we’ll explain below.) We’ll explore the fundamentals of Salt, and how it is used at Cloudflare. We then describe the common failure modes and how they delay our ability to release valuable changes to serve our customers.

By first solving an architectural problem, we provided the foundation for self-service mechanisms to find the root cause of Salt failures on servers, datacenters and groups of datacenters. This system is able to correlate failures with git commits, external service failures and ad hoc releases. The result of this has been a reduction in the duration of software release delays, and an overall reduction in toilsome, repetitive triage for SRE.

To start, we will go into the basics of the Cloudflare network and how Salt operates within it. And then we’ll get to how we solved the challenge akin to finding a grain of sand in a heap of Salt.


https://blog.cloudflare.com/finding-the-grain-of-sand-in-a-heap-of-salt
Rethinking QA: From DevOps to Platform Engineering and SRE

A wake‑up call for QA to upskill for platform engineering and SRE, including cloud‑native practices, automation mastery, and system reliability at scale.


https://dzone.com/articles/rethinking-qa-from-devops-to-platform-engineering
Queue-Based Autoscaling Without Flapping: Rethinking App Scaling with K8s, KEDA, and RabbitMQ

https://blog.stackademic.com/autoscaling-with-message-queues-why-everyone-gets-it-wrong-with-kubernetes-keda-rabbitmq-and-f1a4c38e0df4
helm-controller

A simple way to manage helm charts with Custom Resource Definitions in k8s.


https://github.com/k3s-io/helm-controller
nxs-universal-chart

nxs-universal-chart is a Helm chart you can use to install any of your applications into Kubernetes/OpenShift and other orchestrators compatible with native Kubernetes API.


https://github.com/nixys/nxs-universal-chart
percona-xtradb-cluster-operator

Percona Operator for MySQL based on Percona XtraDB Cluster (PXC) automates the creation and management of highly available, enterprise-ready MySQL database clusters on Kubernetes.


https://github.com/percona/percona-xtradb-cluster-operator
1