Pulse
https://github.com/rcourtman/Pulse
Pulse is a modern, unified dashboard for monitoring your infrastructure across Proxmox, Docker, and Kubernetes. It consolidates metrics, alerts, and AI-powered insights from all your systems into a single, beautiful interface.
https://github.com/rcourtman/Pulse
How we deploy the largest GitLab instance 12 times daily
https://about.gitlab.com/blog/continuously-deploying-the-largest-gitlab-instance
Take a deep technical dive into GitLab.com's deployment pipeline, including progressive rollouts, Canary strategies, database migrations, and multiversion compatibility.
https://about.gitlab.com/blog/continuously-deploying-the-largest-gitlab-instance
It works on my cluster: a tale of two troubleshooters
https://octopus.com/blog/verifying-and-troubleshooting-kubernetes-deployments
https://octopus.com/blog/verifying-and-troubleshooting-kubernetes-deployments
Karpenter at Beekeeper by LumApps: Fun Stories
https://medium.com/beekeeper-technology-blog/karpenter-at-beekeeper-by-lumapps-fun-stories-7c55656f02b8
At the beginning of this year, we (Beekeeper by LumApps Engineering) decided to adopt Karpenter for our EKS (Kubernetes/K8s) workloads, replacing our previous node autoscaling setup that used cluster-autoscaler with a managed autoscaling group (ASG). We made this decision before the release and hype of EKS Auto Mode, which is why we chose to implement a self-managed Karpenter solution.
https://medium.com/beekeeper-technology-blog/karpenter-at-beekeeper-by-lumapps-fun-stories-7c55656f02b8
Extracting JVM Data from Crash-Looping Java Containers in Kubernetes
https://medium.com/@zelldon91/getting-data-out-of-burning-java-containers-6e0c8bb53eec
https://medium.com/@zelldon91/getting-data-out-of-burning-java-containers-6e0c8bb53eec
Intelligent Kubernetes Load Balancing at Databricks
https://www.databricks.com/blog/intelligent-kubernetes-load-balancing-databricks
Real-Time, Client-Side Load Balancing for Internal and Ingress Traffic in Kubernetes
https://www.databricks.com/blog/intelligent-kubernetes-load-balancing-databricks
Connection Pooling for Postgres using PG Bouncer
https://medium.com/@pablo.lopez.santori/connection-pooling-for-postgres-using-pg-bouncer-175bc1607db2
https://medium.com/@pablo.lopez.santori/connection-pooling-for-postgres-using-pg-bouncer-175bc1607db2
Strengthen Kubernetes Security with Vault Agent Injector
https://hackernoon.com/strengthen-kubernetes-security-with-vault-agent-injector
https://hackernoon.com/strengthen-kubernetes-security-with-vault-agent-injector
build
https://github.com/shipwright-io/build
Shipwright is an extensible framework for building container images on Kubernetes.
https://github.com/shipwright-io/build
kexa
https://github.com/kexa-io/kexa
Kexa is an open-source compliance management tool that simplifies security and compliance across multiple cloud platforms including Azure, Google Cloud, AWS, and more.
https://github.com/kexa-io/kexa
clickhouse-operator
https://github.com/Altinity/clickhouse-operator
The Altinity Kubernetes Operator for ClickHouse creates, configures and manages ClickHouse clusters running on Kubernetes.
https://github.com/Altinity/clickhouse-operator
ch-vmm
https://github.com/nalajala4naresh/ch-vmm
ch-vmm is a Kubernetes add-on for running Cloud Hypervisor virtual machines. By using Cloud Hypervisor as the underlying hypervisor, ch-vmm enables a lightweight and secure way to run fully virtualized workloads in a canonical Kubernetes cluster.
https://github.com/nalajala4naresh/ch-vmm
Enroll
https://enroll.sh
Enroll inspects a Debian-like or RedHat-like system, harvests the state that matters, and generates Ansible roles/playbooks so you can bring snowflakes under management fast.
https://enroll.sh
PHP 8.5 benchmarks: The state of PHP performance across major CMSs and frameworks
https://kinsta.com/blog/php-benchmarks
PHP 8.5 has now been officially released, and developers naturally want to know what kind of performance improvements they can expect across popular CMSs and frameworks.
To find out, we benchmarked 13 widely used CMSs and frameworks, including WordPress, WooCommerce, Drupal, Joomla, Laravel, Symfony and CodeIgniter, on PHP 8.2, 8.3, 8.4, and 8.5 under identical conditions. WordPress was also tested on PHP 7.4, since a notable share of sites still run on that version.
Our intention is to provide a clear, practical look at how performance shifts across recent PHP releases and what you can expect when upgrading.
https://kinsta.com/blog/php-benchmarks
Finding the grain of sand in a heap of Salt
https://blog.cloudflare.com/finding-the-grain-of-sand-in-a-heap-of-salt
How do you find the root cause of a configuration management failure when you have a peak of hundreds of changes in 15 minutes on thousands of servers?
That was the challenge we faced as we built the infrastructure to reduce release delays due to failures of Salt, a configuration management tool. (We eventually reduced such failures on the edge by over 5%, as we’ll explain below.) We’ll explore the fundamentals of Salt, and how it is used at Cloudflare. We then describe the common failure modes and how they delay our ability to release valuable changes to serve our customers.
By first solving an architectural problem, we provided the foundation for self-service mechanisms to find the root cause of Salt failures on servers, datacenters and groups of datacenters. This system is able to correlate failures with git commits, external service failures and ad hoc releases. The result of this has been a reduction in the duration of software release delays, and an overall reduction in toilsome, repetitive triage for SRE.
To start, we will go into the basics of the Cloudflare network and how Salt operates within it. And then we’ll get to how we solved the challenge akin to finding a grain of sand in a heap of Salt.
https://blog.cloudflare.com/finding-the-grain-of-sand-in-a-heap-of-salt
Rethinking QA: From DevOps to Platform Engineering and SRE
https://dzone.com/articles/rethinking-qa-from-devops-to-platform-engineering
A wake‑up call for QA to upskill for platform engineering and SRE, including cloud‑native practices, automation mastery, and system reliability at scale.
https://dzone.com/articles/rethinking-qa-from-devops-to-platform-engineering
Queue-Based Autoscaling Without Flapping: Rethinking App Scaling with K8s, KEDA, and RabbitMQ
https://blog.stackademic.com/autoscaling-with-message-queues-why-everyone-gets-it-wrong-with-kubernetes-keda-rabbitmq-and-f1a4c38e0df4
https://blog.stackademic.com/autoscaling-with-message-queues-why-everyone-gets-it-wrong-with-kubernetes-keda-rabbitmq-and-f1a4c38e0df4
helm-controller
https://github.com/k3s-io/helm-controller
A simple way to manage helm charts with Custom Resource Definitions in k8s.
https://github.com/k3s-io/helm-controller
nxs-universal-chart
https://github.com/nixys/nxs-universal-chart
nxs-universal-chart is a Helm chart you can use to install any of your applications into Kubernetes/OpenShift and other orchestrators compatible with native Kubernetes API.
https://github.com/nixys/nxs-universal-chart
percona-xtradb-cluster-operator
https://github.com/percona/percona-xtradb-cluster-operator
Percona Operator for MySQL based on Percona XtraDB Cluster (PXC) automates the creation and management of highly available, enterprise-ready MySQL database clusters on Kubernetes.
https://github.com/percona/percona-xtradb-cluster-operator
1