pgque
https://github.com/NikolayS/pgque
PgQue brings back PgQ — one of the longest-running Postgres queue architectures in production — in a form that runs on any Postgres platform, managed providers included.
PgQ was designed at Skype to run messaging for hundreds of millions of users, and it ran on large self-managed Postgres deployments for over a decade. Standard PgQ depends on a C extension (pgq) and an external daemon (pgqd), neither of which runs on most managed Postgres providers.
PgQue rebuilds that battle-tested engine in pure PL/pgSQL, so the zero-bloat queue pattern works anywhere you can run SQL — without adding another distributed system to your stack.
The anti-extension. Pure SQL + PL/pgSQL on any Postgres 14+ — including RDS, Aurora, Cloud SQL, AlloyDB, Supabase, Neon, and most other managed providers. No C extension, no shared_preload_libraries, no provider approval, no restart.
Hidden Infrastructure Challenges in Distributed LLM Inference on Kubernetes
https://substack.com/home/post/p-188586336
Chapter 1: A networking story
Solve DevOps, SRE, and FinOps tasks with a cloud AI assistant 💬
A major update from Cloud.ru. What's new:
1⃣ Several VMs in different configurations at once
The AI assistant in the cloud can now create several virtual machines and then manage them on command, for example adding or removing disks, changing configurations, and performing other routine operations.
2⃣ Three new scenarios
▶ DevOps agent: deploys and maintains PostgreSQL, Kafka, WordPress, GitLab, and other popular services from a text prompt.
▶ SRE agent: sets up monitoring and alerting and helps investigate incidents.
▶ FinOps agent: finds forgotten or unused VMs and suggests deleting them to cut wasted spend. It can also show the most expensive resources and compare spending across periods.
👉 Try it
Simplifying Model Serving with Kubernetes and Ray: Inside DoubleVerify’s ML Platform
https://medium.com/doubleverify-engineering/simplifying-model-serving-with-kubernetes-and-ray-inside-doubleverifys-ml-platform-78b33faa9e91
chainplane
https://github.com/tazhate/chainplane
A Kubernetes operator for deploying and managing blockchain full nodes. Supports 102 chains with built-in health monitoring, snapshot bootstrapping, and automatic recovery.
Lazy-Pulling Container Images: A Deep Dive Into OCI Seekability
https://blog.zmalik.dev/p/lazy-pulling-container-images-a-deep
From DEFLATE dependency chains to FUSE mounts: how a few competing approaches make container layers randomly accessible, and what they all require you to change on every node.
Building eBPF-Based Bandwidth Limiting in AWS Network Policy Agent — Why Vibe Coding Isn’t Enough
https://medium.com/@jayanthvn_55441/building-ebpf-based-bandwidth-limiting-in-aws-network-policy-agent-why-vibe-coding-isnt-enough-f8c6681aa278
Hardware-Backed TLS Certificates with cert-manager and YubiHSM 2
https://charles.dev/blog/yubihsm-cert-manager
Your cert-manager CA key is one kubectl get secret away from being stolen. It's a base64-encoded blob sitting in etcd, and anyone with the right RBAC can read it, copy it, and use it to sign certificates for any service in your cluster.
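To illustrate why a base64-encoded blob offers no protection on its own, here is a minimal sketch. The `secret_data` dictionary is a hypothetical stand-in for the `data` map of a TLS Secret, and the key material is a placeholder string, not a real key. The point is only that base64 is an encoding: anyone who can read the field can reverse it with one standard-library call.

```python
import base64

# Hypothetical stand-in for the `data` map of a Kubernetes TLS Secret.
# The PEM body below is a placeholder, not real key material.
placeholder_pem = (
    b"-----BEGIN PRIVATE KEY-----\n"
    b"placeholder\n"
    b"-----END PRIVATE KEY-----\n"
)
secret_data = {"tls.key": base64.b64encode(placeholder_pem).decode("ascii")}


def decode_secret_field(data: dict, field: str) -> bytes:
    """Return the raw bytes stored in one base64-encoded Secret field."""
    return base64.b64decode(data[field])


# No key or password is needed to recover the original bytes.
recovered = decode_secret_field(secret_data, "tls.key")
print(recovered.decode("ascii"))
```

This is the motivation for keeping the signing key in hardware, where the private bytes never appear in a readable field at all.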
Mastering KEDA on GKE: A Deep Dive into Event-Driven Autoscaling
https://saeed.hashnode.dev/keda-on-gke
Event Driven Scaling and How to Fix It When It Breaks
ing-switch: Migrate from Ingress NGINX to Traefik or Gateway API in Minutes, Not Days
https://blog.kubesimplify.com/ing-switch-migrate-from-ingress-nginx-to-traefik-or-gateway-api-in-minutes-not-days
warden
https://github.com/stephnangue/warden
The open-source egress gateway for AI agents — every API call is authenticated, authorized, and audited. No credentials ever reach the agent.
aibrix
https://github.com/vllm-project/aibrix
Cost-efficient and pluggable infrastructure components for GenAI inference
kloudlite
https://github.com/kloudlite/kloudlite
Kloudlite provides cloud-based development workspaces with live service connectivity. Think Telepresence meets cloud IDEs — but with per-developer environment ownership, instant environment switching, and cross-team collaboration built in.
cpg
https://github.com/SoulKyu/cpg
Cilium Policy Generator -- because writing CiliumNetworkPolicies by hand in a default-deny cluster is nobody's idea of a good Friday night.
x509-certificate-exporter
https://github.com/enix/x509-certificate-exporter
A Prometheus exporter for certificates focusing on expiration monitoring, written in Go. Designed to monitor Kubernetes clusters from inside, it can also be used as a standalone exporter.
sish
https://github.com/antoniomika/sish
Open source SSH tunneling for HTTP(S), WS(S), TCP, aliases, and SNI.
If you like the simplicity of serveo/ngrok-style sharing but want to use plain SSH and run your own infrastructure, sish is built for that.
Forwarded from AvitoTech
Ah, if only… But for now we work with what SRE reality gives us ↖️
The team behind the podcast «В SREду на кухне» devoted a whole episode to the error budget. Together with Kirill Borisov, a team lead at VK, they discussed:
🔸 what an error budget is and whether you can live without one;
🔸 how to explain to the business why it is needed;
🔸 how to calculate it;
🔸 why perfect reliability is a dangerous illusion and a myth;
🔸 how metrics help simplify the calculation.
Watch and listen via the links:
📱 YouTube
📱 VK
📱 Rutube
#sre
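For readers new to the topic, here is a minimal sketch of the common time-based error budget calculation. The post does not describe the method covered in the episode, so this is the widely used textbook definition (budget = (1 − SLO) × window), and the function names are hypothetical.

```python
def error_budget_minutes(slo: float, window_days: int) -> float:
    """Allowed downtime in minutes for an availability SLO over a window.

    `slo` is a fraction, e.g. 0.999 for "three nines".
    """
    total_minutes = window_days * 24 * 60
    return (1.0 - slo) * total_minutes


def budget_remaining(slo: float, window_days: int, downtime_minutes: float) -> float:
    """Fraction of the error budget still unspent (negative if overspent)."""
    budget = error_budget_minutes(slo, window_days)
    return (budget - downtime_minutes) / budget


# A 99.9% SLO over 30 days allows roughly 43.2 minutes of downtime.
print(round(error_budget_minutes(0.999, 30), 1))
```

A request-based variant replaces minutes with request counts: the budget is then the number of requests allowed to fail in the window.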
Advertisement. Advertiser: ООО «Авито Тех». erid: 2Vtzqx6K4Ns
How we built a real-world evaluation platform for autonomous SRE agents at scale
https://www.datadoghq.com/blog/engineering/bits-ai-eval-platform
Bits AI SRE is Datadog’s autonomous agent for investigating production incidents. It reasons across metrics, logs, traces, infrastructure metadata, network telemetry, monitor configuration, and more to determine, triage, and remediate the root cause of an issue.
otel-cardinality-processor
https://github.com/YElayyat/otel-cardinality-processor
An OpenTelemetry Collector processor that catches metric cardinality explosions before they reach your TSDB.
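As a rough illustration of what a cardinality explosion means, the sketch below counts distinct label sets per metric name and flags any metric that exceeds a limit. This is not the processor's actual algorithm or configuration, which the description above does not specify; the function name, input shape, and `limit` parameter are all assumptions made for the example.

```python
from collections import defaultdict


def find_high_cardinality(samples, limit):
    """Return {metric_name: series_count} for metrics above `limit`.

    `samples` is an iterable of (metric_name, labels_dict) pairs.
    Each distinct combination of label values is one time series.
    """
    series = defaultdict(set)
    for name, labels in samples:
        # frozenset makes the label set hashable and order-independent.
        series[name].add(frozenset(labels.items()))
    return {name: len(s) for name, s in series.items() if len(s) > limit}


# A per-user label creates one series per user; a status label stays small.
samples = [("http_requests", {"user_id": str(i)}) for i in range(5)]
samples += [("http_errors", {"status": s}) for s in ("500", "503")]
offenders = find_high_cardinality(samples, limit=3)
print(offenders)
```

In a real pipeline the check would run on a bounded window and use an approximate counter to keep memory flat, but the exact-set version shows the idea.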
otelite
https://github.com/planetf1/otelite
Lightweight OpenTelemetry receiver and dashboard for local development
Otelite is a single-binary observability tool that receives OpenTelemetry data (logs, traces, metrics) and provides a web dashboard and terminal UI for viewing it. Designed for local LLM development with minimal resource usage (<100MB memory, <5% CPU), it starts in seconds and requires no external dependencies.