#other #awesome #awesome_list #chaos #chaos_community #chaos_engineering #chaos_monkey #chaos_testing #netflix_chaos_monkey #resilience #simian_army #site_reliability_engineering
https://github.com/dastergon/awesome-chaos-engineering
GitHub - dastergon/awesome-chaos-engineering: A curated list of Chaos Engineering resources.
#javascript #devops #monitoring #best_practices #incident_response #site_reliability_engineering #post_mortem #reliability #alerting #on_call #dev_ops #sre #observability #incident_management #chaos_engineering #sre_team #sre_teams #sre_culture #sre_classroom
https://github.com/upgundecha/howtheysre
GitHub - upgundecha/howtheysre: A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE).
#other #devops #availability #list #awesome #monitoring #reliability_engineering #incident_response #site_reliability_engineering #production #post_mortem #capacity_planning #service_level_agreement #scalability #reliability #alerting #on_call #awesome_list #sre #postmortem #site_reliability
https://github.com/dastergon/awesome-sre
GitHub - dastergon/awesome-sre: A curated list of Site Reliability and Production Engineering resources.
#python #ai_sre #alerting #datadog #grafana #incident_management #observability #remediation #root_cause_analysis #site_reliability_engineering #slack #sre
OpenSRE is a free, open-source toolkit for building AI agents that help resolve production issues. It connects to 40+ tools, including Kubernetes, Datadog, Slack, and LLMs, then automatically fetches alerts, analyzes logs, metrics, and traces, identifies root causes with supporting evidence, suggests fixes, and posts updates. It installs with one command, and you can run tests and customize workflows on your own infrastructure. The project aims to save hours of incident debugging, cut downtime, and predict failures, so you can focus on building instead of firefighting.
https://github.com/Tracer-Cloud/opensre
GitHub - Tracer-Cloud/opensre: Build your own AI SRE agents. The open source toolkit for the AI era ✨