#javascript #devops #monitoring #best_practices #incident_response #site_reliability_engineering #post_mortem #reliability #alerting #on_call #dev_ops #sre #observability #incident_management #chaos_engineering #sre_team #sre_teams #sre_culture #sre_classroom
https://github.com/upgundecha/howtheysre
https://github.com/upgundecha/howtheysre
GitHub
GitHub - upgundecha/howtheysre: A curated collection of publicly available resources on how technology and tech-savvy organizations…
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE) - upgundecha/howtheysre
#python #catalog #incident_response #playbook #cybersecurity #mitre #incident_management #incidents #contributions_welcome #mitre_attack #contributors_welcome #cybersecurity_playbook
https://github.com/austinsonger/Incident-Playbook
https://github.com/austinsonger/Incident-Playbook
GitHub
GitHub - austinsonger/Incident-Playbook: GOAL: Incident Response Playbooks Mapped to MITRE Attack Tactics and Techniques. [Contributors…
GOAL: Incident Response Playbooks Mapped to MITRE Attack Tactics and Techniques. [Contributors Friendly] - austinsonger/Incident-Playbook
#python #ai_sre #alerting #datadog #grafana #incident_management #observability #remediation #root_cause_analysis #site_reliability_engineering #slack #sre
OpenSRE is a free open-source tool to build AI agents that fix production issues fast. It connects to 40+ tools like Kubernetes, Datadog, Slack, and LLMs, then auto-fetches alerts, analyzes logs/metrics/traces, finds root causes with evidence, suggests fixes, and posts updates. Install easily with one command, run tests, and customize workflows on your infrastructure. This saves you hours on incident debugging, cuts downtime, and predicts failures—letting you focus on building instead of firefighting.
https://github.com/Tracer-Cloud/opensre
OpenSRE is a free open-source tool to build AI agents that fix production issues fast. It connects to 40+ tools like Kubernetes, Datadog, Slack, and LLMs, then auto-fetches alerts, analyzes logs/metrics/traces, finds root causes with evidence, suggests fixes, and posts updates. Install easily with one command, run tests, and customize workflows on your infrastructure. This saves you hours on incident debugging, cuts downtime, and predicts failures—letting you focus on building instead of firefighting.
https://github.com/Tracer-Cloud/opensre
GitHub
GitHub - Tracer-Cloud/opensre: Build your own AI SRE agents. The open source toolkit for the AI era ✨
Build your own AI SRE agents. The open source toolkit for the AI era ✨ - Tracer-Cloud/opensre
👍1