incident-runbook-templates

by Unknown v1.0.0

This skill provides production-ready templates for creating incident response runbooks, covering all phases from detection and triage to mitigation, resolution, and communication. It helps users establish standardized incident response procedures, build service-specific runbooks, define escalation paths, and document recovery procedures. The skill includes core concepts like incident severity levels and runbook structure, along with example runbooks and best practices for effective incident management.

Use this skill when building new incident response procedures, onboarding on-call engineers, or responding to active incidents. The templates provide a solid foundation and ensure that all critical aspects of incident management are addressed. They also aid in quickly identifying the impact, potential causes, and appropriate mitigation steps during an incident.

This skill provides templates and instructions, but requires access to infrastructure and tooling such as Kubernetes, databases, monitoring systems and communication platforms to fully execute the procedures.

What It Does

Provides templates and guidance to create structured incident response runbooks with pre-defined steps, escalation paths, and recovery actions, enhancing incident management efficiency.

When To Use

When creating new incident response procedures, building service-specific runbooks, establishing escalation paths, documenting recovery procedures, responding to active incidents, or onboarding on-call engineers.

Installation

Copy SKILL.md to your skills directory

View Universal documentation

Have a Skill to Share?

Join the community and help AI agents learn new capabilities. Submit your skill and reach thousands of developers.