Incident Response Playbook for Engineering Teams
Structured process for detection, response, communication, and postmortem for production incidents.
Establish a clear incident response process that minimizes detection and resolution time (MTTR), protects user experience, and generates learnings for the team.
At a glance
Access
Free prompt
Open to copy without upgrading.
Prompt objective
Establish a clear incident response process that minimizes detection and resolution time (MTTR), protects user experience, and generates learnings for the team.
Real use case
A delivery platform experienced [NUMBER] severe incidents in the past month. In each case, the team took over 30 minutes to understand what was happening, nobody knew who should lead the response, and customer communication was chaotic. They need a structured playbook.
Customize these fields first
Replace the placeholders with your own context before you run the prompt. That usually improves the first output more than adding more instructions later.
Prompt
Create an incident response playbook for [COMPANY NAME], a [COMPANY TYPE] with [NUMBER] active users and [UPTIME]% SLA.\\\\\\\\n\\\\\\\\n**Context:**\\\\\\\\n- Engineering team: [NUMBER] engineers\\\\\\\\n- Stack: [TECHNOLOGY LIST]\\\\\\\\n- Support hours: [24/7 or BUSINESS HOURS]\\\\\\\\n- Recent incidents: [DESCRIBE 2-3 INCIDENTS]\\\\\\\\n\\\\\\\\n**1) Severity Classification:**\\\\\\\\n\\\\\\\\n| Level | Criteria | Example | Response SLA |\\\\\\\\n|-------|----------|---------|-------------|\\\\\\\\n| SEV1 — Critical | [Criteria] | [Example] | [Time] |\\\\\\\\n| SEV2 — High | [Criteria] | [Example] | [Time] |\\\\\\\\n| SEV3 — Medium | [Criteria] | [Example] | [Time] |\\\\\\\\n| SEV4 — Low | [Criteria] | [Example] | [Time] |\\\\\\\\n\\\\\\\\n**2) Detection and Escalation:**\\\\\\\\n- Automated alerts that trigger (Prometheus/Datadog/Sentry)\\\\\\\\n- Who gets notified by severity (on-call rotation)\\\\\\\\n- How to activate the Incident Commander (IC)\\\\\\\\n- Communication channels (dedicated Slack channel, bridge call)\\\\\\\\n\\\\\\\\n**3) Response Process (by severity):**\\\\\\\\n\\\\\\\\n**SEV1/SEV2:**\\\\\\\\n- Minute 0-5: IC takes over, creates channel #incident-YYYY-MM-DD\\\\\\\\n- Minute 5-15: Initial diagnosis (dashboards, logs, git blame recent deploy)\\\\\\\\n- Minute 15-30: Decision — rollback vs. hotfix vs. feature flag\\\\\\\\n- Every 30 min: Status update to stakeholders\\\\\\\\n- Resolution: verify with smoke tests before declaring resolved\\\\\\\\n\\\\\\\\n**4) Communication:**\\\\\\\\n- Internal message template (for the team)\\\\\\\\n- External message template (for customers — status page)\\\\\\\\n- Who approves external communication\\\\\\\\n- Update frequency by severity\\\\\\\\n- Status page channel (Statuspage.io, Instatus, or custom)\\\\\\\\n\\\\\\\\n**5) Postmortem (required for SEV1/SEV2):**\\\\\\\\n- Detailed incident timeline\\\\\\\\n- Root cause analysis (5 Whys)\\\\\\\\n- Contributing factors\\\\\\\\n- What went well\\\\\\\\n- Action items with owner and deadline\\\\\\\\n- Postmortem document template\\\\\\\\n- Review meeting (blameless — focus on systems, not people)\\\\\\\\n\\\\\\\\n**6) Runbooks by Scenario:**\\\\\\\\nCreate specific runbooks for:\\\\\\\\n- [SCENARIO 1: e.g. database out of space]\\\\\\\\n- [SCENARIO 2: e.g. API returning 500 errors globally]\\\\\\\\n- [SCENARIO 3: e.g. unexpected traffic spike]\\\\\\\\n- [SCENARIO 4: e.g. security breach]\\\\\\\\n\\\\\\\\nEach runbook: symptoms, diagnostic commands, resolution steps, who to call.\\\\\\\\n\\\\\\\\n**7) Incident Metrics:**\\\\\\\\n- MTTD (Mean Time to Detect)\\\\\\\\n- MTTR (Mean Time to Resolve)\\\\\\\\n- MTBF (Mean Time Between Failures)\\\\\\\\n- Incidents per month (trend)\\\\\\\\n- % of action items completed on time
Open directly in an AI — the text is pre-filled:
How to use this prompt
- 1Replace the key placeholders first: COMPANY NAME, COMPANY TYPE, NUMBER, UPTIME.
- 2Replace any bracketed placeholders like [this] with your own context.
- 3Add extra background information when you want more tailored results.
- 4Combine multiple prompts in one conversation when you need a richer output.
- 5Save your best-performing prompts so they are easy to reuse later.
Next best step
Open the guide first, then branch only if you still need more.
A guide for technical builders choosing between prompts, coding workflows, and agent-based implementation.
If this prompt is close but not quite right, generate variants next. If the job is recurring, move into the course library after the guide.
Related prompts
View allComplete CI/CD Pipeline with GitHub Actions for Next.js Applications
Automated pipeline configuration with tests, build, preview deploys, and production deployment.
Best for
Automate the entire software delivery lifecycle with GitHub Actions, from push to production deployment, including tests, code analysis, and preview environments.
Docker Containerization and Docker Compose Orchestration for Production
Optimized Dockerfiles and docker-compose for development and production environments.
Best for
Create a containerized environment that ensures parity between development and production, with optimized builds, multi-stage builds, and security configurations.
Monitoring and Observability Setup with Grafana, Prometheus, and Alerts
Complete monitoring stack setup with metrics, logs, and alerts for production applications.
Best for
Implement full observability that detects issues before users notice them, with actionable dashboards and intelligent alerting.
Infrastructure as Code with Terraform for AWS/Hetzner
Automated cloud infrastructure provisioning with Terraform, reusable modules, and state management.
Best for
Automate provisioning of all infrastructure required for a production application, ensuring reproducibility, version control, and compliance.
Explore other prompt categories
Move sideways into adjacent libraries when the current category is not the full answer.
Free browsing stays open. Premium prompts unlock the reusable workflow layer.
Use the guides and role paths to validate the job first. Upgrade when you want the full prompt text, editable premium prompts, and the surrounding course paths in one place.
Free access
- Browse guides, role paths, and category pages.
- Preview prompts before you decide to upgrade.
- Find the right starting point without friction.
Membership access
- Unlock premium prompts and the full copy text.
- See more workflow paths and course connections.
- Keep the reusable templates in one place.