AdvancedDevOpsFree prompt

Monitoring and Observability Setup with Grafana, Prometheus, and Alerts

Complete monitoring stack setup with metrics, logs, and alerts for production applications.

Implement full observability that detects issues before users notice them, with actionable dashboards and intelligent alerting.

At a glance

Access

Free prompt

Open to copy — no account or payment needed.

Prompt objective

Implement full observability that detects issues before users notice them, with actionable dashboards and intelligent alerting.

Real use case

A SaaS platform serving [NUMBER] active users experiences database storage issues at 3am with no alerting in place. The on-call team only discovers the problem when customers start reporting issues at 8am, resulting in significant user churn and revenue loss. They need proactive monitoring to catch infrastructure problems before business impact.

Customize these fields first

PROJECT NAMEAPPLICATION TYPENUMBERDOCKER/KUBERNETES/VPSLIST: e.g. Node.js API, PostgreSQL, Redis, Nginx, Workers

Replace the placeholders with your own context before you run the prompt. That usually improves the first output more than adding more instructions later.

Prompt

Configure a complete observability stack for [PROJECT NAME], a [APPLICATION TYPE] application with [NUMBER] active users running on [DOCKER/KUBERNETES/VPS].

**Services to monitor:**
- [LIST: e.g. Node.js API, PostgreSQL, Redis, Nginx, Workers]
- Infrastructure: CPU, memory, disk, network

**1) Metrics (Prometheus + Grafana):**

**Prometheus:**
- prometheus.yml configuration with targets
- Scrape interval per service
- Required exporters: node_exporter, postgres_exporter, redis_exporter
- Application custom metrics (prom-client for Node.js):
  - `http_requests_total` (counter by route, method, status)
  - `http_request_duration_seconds` (histogram)
  - `active_connections` (gauge)
  - `business_events_total` (counter: signups, orders, payments)
  - `queue_size` (gauge per queue)
- Retention and storage sizing

**Grafana Dashboards:**
- Dashboard 1: Overview (uptime, request rate, error rate, latency p50/p95/p99)
- Dashboard 2: Infrastructure (CPU, RAM, disk, network per container)
- Dashboard 3: Database (connections, query duration, cache hit ratio, dead tuples)
- Dashboard 4: Business (revenue/hour, conversions, churn indicators)
- For each dashboard: exportable JSON with template variables

**2) Logs (Grafana Loki or ELK):**
- Structured log format (JSON)
- Correct log levels: ERROR (failures), WARN (degradation), INFO (events), DEBUG (dev)
- Correlation ID per request (trace ID)
- Log rotation and retention policy
- Query examples for common troubleshooting scenarios

**3) Alerts (Alertmanager):**
- Alert rules by severity:
  - **CRITICAL** (PagerDuty/SMS): downtime, error rate > 5%, disk > 95%
  - **WARNING** (Slack/Email): latency p95 > 2s, CPU > 80%, memory > 85%
  - **INFO** (Slack): deploy completed, backup finished, cron executed
- Routing: who receives which alert (on-call rotation)
- Silencing and inhibition rules
- Runbooks linked to each alert (what to do when triggered)

**4) Uptime Monitoring:**
- Standardized health check endpoints (/health, /ready)
- External ping (UptimeRobot/Better Uptime)
- Public status page for customers

**5) docker-compose for the monitoring stack:**
- Prometheus + Grafana + Loki + Alertmanager
- Persistent volumes for data
- Network configuration

Provide all configuration files and dashboard JSON exports.

Open directly in an AI — the text is pre-filled:

Open in ChatGPT Open in Claude Open in Gemini

How to use this prompt

1Replace the key placeholders first: PROJECT NAME, APPLICATION TYPE, NUMBER, DOCKER/KUBERNETES/VPS.
2Replace any bracketed placeholders like [this] with your own context.
3Add extra background information when you want more tailored results.
4Combine multiple prompts in one conversation when you need a richer output.
5Save your best-performing prompts so they are easy to reuse later.

Next best step

Open the guide first, then branch only if you still need more.

A guide for technical builders choosing between prompts, coding workflows, and agent-based implementation.

If this prompt is close but not quite right, generate variants next. If the job is recurring, move into the course library after the guide.

Open the guide Generate variants

Developer path Browse courses

Related prompts

View all

Complete CI/CD Pipeline with GitHub Actions for Next.js Applications

Automated pipeline configuration with tests, build, preview deploys, and production deployment.

IntermediateFree prompt

Best for

Automate the entire software delivery lifecycle with GitHub Actions, from push to production deployment, including tests, code analysis, and preview environments.

Copy-ready promptOpen prompt

Docker Containerization and Docker Compose Orchestration for Production

Optimized Dockerfiles and docker-compose for development and production environments.

IntermediateFree prompt

Best for

Create a containerized environment that ensures parity between development and production, with optimized builds, multi-stage builds, and security configurations.

Copy-ready promptOpen prompt

Infrastructure as Code with Terraform for AWS/Hetzner

Automated cloud infrastructure provisioning with Terraform, reusable modules, and state management.

AdvancedFree prompt

Best for

Automate provisioning of all infrastructure required for a production application, ensuring reproducibility, version control, and compliance.

Copy-ready promptOpen prompt

Incident Response Playbook for Engineering Teams

Structured process for detection, response, communication, and postmortem for production incidents.

BeginnerFree prompt

Best for

Establish a clear incident response process that minimizes detection and resolution time (MTTR), protects user experience, and generates learnings for the team.

Copy-ready promptOpen prompt

Explore other prompt categories

Move sideways into adjacent libraries when the current category is not the full answer.

📊Data Analysis 🎨Design & UX 📋Project Management View all categories

Every prompt here is free. The course teaches the thinking behind them.

Copy as many prompts as you like. When you want to move from single prompts to a repeatable AI workflow, Learn AI in 30 Days walks through it, one day at a time.

Get the course See the 30-day curriculum first

Buy the course once ($15/$20 by length), or go all-access for $10/mo with a verifiable certificate.