SLO Playbook

Service Level Objectives anchor reliability discussions. Here’s how we set them across the Dev, Productivity, and AI pods.

1. Pick the Right SLI First

Format: “Measure ≤ Target for % of requests over rolling window”.
Example: “Productivity automations p95 latency <= 700 ms for 99% of runs in a 28‑day window”.
Tie objective to user outcome (fast automations, reliable agents, etc.).

Weekly SLO review: status, burn, actions.
Monthly: re-evaluate if SLO still meaningful; adjust targets based on data + business goals.
Post-incident: decide whether threshold/indicator needs revision.

### SLO Card
- Service:
- Indicator:
- Target:
- Window:
- Alert policy:
- Owners:
- Links: dashboards, runbooks, repos

Use this playbook when onboarding a new service or revisiting objectives for an existing one.