Question 1

What's the difference between an SLI, SLO, and SLA?

Accepted Answer

SLI (Service Level Indicator) is the measurement itself — p95 latency, 5xx error rate, request success ratio. SLO (Service Level Objective) is the target you set internally for that SLI — 'p95 under 300ms over rolling 28 days'. SLA (Service Level Agreement) is the contractual promise to customers, usually with financial penalties if missed. Every SLA implies an SLO and an SLI; not every SLO has an SLA.
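To make the SLI/SLO distinction concrete, here is a minimal Python sketch. It assumes you already have raw HTTP status codes and latencies for a window; the function names are illustrative and not part of any particular tool. The SLIs are the measured values, and the SLO is only the comparison against a target.

```python
def success_ratio(statuses):
    """SLI: fraction of requests that did not return a 5xx status."""
    good = sum(1 for s in statuses if s < 500)
    return good / len(statuses)


def p95(latencies_ms):
    """SLI: 95th-percentile latency, nearest-rank method."""
    ordered = sorted(latencies_ms)
    # Integer ceiling of 0.95 * n, avoiding floating-point rounding.
    rank = (95 * len(ordered) + 99) // 100
    return ordered[rank - 1]


def meets_slo(sli_value, target, higher_is_better):
    """SLO: compare a measured SLI against the internal target."""
    if higher_is_better:
        return sli_value >= target
    return sli_value <= target


# Hypothetical sample window: 1000 requests, two of them 5xx.
statuses = [200] * 998 + [500, 503]
latencies = list(range(1, 101))  # 1ms .. 100ms

availability_ok = meets_slo(success_ratio(statuses), 0.999, higher_is_better=True)
latency_ok = meets_slo(p95(latencies), 300, higher_is_better=False)
```

An SLA would sit on top of this as a contract: the same measurement, usually with a looser target than the internal SLO.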

Question 2

Do I need an SLA for my Laravel app?

Accepted Answer

Only if you're selling enterprise contracts where customers demand uptime guarantees. Internal SLOs are far more useful — they tell your team when to invest in reliability versus new features. SLAs introduce legal and financial complexity. Start with SLOs, graduate to SLAs when customers require it.

Question 3

What's an 'error budget'?

Accepted Answer

The complement of an SLO: the share of failures the SLO still permits. If your SLO is 99.9% success rate, your error budget is 0.1% failures. Over a 30-day window (43,200 minutes) that's 43.2 minutes of full outage; over the rolling 28-day window used elsewhere on this page it's about 40.3 minutes. Spend the budget on risky deploys or new feature velocity when it's plentiful; slow the pace of change when it's running out. The error budget framing turns SLOs from pass/fail gates into a currency.
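The arithmetic behind the error budget is easy to check. The sketch below is illustrative only: the function names are made up, and the SLO is assumed to be a fraction such as 0.999.

```python
def error_budget_minutes(slo, window_days):
    """Minutes of full outage the SLO allows over the window."""
    window_minutes = window_days * 24 * 60
    return (1 - slo) * window_minutes


def budget_remaining(slo, total_requests, failed_requests):
    """Fraction of the request-based error budget still unspent (can go negative)."""
    allowed_failures = (1 - slo) * total_requests
    return 1 - failed_requests / allowed_failures


print(round(error_budget_minutes(0.999, 30), 1))  # 43.2
print(round(error_budget_minutes(0.999, 28), 2))  # 40.32
```

A request-based view is often more useful than minutes: with a 99.9% SLO and 1,000,000 requests in the window, 250 failures leaves roughly 75% of the budget unspent.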

Question 4

What SLO should I set for a Laravel app?

Accepted Answer

Depends on the endpoint. Common starting points: 99% success rate over rolling 28 days for non-critical routes, 99.9% for payment/auth/checkout paths. Latency SLOs: p95 under 500ms for UI render endpoints, under 200ms for JSON API endpoints. These are rough floors — tune based on what users actually complain about.

Question 5

Should I alert on SLO burn or on raw SLI breach?

Accepted Answer

SLO burn rate is better for noise reduction. Alerting on any p95 spike is noisy; alerting when you've burned 5% of your monthly error budget in the last hour is signal. Google's Site Reliability Workbook documents canonical multiwindow burn-rate alert configurations, for example paging when 2% of a 30-day budget is consumed within one hour (a burn rate of 14.4). Raw threshold alerts have their place for novel events (first occurrence of a new exception), but daily ops alerts should be burn-based.
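A burn rate is the observed error rate divided by the error rate the SLO allows; a burn rate of 1 spends the budget exactly over the full period. The sketch below assumes a 30-day budget period for "monthly" and uses the 5%-in-one-hour threshold from the answer above as a default. All names are illustrative.

```python
def burn_rate(error_rate, slo):
    """How many times faster than 'sustainable' the budget is being spent."""
    return error_rate / (1 - slo)


def budget_consumed(error_rate, slo, window_hours, period_days=30):
    """Fraction of the period's error budget spent during the window."""
    period_hours = period_days * 24
    return burn_rate(error_rate, slo) * window_hours / period_hours


def should_page(error_rate, slo, window_hours=1, threshold=0.05, period_days=30):
    """Page when the window alone consumed at least `threshold` of the budget."""
    return budget_consumed(error_rate, slo, window_hours, period_days) >= threshold


# With a 99.9% SLO, a 5% error rate over the last hour is a burn rate of
# about 50 and spends close to 7% of a 30-day budget: page.
# A 0.5% error rate is a burn rate of about 5 and spends under 1%: no page.
page_now = should_page(error_rate=0.05, slo=0.999)
stay_quiet = not should_page(error_rate=0.005, slo=0.999)
```

Under these assumptions, spending 5% of a 30-day budget in one hour corresponds to a burn rate of 36 (0.05 × 720 hours).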

Question 6

Can NightOwl enforce SLOs?

Accepted Answer

Not yet as a first-class feature. You can approximate with threshold alerts on request latency and error rates — configure alert channels to page when a route's p95 exceeds its SLO for 10+ minutes. Full SLO tooling with error budgets and burn rates is a feature on the roadmap, not shipping today.
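The "exceeds its SLO for 10+ minutes" rule is a sustained-threshold check. The sketch below shows only the generic logic; it is not NightOwl's API or configuration format, and it assumes you can obtain one p95 value per minute for a route.

```python
def sustained_breach(p95_per_minute_ms, slo_ms, minutes=10):
    """True if each of the most recent `minutes` samples exceeds the SLO threshold."""
    if len(p95_per_minute_ms) < minutes:
        return False  # not enough data to call it sustained
    recent = p95_per_minute_ms[-minutes:]
    return all(value > slo_ms for value in recent)


# Hypothetical per-minute p95 samples for one route, oldest first.
spike = [180, 190, 450, 185, 175, 190, 180, 170, 185, 190]
sustained = [180] + [350] * 10

print(sustained_breach(spike, slo_ms=300))      # False
print(sustained_breach(sustained, slo_ms=300))  # True
```

Requiring every sample in the window to breach filters out one-off spikes, at the cost of reacting a full window late.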

Endpoint class	Latency SLO	Availability SLO
Auth, checkout, payment	p95 < 300ms	99.9% / 28 days
JSON API (list / read)	p95 < 200ms	99.5% / 28 days
Page render (dashboard)	p95 < 500ms	99.5% / 28 days
Internal admin	p95 < 1000ms	99% / 28 days
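One way to make a table like this actionable is to keep it as data next to the code that evaluates it. The sketch below mirrors the four rows above; the dictionary keys and the function name are invented for illustration, and every target is assumed to apply over a rolling 28-day window.

```python
# Targets copied from the table; keys are hypothetical endpoint-class names.
SLO_TARGETS = {
    "critical":       {"p95_ms": 300,  "availability": 0.999},  # auth, checkout, payment
    "json_api":       {"p95_ms": 200,  "availability": 0.995},  # list / read
    "page_render":    {"p95_ms": 500,  "availability": 0.995},  # dashboard
    "internal_admin": {"p95_ms": 1000, "availability": 0.99},
}


def evaluate(endpoint_class, measured_p95_ms, measured_availability):
    """Compare measured SLIs for one endpoint class against its targets."""
    target = SLO_TARGETS[endpoint_class]
    return {
        "latency_ok": measured_p95_ms < target["p95_ms"],
        "availability_ok": measured_availability >= target["availability"],
    }


result = evaluate("json_api", measured_p95_ms=240, measured_availability=0.997)
# result == {"latency_ok": False, "availability_ok": True}
```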
