[ GLOSSARY ]

Head-based vs tail-based trace sampling

QUICK ANSWER

What's the difference between head-based and tail-based sampling?

Head-based sampling decides whether to keep a trace before the trace completes, usually as a fixed percentage keyed to the trace ID. Cheap and simple, but you throw away traces without knowing whether they were interesting. Tail-based sampling buffers the whole trace and decides after it finishes, so you can keep every error and every slow trace while sampling the normal ones. Smarter, but operationally heavier.

Updated · 2026-04-13

Why sample at all

A fully-instrumented request generates 5-20 spans. At 1,000 req/s that's roughly 86 million requests a day, or 430 million to 1.7 billion spans. At ~1 KB per span that's roughly 0.4-1.7 TB/day, before retention. Storage adds up quickly; sampling trades data completeness for lower cost.
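
As a back-of-envelope check on those figures (illustrative numbers only, taken from the assumptions above):

php
// Rough trace volume for the assumptions above:
// 1,000 req/s, ~10 spans per request (midpoint of 5-20), ~1 KB per span.
$requestsPerSecond = 1_000;
$spansPerRequest   = 10;
$bytesPerSpan      = 1_024;

$spansPerDay = $requestsPerSecond * 86_400 * $spansPerRequest; // ~864 million spans
$bytesPerDay = $spansPerDay * $bytesPerSpan;                   // ~0.9 TB

printf("~%dM spans/day, ~%.1f TB/day\n", $spansPerDay / 1_000_000, $bytesPerDay / 1e12);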

Head-based sampling

Decide at the root span. A deterministic hash of the trace ID lets distributed services agree on the same decision — a trace either lives or dies across every service it touches.

OpenTelemetry head sampler (PHP)

php
use OpenTelemetry\SDK\Trace\Sampler\TraceIdRatioBasedSampler;
use OpenTelemetry\SDK\Trace\TracerProvider;

$sampler = new TraceIdRatioBasedSampler(0.1); // keep 10%

$tracerProvider = TracerProvider::builder()
    ->setSampler($sampler)
    ->build();

Pros: cheap (no buffering), works cross-service, no extra infrastructure to run. Cons: you throw away traces before knowing whether they were errors or slow; at a 10% rate you statistically drop about 90% of your interesting traces along with everything else.

Tail-based sampling

Buffer every span, keyed by trace ID. After a decision window (usually 30-60 seconds), apply rules to the buffered trace to decide whether to keep it.

OTel Collector tail sampling config

yaml
processors:
  tail_sampling:
    decision_wait: 30s
    num_traces: 50000
    policies:
      - name: errors-always
        type: status_code
        status_code:
          status_codes: [ERROR]
      - name: slow-always
        type: latency
        latency:
          threshold_ms: 1000
      - name: sample-normal
        type: probabilistic
        probabilistic:
          sampling_percentage: 10

Pros: keep all errors, keep all slow traces, sample the rest. Much better data quality per dollar. Cons: you need a Collector cluster with enough memory to buffer all in-flight traces. Operational complexity.
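
One piece of that complexity: tail sampling only works if every span of a trace reaches the same Collector instance, so a multi-instance setup usually puts a routing tier in front of the tail-sampling Collectors. A minimal sketch using the Collector contrib load-balancing exporter, keyed by trace ID (the hostnames are placeholders):

yaml
exporters:
  loadbalancing:
    routing_key: traceID
    protocol:
      otlp:
        tls:
          insecure: true
    resolver:
      static:
        hostnames:
          - tail-collector-1:4317
          - tail-collector-2:4317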

Hybrid approach

Many production systems combine both: the SDK head-samples at 100% (keeps everything) and exports to a Collector that then applies tail rules. The head sampler stops being a sampler and becomes a pass-through; the tail rules do the actual filtering.
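
Concretely, that means the SDK exports every span and the Collector's traces pipeline runs the tail_sampling processor shown above. A minimal sketch of the pipeline wiring, assuming the referenced receivers, processors, and exporters are defined elsewhere in the same config:

yaml
service:
  pipelines:
    traces:
      receivers: [otlp]
      processors: [tail_sampling, batch]
      exporters: [otlp]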

The NightOwl approach

NightOwl doesn't sample by default. At typical Laravel volumes (1-10K req/s) storing everything in PostgreSQL is cheap, and the full dataset matters for debugging rare issues. At high volumes where storage cost matters, configure sampling at the agent level — the Nightwatch package supports it via its nightwatch.sample_rate config key.

Frequently asked questions

What's the difference between head-based and tail-based trace sampling?

Head-based sampling decides whether to keep a trace at the moment the root span starts, before the trace is complete. Tail-based sampling buffers the whole trace and decides after it finishes. Head is cheap and fast; tail is smarter because you can keep all error traces and all slow traces and drop the boring ones.

Why do I need to sample traces at all?

Cost. At moderate traffic (1,000 req/s), full tracing at ~10 spans per request and ~1 KB per span generates roughly 860 GB of trace data per day. At 50¢/GB stored for 14 days, that's on the order of $6,000/month just for traces. Sampling at 10% cuts it to roughly $600. At 1% with tail-sampling bias toward errors, you keep the interesting data for roughly $60.

How do I configure head-based sampling in Laravel?

In OpenTelemetry SDK config, set TraceIdRatioBasedSampler with a ratio (0.1 = 10%, 0.01 = 1%). The sampler keeps or drops deterministically based on the trace ID itself, so every service running the same sampler config reaches the same decision, keeping traces consistent across services.
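
A minimal sketch of how that could be wired into a Laravel app, assuming the open-telemetry/sdk package is installed (the provider class and binding shown here are illustrative, not part of NightOwl or Laravel):

php
use Illuminate\Support\ServiceProvider;
use OpenTelemetry\API\Trace\TracerProviderInterface;
use OpenTelemetry\SDK\Trace\Sampler\ParentBased;
use OpenTelemetry\SDK\Trace\Sampler\TraceIdRatioBasedSampler;
use OpenTelemetry\SDK\Trace\TracerProvider;

class TracingServiceProvider extends ServiceProvider
{
    public function register(): void
    {
        $this->app->singleton(TracerProviderInterface::class, function () {
            // Root spans are kept with ~10% probability; child spans follow
            // the decision already made for their parent, so a trace is
            // kept or dropped as a whole.
            return TracerProvider::builder()
                ->setSampler(new ParentBased(new TraceIdRatioBasedSampler(0.1)))
                ->build();
        });
    }
}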

What's the right sampling rate?

Depends on traffic and budget. For low-traffic apps (under 100 req/s), keep everything. For moderate traffic (100-1,000 req/s), head-sample at 10% and always keep errors and slow requests via tail sampling. For high traffic (1,000+ req/s), head-sample at 1% plus tail sampling for anomalies. Keeping everything at 100% generally stops being viable somewhere around a billion traces per month.

How does NightOwl handle sampling?

NightOwl stores every request by default — no sampling — because BYOD Postgres at Laravel-typical volumes (1K-10K req/s) is still cheap. At very high volumes (tens of thousands of req/s) we recommend enabling sampling at the agent level. This is the opposite tradeoff from cloud APMs, where per-event cost forces sampling earlier.

Can I use tail-based sampling with OpenTelemetry?

Yes, via the OTel Collector's tail_sampling processor. You run a Collector cluster that buffers spans per trace ID for a window (usually 30-60 seconds), then decides what to keep based on rules: always keep errors, always keep traces over a latency threshold, sample a percentage of the rest. More operational complexity than head sampling, but dramatically better data quality.

PRICING

Flat pricing. No event caps. No per-seat fees.

14-day free trial, no credit card. Your PostgreSQL, your data.

HOBBY

$5 /month

1 app · 14 days lookback · all Laravel events

TEAM

$15 /month

Up to 3 connected apps · unlimited environments · all Laravel events

AGENCY

$69 /month

Unlimited apps · unlimited agent instances · same flat rate at any traffic