Added gateway overhead
The single number that is genuinely ours: how many milliseconds NemoRouter adds on top of the provider round-trip. Auth, credit reserve, guardrail evaluation, routing, and the cost-header read all happen here.
- Measured as wall-clock time inside the gateway, excluding provider latency
- Reported at p50, p95, and p99 — the tail is where gateways hide cost
- Broken down by stage: auth, guardrails, routing, settle
- A pass-through request (no guardrails enabled) is the floor case