Docs/Finops/Performance Metrics

Cost & Performance Metrics

These analytics views sit between the headline dashboards and the optimizer: they line up cost, latency, and quality so you can decide where tuning or a model swap is worth it.

ℹ

Open them from FinOps → Cost Performance Metrics and FinOps → Latency & Analytics.

Cost Performance Metrics

A detailed look at where cost meets quality:

Per-model cost table — cost, usage count, and quality scores side by side, so you can spot models that cost a lot for little quality gain.
Top agents/traces by volume — what's driving usage.
Cost by user — per-user spend.
Model usage trends — usage per model over time.
Scores table — evaluation scores with trends.

Latency & Analytics

Performance with a quality lens:

Latency tables — p50 / p95 / p99 by model and user.
Generation latency over time.
Score analytics — quality distributions alongside latency, to expose speed-vs-quality trade-offs.

Reading the trade-off

The point of these views is comparison, not a single number:

A model that's cheap but low-quality shows up as low cost + low scores — a candidate to replace.
A model that's expensive but no better shows high cost + similar scores — a candidate to downgrade via the AI Optimizer.
A model that's fast but worse appears as low latency + low scores — weigh the trade.

Next steps

AI Optimizer — get a concrete model-swap recommendation.
Scores — how the quality numbers are produced.