Benchmark Overview
38.40
400.00
2
Apr 8, 2026, 12:04 AM
Key Insights
gpt-4o-mini streams at 38.40 tokens/second on average across the last 2 benchmark runs.
Performance fluctuated by 9.00 tokens/second (23.4% coefficient of variation), indicating variable behavior across benchmark runs.
Average time to first token is 400.00 ms (excellent latency), suitable for latency-sensitive workloads.
Latest measurements completed on Apr 8, 2026, 12:04 AM based on 2 total samples.
Performance Distribution
Distribution of throughput measurements showing performance consistency across benchmark runs.
Performance Over Time
Historical performance trends showing how throughput has changed over the benchmarking period.
gpt-4o-mini
Benchmark Samples
| Provider | Model | Avg Toks/Sec | Min | Max | Avg TTF (ms) |
|---|---|---|---|---|---|
| openai | gpt-4o-mini | 38.40 | 33.90 | 42.90 | 400.00 |
Frequently Asked Questions
The latest rolling average throughput is 38.40 tokens per second with an average time to first token of 400.00 ms across 2 recent runs.
Benchmarks refresh automatically whenever the monitoring cron runs. The most recent run completed on Apr 8, 2026, 12:04 AM.