Benchmark Overview
55.70
310.00
4
Feb 7, 2026, 12:10 AM
Key Insights
qwen-2.5-72b streams at 55.70 tokens/second on average across the last 4 benchmark runs.
Performance fluctuated by 12.70 tokens/second (22.8% coefficient of variation), indicating variable behavior across benchmark runs.
Average time to first token is 310.00 ms (excellent latency), suitable for latency-sensitive workloads.
Latest measurements completed on Feb 7, 2026, 12:10 AM based on 4 total samples.
Performance Distribution
Distribution of throughput measurements showing performance consistency across benchmark runs.
Performance Over Time
Historical performance trends showing how throughput has changed over the benchmarking period.
qwen-2.5-72b
Benchmark Samples
| Provider | Model | Avg Toks/Sec | Min | Max | Avg TTF (ms) |
|---|---|---|---|---|---|
| together | qwen-2.5-72b | 55.70 | 49.40 | 62.10 | 310.00 |
Frequently Asked Questions
The latest rolling average throughput is 55.70 tokens per second with an average time to first token of 310.00 ms across 4 recent runs.
Benchmarks refresh automatically whenever the monitoring cron runs. The most recent run completed on Feb 7, 2026, 12:10 AM.