Benchmark Overview
21.20
680.00
3
Jan 2, 2026, 03:03 AM
Key Insights
qwen-3-235b streams at 21.20 tokens/second on average across the last 3 benchmark runs.
Performance fluctuated by 15.00 tokens/second (70.8% coefficient of variation), indicating variable behavior across benchmark runs.
Average time to first token is 680.00 ms (good latency), suitable for latency-sensitive workloads.
Latest measurements completed on Jan 2, 2026, 03:03 AM based on 3 total samples.
Performance Distribution
Distribution of throughput measurements showing performance consistency across benchmark runs.
Performance Over Time
Historical performance trends showing how throughput has changed over the benchmarking period.
qwen-3-235b
Benchmark Samples
| Provider | Model | Avg Toks/Sec | Min | Max | Avg TTF (ms) |
|---|---|---|---|---|---|
| deepinfra | qwen-3-235b | 21.20 | 14.60 | 29.60 | 680.00 |
Frequently Asked Questions
The latest rolling average throughput is 21.20 tokens per second with an average time to first token of 680.00 ms across 3 recent runs.
Benchmarks refresh automatically whenever the monitoring cron runs. The most recent run completed on Jan 2, 2026, 03:03 AM.