Provider Snapshot
Models tracked: 6
Average tokens/second: 54.80
Average TTF (ms): 0.00
Jun 25, 2026
Key Takeaways
6 fireworks models are actively benchmarked with 1445 total measurements across 1334 benchmark runs.
GPT-oss-120b leads the fleet with an average of 116.00 tokens/second, while glm-5p1 is the slowest at 29.30 tokens/second.
Average throughput varies by 295.9% across the fireworks lineup: the fastest model (116.00 tokens/second) is roughly four times as fast as the slowest (29.30 tokens/second).
The coefficient of variation across the six per-model averages is 54.9%, so throughput differs substantially from model to model, which may reflect differing model architectures.
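The page does not state how the spread and variation figures are calculated. The sketch below is one plausible reconstruction, using the per-model averages from the tables on this page. It assumes the spread is measured relative to the slowest model and that the variation coefficient uses the population standard deviation. Under those assumptions the results appear to match the reported 295.9% and 54.9%. The function and variable names are illustrative, not part of any benchmark API.

```python
from statistics import mean, pstdev

# Average tokens/second per model, copied from the tables on this page.
avg_tok_s = {
    "GPT-oss-120b": 116.00,
    "minimax-m2p7": 67.80,
    "kimi-k2p6": 45.50,
    "kimi-k2p5": 36.00,
    "glm-5p2": 34.20,
    "glm-5p1": 29.30,
}


def fleet_summary(values):
    """Summarize per-model average throughput (hypothetical helper)."""
    values = list(values)
    fastest, slowest = max(values), min(values)
    fleet_mean = mean(values)
    return {
        "mean": fleet_mean,
        # Assumption: spread is the gap between fastest and slowest,
        # expressed as a percentage of the slowest model.
        "spread_pct": (fastest - slowest) / slowest * 100,
        # Assumption: coefficient of variation uses the population
        # standard deviation divided by the mean.
        "cv_pct": pstdev(values) / fleet_mean * 100,
    }


summary = fleet_summary(avg_tok_s.values())
print(f"mean tok/s: {summary['mean']:.2f}")
print(f"spread:     {summary['spread_pct']:.1f}%")
print(f"variation:  {summary['cv_pct']:.1f}%")
```

Switching `pstdev` to the sample standard deviation (`stdev`) gives a noticeably larger coefficient, which is why the population form is assumed here.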
Fastest Models
| Provider | Model | Avg Tok/s | Min Tok/s | Max Tok/s | Avg TTF (ms) |
|---|---|---|---|---|---|
| fireworks | GPT-oss-120b | 116.00 | 33.20 | 195.00 | 0.00 |
| fireworks | minimax-m2p7 | 67.80 | 2.64 | 111.00 | 0.00 |
| fireworks | kimi-k2p6 | 45.50 | 2.27 | 83.00 | 0.00 |
| fireworks | kimi-k2p5 | 36.00 | 1.17 | 71.40 | 0.00 |
| fireworks | glm-5p2 | 34.20 | 6.16 | 116.00 | 0.00 |
| fireworks | glm-5p1 | 29.30 | 5.97 | 55.30 | 0.00 |
All Models
Complete list of all fireworks models tracked in the benchmark system.
| Provider | Model | Avg Tok/s | Min Tok/s | Max Tok/s | Avg TTF (ms) |
|---|---|---|---|---|---|
| fireworks | glm-5p1 | 29.30 | 5.97 | 55.30 | 0.00 |
| fireworks | glm-5p2 | 34.20 | 6.16 | 116.00 | 0.00 |
| fireworks | GPT-oss-120b | 116.00 | 33.20 | 195.00 | 0.00 |
| fireworks | kimi-k2p5 | 36.00 | 1.17 | 71.40 | 0.00 |
| fireworks | kimi-k2p6 | 45.50 | 2.27 | 83.00 | 0.00 |
| fireworks | minimax-m2p7 | 67.80 | 2.64 | 111.00 | 0.00 |
Frequently Asked Questions
Based on recent tests, GPT-oss-120b shows the highest average throughput among tracked fireworks models.
This provider summary aggregates 1445 individual prompts measured across 1334 monitoring runs over the past month.