Benchmark Overview
2.94
0.00
3
Jun 9, 2026, 11:46 PM
Key Insights
Llama-Guard-4-12B streams at 2.94 tokens/second on average across the last 3 benchmark runs.
Performance fluctuated by 1.51 tokens/second (51.4% coefficient of variation), indicating variable behavior across benchmark runs.
Latest measurements completed on Jun 9, 2026, 11:46 PM based on 3 total samples.
Performance Distribution
Distribution of throughput measurements showing performance consistency across benchmark runs.
Performance Over Time
Historical performance trends showing how throughput has changed over the benchmarking period.
Llama-Guard-4-12B
Benchmark Samples
| Provider | Model | Avg Toks/Sec | Min | Max | Avg TTF (ms) |
|---|---|---|---|---|---|
| together | Llama-Guard-4-12B | 2.94 | 2.14 | 3.65 | 0.00 |
Frequently Asked Questions
The latest rolling average throughput is 2.94 tokens per second with an average time to first token of 0.00 ms across 3 recent runs.
Benchmarks refresh automatically whenever the monitoring cron runs. The most recent run completed on Jun 9, 2026, 11:46 PM.