Llama-Guard-4-12B by together Benchmarks

Benchmark Overview

Avg Tokens / Second

2.94

Avg Time to First Token (ms)

0.00

Runs Analysed

Last Updated

Jun 9, 2026, 11:46 PM

Llama-Guard-4-12B streams at 2.94 tokens/second on average across the last 3 benchmark runs.
Performance fluctuated by 1.51 tokens/second (51.4% coefficient of variation), indicating variable behavior across benchmark runs.
Latest measurements completed on Jun 9, 2026, 11:46 PM based on 3 total samples.

Distribution of throughput measurements showing performance consistency across benchmark runs.

Historical performance trends showing how throughput has changed over the benchmarking period.

Provider	Model	Avg Toks/Sec	Min	Max	Avg TTF (ms)
together	Llama-Guard-4-12B	2.94	2.14	3.65	0.00

The latest rolling average throughput is 2.94 tokens per second with an average time to first token of 0.00 ms across 3 recent runs.

Benchmarks refresh automatically whenever the monitoring cron runs. The most recent run completed on Jun 9, 2026, 11:46 PM.