API Status

gemma-4-31b-it Benchmarks

Provider: together

Explore real-world latency and throughput results for gemma-4-31b-it. These measurements come from automated benchmarking runs against the provider APIs using the same harness that powers the public cloud dashboard.

Want a broader view of this vendor? Visit the together provider hub to compare every tracked model side-by-side.

Visit together Official Website

Benchmark Overview

Avg Tokens / Second

18.70

Avg Time to First Token (ms)

0.00

Runs Analysed

4

Last Updated

Jun 9, 2026, 04:34 AM

Key Insights

  • gemma-4-31b-it streams at 18.70 tokens/second on average across the last 4 benchmark runs.

  • Performance fluctuated by 31.79 tokens/second (170.0% coefficient of variation), indicating variable behavior across benchmark runs.

  • Latest measurements completed on Jun 9, 2026, 04:34 AM based on 4 total samples.

Performance Distribution

Distribution of throughput measurements showing performance consistency across benchmark runs.

Performance Over Time

Historical performance trends showing how throughput has changed over the benchmarking period.

gemma-4-31b-it

Benchmark Samples

ProviderModelAvg Toks/SecMinMaxAvg TTF (ms)
togethergemma-4-31b-it18.708.5140.300.00

Frequently Asked Questions

The latest rolling average throughput is 18.70 tokens per second with an average time to first token of 0.00 ms across 4 recent runs.

Benchmarks refresh automatically whenever the monitoring cron runs. The most recent run completed on Jun 9, 2026, 04:34 AM.

Related Links