gemini-2.5-flash-lite Benchmarks

Provider: google

Explore real-world latency and throughput results for gemini-2.5-flash-lite. These measurements come from automated benchmarking runs against the provider APIs using the same harness that powers the public cloud dashboard.

Want a broader view of this vendor? Visit the google provider hub to compare every tracked model side-by-side.

Visit google Official Website

Benchmark Overview

Avg Tokens / Second

48.20

Avg Time to First Token (ms)

940.00

Runs Analysed

2

Last Updated

Feb 4, 2026, 09:25 PM

Key Insights

  • gemini-2.5-flash-lite streams at 48.20 tokens/second on average across the last 2 benchmark runs.

  • Performance fluctuated by 6.60 tokens/second (13.7% coefficient of variation), indicating consistent behavior across benchmark runs.

  • Average time to first token is 940.00 ms (good latency), suitable for latency-sensitive workloads.

  • Latest measurements completed on Feb 4, 2026, 09:25 PM based on 2 total samples.

Benchmark Samples

ProviderModelAvg Toks/SecMinMaxAvg TTF (ms)
google48.2044.9051.50940.00

Frequently Asked Questions

The latest rolling average throughput is 48.20 tokens per second with an average time to first token of 940.00 ms across 2 recent runs.

Benchmarks refresh automatically whenever the monitoring cron runs. The most recent run completed on Feb 4, 2026, 09:25 PM.

Related Links