API Status

deepseek-v4-pro Benchmarks

Provider: fireworks

Explore real-world latency and throughput results for deepseek-v4-pro. These measurements come from automated benchmarking runs against the provider APIs using the same harness that powers the public cloud dashboard.

Want a broader view of this vendor? Visit the fireworks provider hub to compare every tracked model side-by-side.

Visit fireworks Official Website

Benchmark Overview

Avg Tokens / Second

35.10

Avg Time to First Token (ms)

0.00

Runs Analysed

9

Last Updated

May 8, 2026, 07:43 PM

Key Insights

  • deepseek-v4-pro streams at 35.10 tokens/second on average across the last 9 benchmark runs.

  • Performance fluctuated by 55.00 tokens/second (156.7% coefficient of variation), indicating variable behavior across benchmark runs.

  • Latest measurements completed on May 8, 2026, 07:43 PM based on 9 total samples.

Performance Distribution

Distribution of throughput measurements showing performance consistency across benchmark runs.

Performance Over Time

Historical performance trends showing how throughput has changed over the benchmarking period.

deepseek-v4-pro

Benchmark Samples

ProviderModelAvg Toks/SecMinMaxAvg TTF (ms)
fireworksdeepseek-v4-pro35.1014.5069.500.00

Frequently Asked Questions

The latest rolling average throughput is 35.10 tokens per second with an average time to first token of 0.00 ms across 9 recent runs.

Benchmarks refresh automatically whenever the monitoring cron runs. The most recent run completed on May 8, 2026, 07:43 PM.

Related Links