API Status

claude-opus-4.6 Benchmarks

Provider: anthropic

Explore real-world latency and throughput results for claude-opus-4.6. These measurements come from automated benchmarking runs against the provider APIs using the same harness that powers the public cloud dashboard.

Want a broader view of this vendor? Visit the anthropic provider hub to compare every tracked model side-by-side.

Visit anthropic Official Website

Benchmark Overview

Avg Tokens / Second

21.80

Avg Time to First Token (ms)

1370.00

Runs Analysed

7

Last Updated

May 14, 2026, 07:43 PM

Key Insights

  • claude-opus-4.6 streams at 21.80 tokens/second on average across the last 7 benchmark runs.

  • Performance fluctuated by 7.70 tokens/second (35.3% coefficient of variation), indicating variable behavior across benchmark runs.

  • Average time to first token is 1370.00 ms (moderate latency), consider alternatives for latency-sensitive workloads.

  • Latest measurements completed on May 14, 2026, 07:43 PM based on 7 total samples.

Performance Distribution

Distribution of throughput measurements showing performance consistency across benchmark runs.

Performance Over Time

Historical performance trends showing how throughput has changed over the benchmarking period.

claude-opus-4.6

Benchmark Samples

ProviderModelAvg Toks/SecMinMaxAvg TTF (ms)
anthropicclaude-opus-4.621.8019.3027.001370.00

Frequently Asked Questions

The latest rolling average throughput is 21.80 tokens per second with an average time to first token of 1370.00 ms across 7 recent runs.

Benchmarks refresh automatically whenever the monitoring cron runs. The most recent run completed on May 14, 2026, 07:43 PM.

Related Links