API Status

mixtral-8x7b Benchmarks

Provider: bedrock

Explore real-world latency and throughput results for mixtral-8x7b. These measurements come from automated benchmarking runs against the provider APIs using the same harness that powers the public cloud dashboard.

Want a broader view of this vendor? Visit the bedrock provider hub to compare every tracked model side-by-side.


Benchmark Overview

Avg Tokens / Second

80.30

Avg Time to First Token (ms)

210.00

Runs Analysed

3

Last Updated

May 11, 2026, 07:01 PM

Key Insights

  • mixtral-8x7b streams at 80.30 tokens/second on average across the last 3 benchmark runs.

  • Performance fluctuated by 2.30 tokens/second (2.9% coefficient of variation), indicating consistent behavior across benchmark runs.

  • Average time to first token is 210.00 ms (excellent latency), suitable for latency-sensitive workloads.

  • Latest measurements completed on May 11, 2026, 07:01 PM based on 3 total samples.

Performance Distribution

Distribution of throughput measurements showing performance consistency across benchmark runs.

Performance Over Time

Historical performance trends showing how throughput has changed over the benchmarking period.

mixtral-8x7b

Benchmark Samples

ProviderModelAvg Toks/SecMinMaxAvg TTF (ms)
bedrockmixtral-8x7b80.3079.4081.70210.00

Frequently Asked Questions

The latest rolling average throughput is 80.30 tokens per second with an average time to first token of 210.00 ms across 3 recent runs.

Benchmarks refresh automatically whenever the monitoring cron runs. The most recent run completed on May 11, 2026, 07:01 PM.

Related Links