Cloud BenchmarksLocal Benchmarks
API Status

☁️ Cloud Benchmarks ☁️

I run cron jobs to periodically test the token generation speed of different cloud LLM providers. The chart helps visualize the distributions of different speeds, as they can vary somewhat depending on the loads. For readability not all models are shown, but you can see the full results in the table below.

Every provider and model now has a dedicated landing page with narrative insights, SEO-friendly metadata, and structured data for search engines. Click any provider or model in the table to explore performance in depth.

I am working daily to add more providers and models, looking anywhere that does not require purchasing dedicated endpoints for hosting (why some models may appear to be missing). If you have any more suggestions let me know on GitHub!! 😊

Pick A Path In 10 Seconds

Quick recommendations from the latest 7-day benchmark slice. Use one path, jump into full results, then drill into provider/model pages.

Loading 7-day recommendations…

Fastest Models Right Now (updated <24h)

#ModelProviderSpeed
1llama-3.1-8bgroq289 tok/s
2qwen-3-32bgroq205 tok/s
3llama-4-scoutgroq189 tok/s
4llama-3.3-70bgroq179 tok/s
5llama-3.1-8bcerebras176 tok/s

πŸ“Š Speed Distribution πŸ“Š

πŸ“š Full Results πŸ“š

Showing 94 of 94 modelsFlagged statuses: likely_deprecated, deprecated, failing, stale, never_succeeded, disabled
Status
groqllama-3.1-8bActive29m ago289.00130424110.00
groqllama-4-maverickActive20d ago225.001307990.00
groqqwen-3-32bActive29m ago205.0015287210.00
groqllama-4-scoutActive29m ago189.0043333250.00
groqllama-3.3-70bActive29m ago179.0040340170.00
cerebrasgpt-oss-120bActive17d ago177.0013801430.00
cerebrasllama-3.1-8bActive30m ago176.0013531330.00
togetherllama-3.1-8bActive15d ago144.003228420.00
groqkimi-k2Active29m ago126.0012215400.00
bedrocknova-microActive27m ago124.0069152260.00
openaio3 MiniNever Succeeded(Medium)27m ago111.0081690.00
bedrockllama-4-maverickActive27m ago105.001145520.00
openaio3-mini-2025-01-31Active28m ago105.00151600.00
bedrocknova-liteActive27m ago99.8020132300.00
bedrockllama-4-scoutActive27m ago98.703130310.00
bedrockllama-3.3-70bActive27m ago95.402128310.00
openaiGPT-5.4-nanoActive28m ago91.0042129410.00
deepinframistral-7bStale(Medium)30m ago90.3010148480.00
togetherqwen-2.5-7bActive26m ago89.301141530.00
openaiGPT-5.1-codex-maxActive27m ago88.40141171200.00
openaiGPT-5.4-nano-2026-03-17Active28m ago87.0036125460.00
openaio1Active28m ago83.002114750.00
deepinfradevstral-smallNever Succeeded(Medium)30m ago80.609140520.00
bedrocknova-proActive27m ago79.8019118380.00
openaiGPT-5.4-miniActive28m ago77.9016111500.00
googlegemini-2.5-flash-liteActive26m ago74.9010117530.00
openaiGPT-5.4-mini-2026-03-17Active28m ago74.809119530.00
fireworksmixtral-8x22bActive29m ago74.5028111330.00
openaigpt-4.1-nanoActive27m ago73.9020139430.00
openaigpt-3.5-turboActive26m ago73.804125510.00
googlegemini-2.5-flashNever Succeeded(Medium)26m ago64.5061051030.00
openaigpt-4oActive26m ago61.2051421610.00
togetherdeepseek-r1Active26m ago61.201113710.00
togethermixtral-8x7bActive26m ago59.008114200.00
fireworksllama-3.3-70bActive30m ago56.6011081350.00
togetherllama-3.3-70bActive26m ago53.6011211070.00
openaigpt-4.1-miniActive27m ago53.0018109400.00
openaio4-mini-2025-04-16Active28m ago52.1028770.00
openaiGPT-5-chat-latestActive29m ago52.001382550.00
openaio4 MiniNever Succeeded(Medium)27m ago50.704770.00
togetherllama-3.2-3bActive23d ago49.3051181700.00
anthropicclaude-haiku-4.5Active30m ago49.00373660.00
bedrockllama-3.2-90bActive27m ago46.60250380.00
deepinfrallama-3-8bStale(Medium)30m ago45.601869320.00
openaigpt-4.1Active27m ago42.501585530.00
deepinfrallama-3.2-1bStale(Medium)30m ago41.403100740.00
deepinfraQwen 2.5 Coder 32BNever Succeeded(Medium)30m ago40.901843280.00
openaio3-2025-04-16Active28m ago40.8013710.00
bedrockmistral-largeActive27m ago40.70247550.00
deepinfrallama-3.2-3bStale(Medium)30m ago40.20399760.00
bedrockclaude-haiku-4.5Active27m ago39.903651150.00
googlegemini-2.5-proNever Succeeded(Medium)26m ago39.802651690.00
openaigpt-4o-miniActive26m ago39.40764410.00
openaio3Active28m ago37.7010670.00
deepinfrallama-3.1-8bStale(Medium)30m ago37.10278780.00
openaiGPT-5.1-2025-11-13Active29m ago35.001262800.00
deepinfrallama-3.2-90bStale(Medium)30m ago33.50482820.00
openaiGPT-5.1Active27m ago32.90264970.00
openaigpt-4-turboActive4d ago32.80151520.00
bedrockclaude-3-5-haikuActive27m ago32.40538660.00
bedrockclaude-3-5-sonnetActive27m ago32.20146780.00
bedrockclaude-3-7-sonnetActive27m ago31.90242790.00
deepinfrallama-2-70bStale(Medium)30m ago30.90353700.00
openaiGPT-5.4-2026-03-05Active28m ago30.601842690.00
deepinfrallama-3-70bStale(Medium)30m ago30.50251690.00
openaiGPT-5.4Active28m ago30.201545770.00
openaiGPT-5.2-2025-12-11Active29m ago30.001843710.00
deepinfraqwen-2.5-72bStale(Medium)30m ago29.401462280.00
openaiGPT-5.2Active27m ago29.20447860.00
openaiGPT-5.1-chat-latestActive29m ago28.901052950.00
openaiGPT-5.1-codexActive27m ago27.801521200.00
openaiGPT-5.1-codex-miniActive3h ago26.301511150.00
openaigpt-4Active26m ago26.30447640.00
openaiGPT-5.3-codexActive28m ago25.20740880.00
deepinfrallama-3.1-405bStale(Medium)30m ago22.401391030.00
deepinfrallama-3.1-70bStale(Medium)30m ago21.901421080.00
bedrockclaude-sonnet-4.5Active27m ago21.301291750.00
anthropicclaude-opus-4.5Active30m ago20.802331730.00
deepinfrallama-3.2-11bStale(Medium)30m ago19.101611700.00
anthropicclaude-4-sonnetActive30m ago19.006321970.00
bedrockclaude-opus-4.5Active27m ago18.801272340.00
deepinfrallama-3.3-70bNever Succeeded(Medium)30m ago18.601432390.00
anthropicClaude Opus 4.1Active30m ago17.707271490.00
anthropicclaude-4-opusActive30m ago17.205221340.00
openaigpt-5.2-codexActive28m ago16.501351480.00
openaiGPT-5.2-chat-latestActive29m ago10.901271520.00
openaio1-proLikely Deprecated(Medium)27m ago10.10118580.00
openaiGPT-5.2-proActive27m ago9.064144740.00
openaiGPT-5-codexActive9h ago7.991171810.00
openaio3-proActive28m ago7.36115320.00
openaio3-pro-2025-06-10Active28m ago7.16214630.00
deepinfraqwen-3-235bNever Succeeded(Medium)30m ago6.881535640.00
openaiGPT-5-proActive29m ago3.94170.00
openaiGPT-5.2-pro-2025-12-11Active6h ago1.96147980.00
Lifecycle snapshot
Loading status summary…

πŸ“ˆ Time Series πŸ“ˆ