
☁️ Cloud Benchmarks ☁️

I run cron jobs to periodically test the token generation speed of different cloud LLM providers. The chart helps visualize the distribution of speeds, which can vary with provider load. For readability, not all models are shown in the chart, but the full results are in the table below.
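As a rough illustration of what one scheduled measurement might involve, here is a minimal sketch that times a single generation call and summarizes a sample of speeds. It is not the site's actual benchmark code: `generate` is a hypothetical stand-in for a provider request that returns the number of tokens produced, and `tokens_per_second` and `summarize` are names made up for this example.

```python
import statistics
import time
from typing import Callable


def tokens_per_second(generate: Callable[[], int],
                      clock: Callable[[], float] = time.perf_counter) -> float:
    """Time one generation call and return output tokens per second.

    `generate` is a hypothetical placeholder for a provider request;
    it must return the number of tokens it produced.
    """
    start = clock()
    n_tokens = generate()
    elapsed = clock() - start
    if elapsed <= 0:
        raise ValueError("elapsed time must be positive")
    return n_tokens / elapsed


def summarize(speeds: list[float]) -> dict[str, float]:
    """Summarize a sample of speeds (needs at least two values)."""
    q1, median, q3 = statistics.quantiles(speeds, n=4)
    return {
        "min": min(speeds),
        "p25": q1,
        "median": median,
        "p75": q3,
        "max": max(speeds),
        "mean": statistics.fmean(speeds),
    }
```

Looking at quartiles alongside the mean is one way to see the load-dependent spread that a single average would hide.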

Every provider and model now has a dedicated landing page with narrative insights, SEO-friendly metadata, and structured data for search engines. Click any provider or model in the table to explore performance in depth.

I am working daily to add more providers and models, looking anywhere that does not require purchasing dedicated endpoints for hosting, which is why some models may appear to be missing. If you have any suggestions, let me know on GitHub! 😊

Pick A Path In 10 Seconds

Quick recommendations from the latest 7-day benchmark slice. Use one path, jump into full results, then drill into provider/model pages.
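One way such a 7-day slice could be computed is sketched below, assuming each benchmark run is stored with a provider, a model, a speed, and a completion time. The `Run` record, the `fastest_recent` function, and the choice to rank by mean speed are assumptions made for illustration, not a description of how the site builds its recommendations.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta, timezone
from statistics import fmean


@dataclass(frozen=True)
class Run:
    """Hypothetical record of one benchmark run."""
    provider: str
    model: str
    tokens_per_second: float
    finished_at: datetime


def fastest_recent(runs, now, days=7, top=5):
    """Rank (provider, model) pairs by mean speed over the last `days` days."""
    cutoff = now - timedelta(days=days)
    grouped = {}
    for run in runs:
        if run.finished_at >= cutoff:
            key = (run.provider, run.model)
            grouped.setdefault(key, []).append(run.tokens_per_second)
    # Sort by mean speed, fastest first.
    ranked = sorted(
        ((fmean(speeds), provider, model)
         for (provider, model), speeds in grouped.items()),
        reverse=True,
    )
    return [(provider, model, speed) for speed, provider, model in ranked[:top]]


# Illustrative data only; these are not real benchmark results.
now = datetime(2026, 1, 8, tzinfo=timezone.utc)
sample_runs = [
    Run("provider-a", "model-x", 100.0, now - timedelta(days=1)),
    Run("provider-a", "model-x", 200.0, now - timedelta(days=2)),
    Run("provider-b", "model-y", 120.0, now - timedelta(days=3)),
    Run("provider-c", "model-z", 999.0, now - timedelta(days=10)),  # outside the window
]
recent_top = fastest_recent(sample_runs, now)
```

Runs older than the cutoff are ignored, so a model that was fast two weeks ago but has not completed a run since does not appear in the ranking.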


Fastest Models Right Now (updated <24h)

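| # | Model | Provider | Speed |
|---|-------|----------|-------|
| 1 | llama-3.1-8b | groq | 296 tok/s |
| 2 | qwen-3-32b | groq | 205 tok/s |
| 3 | llama-4-scout | groq | 185 tok/s |
| 4 | llama-3.3-70b | groq | 183 tok/s |
| 5 | llama-3.1-8b | cerebras | 169 tok/s |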

📊 Speed Distribution 📊

📚 Full Results 📚

Showing 96 of 96 models
Flagged statuses: likely_deprecated, deprecated, failing, stale, never_succeeded, disabled

| Provider | Model | Status | Last run | Speed (tok/s) | Remaining columns (fused) |
|----------|-------|--------|----------|---------------|---------------------------|
| groq | llama-3.1-8b | Active | 25m ago | 296.00 | 130450100.00 |
| groq | llama-4-maverick | Active | 16d ago | 213.00 | 1307850.00 |
| groq | qwen-3-32b | Active | 25m ago | 205.00 | 15287200.00 |
| groq | llama-4-scout | Active | 25m ago | 185.00 | 51333260.00 |
| groq | llama-3.3-70b | Active | 25m ago | 183.00 | 40340160.00 |
| cerebras | gpt-oss-120b | Active | 13d ago | 180.00 | 13801320.00 |
| cerebras | llama-3.1-8b | Active | 29m ago | 169.00 | 13531370.00 |
| together | llama-3.1-8b | Active | 11d ago | 144.00 | 3228380.00 |
| groq | kimi-k2 | Active | 25m ago | 127.00 | 12215390.00 |
| bedrock | nova-micro | Active | 21m ago | 123.00 | 65152260.00 |
| openai | o3 Mini | Never Succeeded (Medium) | 22m ago | 109.00 | 81690.00 |
| openai | o3-mini-2025-01-31 | Active | 23m ago | 108.00 | 151600.00 |
| bedrock | llama-4-maverick | Active | 21m ago | 105.00 | 1145520.00 |
| bedrock | nova-lite | Active | 21m ago | 100.00 | 20132300.00 |
| bedrock | llama-4-scout | Active | 21m ago | 99.30 | 3130300.00 |
| bedrock | llama-3.3-70b | Active | 21m ago | 95.80 | 3134310.00 |
| openai | GPT-5.4-nano | Active | 23m ago | 92.10 | 43129410.00 |
| together | qwen-2.5-7b | Active | 21m ago | 90.50 | 1145530.00 |
| deepinfra | mistral-7b | Stale (Medium) | 25m ago | 88.00 | 5148540.00 |
| openai | GPT-5.4-nano-2026-03-17 | Active | 23m ago | 86.90 | 36125490.00 |
| openai | o1 | Active | 23m ago | 86.40 | 211470.00 |
| openai | GPT-5.1-codex-max | Active | 22m ago | 85.30 | 141181150.00 |
| deepinfra | devstral-small | Never Succeeded (Medium) | 3h ago | 82.00 | 9140540.00 |
| bedrock | nova-pro | Active | 21m ago | 81.00 | 19121380.00 |
| together | llama-3.1-70b | Active | 28d ago | 80.20 | 15116310.00 |
| openai | GPT-5.4-mini | Active | 23m ago | 78.50 | 16111530.00 |
| openai | GPT-5.4-mini-2026-03-17 | Active | 23m ago | 75.70 | 9119590.00 |
| together | mistral-7b | Active | 28d ago | 74.00 | 3589230.00 |
| google | gemini-2.5-flash-lite | Active | 21m ago | 73.70 | 10117540.00 |
| fireworks | mixtral-8x22b | Active | 25m ago | 73.60 | 29111340.00 |
| openai | gpt-3.5-turbo | Active | 22m ago | 73.60 | 4126530.00 |
| openai | gpt-4.1-nano | Active | 22m ago | 70.70 | 18149450.00 |
| google | gemini-2.5-flash | Never Succeeded (Medium) | 21m ago | 64.30 | 61051040.00 |
| openai | gpt-4o | Active | 21m ago | 63.40 | 81421580.00 |
| together | mixtral-8x7b | Active | 21m ago | 60.10 | 8114200.00 |
| together | deepseek-r1 | Active | 21m ago | 57.20 | 1113740.00 |
| fireworks | llama-3.3-70b | Active | 25m ago | 54.70 | 11081600.00 |
| openai | o4-mini-2025-04-16 | Active | 23m ago | 52.50 | 28740.00 |
| openai | gpt-4.1-mini | Active | 22m ago | 51.80 | 15109430.00 |
| together | llama-3.3-70b | Active | 21m ago | 51.80 | 11211190.00 |
| openai | GPT-5-chat-latest | Active | 24m ago | 51.60 | 1382540.00 |
| together | llama-3.2-3b | Active | 19d ago | 51.10 | 51211690.00 |
| anthropic | claude-haiku-4.5 | Active | 30m ago | 49.80 | 373630.00 |
| openai | o4 Mini | Never Succeeded (Medium) | 22m ago | 49.60 | 4770.00 |
| bedrock | llama-3.2-90b | Active | 21m ago | 46.60 | 250380.00 |
| deepinfra | llama-3-8b | Stale (Medium) | 25m ago | 45.10 | 1869320.00 |
| deepinfra | llama-3.1-8b | Stale (Medium) | 25m ago | 41.20 | 378670.00 |
| openai | gpt-4.1 | Active | 22m ago | 41.20 | 1583540.00 |
| bedrock | mistral-large | Active | 21m ago | 40.70 | 247540.00 |
| openai | gpt-4o-mini | Active | 22m ago | 39.70 | 764400.00 |
| bedrock | claude-haiku-4.5 | Active | 21m ago | 39.60 | 3621150.00 |
| google | gemini-2.5-pro | Never Succeeded (Medium) | 21m ago | 39.60 | 2721720.00 |
| deepinfra | llama-3.2-1b | Stale (Medium) | 25m ago | 39.50 | 3100790.00 |
| openai | o3-2025-04-16 | Active | 23m ago | 38.40 | 13680.00 |
| deepinfra | llama-3.2-3b | Stale (Medium) | 25m ago | 38.30 | 299850.00 |
| deepinfra | Qwen 2.5 Coder 32B | Never Succeeded (Medium) | 29m ago | 38.00 | 1843510.00 |
| openai | GPT-5.1-2025-11-13 | Active | 24m ago | 36.80 | 1262760.00 |
| openai | o3 | Active | 23m ago | 35.80 | 12630.00 |
| deepinfra | llama-3.2-90b | Stale (Medium) | 25m ago | 34.50 | 482850.00 |
| deepinfra | llama-2-70b | Stale (Medium) | 25m ago | 33.70 | 357620.00 |
| deepinfra | llama-3-70b | Stale (Medium) | 25m ago | 33.20 | 255640.00 |
| openai | gpt-4-turbo | Active | 3h ago | 32.50 | 152520.00 |
| openai | GPT-5.1 | Active | 22m ago | 32.20 | 2641040.00 |
| bedrock | claude-3-5-haiku | Active | 21m ago | 32.10 | 538650.00 |
| bedrock | claude-3-5-sonnet | Active | 21m ago | 32.10 | 146770.00 |
| bedrock | claude-3-7-sonnet | Active | 21m ago | 31.80 | 242780.00 |
| deepinfra | qwen-2.5-72b | Stale (Medium) | 3h ago | 31.10 | 1461160.00 |
| openai | GPT-5.4-2026-03-05 | Active | 23m ago | 30.40 | 1842680.00 |
| openai | GPT-5.2-2025-12-11 | Active | 24m ago | 30.00 | 1840660.00 |
| openai | GPT-5.1-chat-latest | Active | 24m ago | 29.60 | 1347920.00 |
| openai | GPT-5.4 | Active | 22m ago | 29.50 | 1541810.00 |
| openai | GPT-5.2 | Active | 22m ago | 28.00 | 447930.00 |
| openai | GPT-5.1-codex | Active | 22m ago | 26.80 | 1521250.00 |
| openai | gpt-4 | Active | 21m ago | 26.60 | 447640.00 |
| openai | GPT-5.1-codex-mini | Active | 6h ago | 25.60 | 1501170.00 |
| openai | GPT-5.3-codex | Active | 23m ago | 23.90 | 737950.00 |
| deepinfra | llama-3.1-405b | Stale (Medium) | 25m ago | 23.00 | 139980.00 |
| deepinfra | llama-3.1-70b | Stale (Medium) | 25m ago | 23.00 | 1421080.00 |
| bedrock | claude-sonnet-4.5 | Active | 21m ago | 21.30 | 1281760.00 |
| anthropic | claude-opus-4.5 | Active | 30m ago | 20.50 | 2331800.00 |
| bedrock | claude-opus-4.5 | Active | 21m ago | 19.40 | 1272110.00 |
| anthropic | claude-4-sonnet | Active | 30m ago | 19.30 | 6321920.00 |
| anthropic | Claude Opus 4.1 | Active | 30m ago | 17.60 | 7271510.00 |
| deepinfra | llama-3.3-70b | Never Succeeded (Medium) | 29m ago | 17.60 | 1432760.00 |
| anthropic | claude-4-opus | Active | 29m ago | 17.20 | 5221350.00 |
| openai | gpt-5.2-codex | Active | 9h ago | 15.30 | 1351610.00 |
| deepinfra | llama-3.2-11b | Stale (Medium) | 25m ago | 14.40 | 1611930.00 |
| openai | GPT-5.2-chat-latest | Active | 24m ago | 10.80 | 1241580.00 |
| openai | o1-pro | Likely Deprecated (Medium) | 22m ago | 9.91 | 118430.00 |
| openai | GPT-5.2-pro | Active | 22m ago | 8.84 | 4144770.00 |
| openai | GPT-5-codex | Active | 24m ago | 8.57 | 1171750.00 |
| deepinfra | qwen-3-235b | Never Succeeded (Medium) | 29m ago | 7.22 | 1535230.00 |
| openai | o3-pro | Active | 23m ago | 6.63 | 113490.00 |
| openai | o3-pro-2025-06-10 | Active | 23m ago | 6.36 | 211950.00 |
| openai | GPT-5-pro | Active | 24m ago | 3.77 | 160.00 |
| openai | GPT-5.2-pro-2025-12-11 | Active | 3h ago | 1.91 | 148020.00 |

📈 Time Series 📈