Cloud BenchmarksLocal Benchmarks
API Status

☁️ Cloud Benchmarks ☁️

I run cron jobs to periodically test the token generation speed of different cloud LLM providers. The chart helps visualize the distributions of different speeds, as they can vary somewhat depending on the loads. For readability not all models are shown, but you can see the full results in the table below.

Every provider and model now has a dedicated landing page with narrative insights, SEO-friendly metadata, and structured data for search engines. Click any provider or model in the table to explore performance in depth.

I am working daily to add more providers and models, looking anywhere that does not require purchasing dedicated endpoints for hosting (why some models may appear to be missing). If you have any more suggestions let me know on GitHub!! 😊

Pick A Path In 10 Seconds

Quick recommendations from the latest 7-day benchmark slice. Use one path, jump into full results, then drill into provider/model pages.

Loading 7-day recommendations…

Fastest Models Right Now (updated <24h)

#ModelProviderSpeed
1llama-3.1-8bgroq307 tok/s
2qwen-3-32bgroq213 tok/s
3llama-4-scoutgroq193 tok/s
4llama-3.3-70bgroq193 tok/s
5llama-3.1-8bcerebras171 tok/s

πŸ“Š Speed Distribution πŸ“Š

πŸ“š Full Results πŸ“š

Showing 98 of 98 modelsFlagged statuses: likely_deprecated, deprecated, failing, stale, never_succeeded, disabled
Status
groqllama-3.1-8bActive31m ago307.00130459100.00
groqqwen-3-32bActive31m ago213.002374320.00
groqllama-4-maverickActive9d ago204.001307700.00
groqllama-4-scoutActive31m ago193.0038335250.00
groqllama-3.3-70bActive31m ago193.0068340140.00
cerebrasgpt-oss-120bActive6d ago184.0013801190.00
cerebrasllama-3.1-8bActive32m ago171.0013531310.00
togetherllama-3.1-8bActive4d ago141.003228360.00
groqkimi-k2Active6h ago136.0012215330.00
bedrocknova-microActive22m ago121.0065152270.00
openaio3 MiniNever Succeeded(Medium)29m ago109.0081640.00
bedrockllama-4-maverickActive22m ago107.003139270.00
bedrockllama-4-scoutActive22m ago101.006130280.00
openaio3-mini-2025-01-31Active30m ago101.00151550.00
bedrocknova-liteActive22m ago100.0022132300.00
openaiGPT-5.4-nanoActive29m ago98.9068120360.00
bedrockllama-3.3-70bActive22m ago96.303136290.00
openaiGPT-5.4-miniActive29m ago96.2082111350.00
openaiGPT-5.4-nano-2026-03-17Active30m ago93.9066120440.00
togetherqwen-2.5-7bActive28m ago93.201145500.00
openaiGPT-5.4-mini-2026-03-17Active30m ago92.5042119440.00
bedrocknova-proActive22m ago85.1019121370.00
openaiGPT-5.1-codex-maxActive29m ago82.00111181260.00
deepinframistral-7bStale(Medium)32m ago79.305148610.00
openaio1Active30m ago75.40411410.00
togetherllama-3.1-70bActive21d ago75.1015129330.00
deepinfradevstral-smallNever Succeeded(Medium)32m ago74.809140590.00
openaigpt-3.5-turboActive28m ago74.8013126520.00
googlegemini-2.5-flash-liteActive28m ago72.2010117550.00
openaigpt-4.1-nanoActive29m ago70.209149490.00
fireworksmixtral-8x22bActive31m ago70.1029111390.00
togethermistral-7bActive21d ago70.00690390.00
openaigpt-4oActive28m ago67.8081731490.00
googlegemini-2.5-flashNever Succeeded(Medium)28m ago66.105105980.00
togethermixtral-8x7bActive28m ago60.9014114170.00
togetherllama-3.2-3bActive12d ago55.0051211440.00
fireworksllama-3.3-70bActive31m ago54.6011081660.00
togetherdeepseek-r1Active28m ago54.501113740.00
deepinframixtral-8x22bStale(Medium)28d ago51.701466670.00
togetherllama-3.3-70bActive28m ago51.7011461240.00
openaigpt-4.1-miniActive29m ago51.4015109440.00
anthropicclaude-haiku-4.5Active32m ago51.301973550.00
openaio4-mini-2025-04-16Active29m ago49.6041570.00
openaiGPT-5-chat-latestActive31m ago49.403064680.00
openaio4 MiniNever Succeeded(Medium)29m ago48.804720.00
bedrockllama-3.2-90bActive22m ago46.70251370.00
deepinfrallama-3-8bStale(Medium)31m ago45.001869320.00
deepinfrallama-3.1-8bStale(Medium)31m ago44.40385680.00
openaigpt-4.1Active29m ago41.101083510.00
bedrockmistral-largeActive22m ago40.60247570.00
googlegemini-2.5-proNever Succeeded(Medium)28m ago40.502721720.00
deepinfrallama-3.2-1bStale(Medium)32m ago40.401100860.00
openaigpt-4o-miniActive28m ago39.60764400.00
deepinfrallama-3.2-3bStale(Medium)32m ago39.40299830.00
bedrockclaude-haiku-4.5Active22m ago39.303621190.00
deepinfrallama-2-70bStale(Medium)31m ago34.80357600.00
openaiGPT-5.1-2025-11-13Active30m ago34.502143790.00
deepinfrallama-3.2-90bStale(Medium)32m ago34.30382840.00
deepinfrallama-3-70bStale(Medium)31m ago34.00255650.00
openaio3Active30m ago33.6019560.00
bedrockclaude-3-5-sonnetActive22m ago32.70246650.00
deepinfraqwen-2.5-72bStale(Medium)32m ago32.50150780.00
openaigpt-4-turboActive28m ago32.40752520.00
openaio3-2025-04-16Active30m ago32.4017410.00
deepinfraQwen 2.5 Coder 32BNever Succeeded(Medium)32m ago32.201823530.00
bedrockclaude-3-7-sonnetActive22m ago32.10242770.00
bedrockclaude-3-5-haikuActive22m ago31.70938640.00
openaiGPT-5.4-2026-03-05Active30m ago31.302536720.00
openaiGPT-5.2-2025-12-11Active30m ago31.102240700.00
openaiGPT-5.1Active29m ago29.802571100.00
openaiGPT-5.4Active29m ago27.5015361000.00
openaiGPT-5.2Active29m ago27.20440960.00
openaigpt-4Active28m ago27.20847630.00
openaiGPT-5.1-codexActive29m ago26.401481250.00
openaiGPT-5.1-codex-miniActive29m ago25.401521190.00
openaiGPT-5.1-chat-latestActive30m ago25.1017391100.00
deepinfrallama-3.1-405bStale(Medium)31m ago24.80139830.00
deepinfrallama-3.1-70bStale(Medium)31m ago23.301441090.00
bedrockclaude-sonnet-4.5Active22m ago22.001281680.00
openaiGPT-5.3-codexActive29m ago21.507321200.00
anthropicclaude-opus-4.5Active32m ago20.602331790.00
bedrockclaude-3-opusActive27d ago19.50822890.00
anthropicclaude-4-sonnetActive32m ago19.406311900.00
bedrockclaude-opus-4.5Active22m ago19.301271960.00
anthropicClaude Opus 4.1Active32m ago17.607271570.00
deepinfrallama-3.3-70bNever Succeeded(Medium)32m ago17.501462570.00
anthropicclaude-4-opusActive32m ago17.305221340.00
openaigpt-5.2-codexActive6h ago13.901271710.00
openaio1-proLikely Deprecated(Medium)29m ago9.63118670.00
openaiGPT-5-codexActive31m ago9.091141430.00
openaiGPT-5.2-proActive29m ago8.584144850.00
deepinfrallama-3.2-11bStale(Medium)32m ago8.411602650.00
deepinfraqwen-3-235bNever Succeeded(Medium)32m ago8.401535620.00
openaiGPT-5.2-chat-latestActive30m ago7.404171780.00
openaio3-pro-2025-06-10Active30m ago6.704112150.00
openaio3-proActive30m ago5.923100.00
openaiGPT-5-proActive31m ago3.65250.00
openaiGPT-5.2-pro-2025-12-11Active3h ago1.41127880.00
Lifecycle snapshot
Loading status summary…

πŸ“ˆ Time Series πŸ“ˆ