☁️ Cloud Benchmarks ☁️

I run cron jobs to periodically test the token generation speed of different cloud LLM providers. The chart visualizes the distribution of speeds for each model, which can vary depending on provider load. For readability, not all models are shown in the chart, but the full results are in the table below.
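As a rough illustration of what one benchmark run might compute, here is a minimal sketch of a tokens-per-second measurement and a summary over repeated runs. The actual harness behind this page is not shown, so every name here (`tokens_per_second`, `time_generation`, `summarize`, and the `generate` callable) is hypothetical, and the choice of min/median/max as the summary is an assumption.

```python
import statistics
import time


def tokens_per_second(token_count: int, elapsed_seconds: float) -> float:
    """Throughput of one run: generated tokens divided by wall-clock seconds."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return token_count / elapsed_seconds


def time_generation(generate, prompt: str) -> float:
    """Time one call and return its throughput in tok/s.

    `generate` is a hypothetical callable that sends the prompt to a provider
    and returns the number of tokens it generated.
    """
    start = time.perf_counter()
    token_count = generate(prompt)
    elapsed = time.perf_counter() - start
    return tokens_per_second(token_count, elapsed)


def summarize(speeds: list[float]) -> dict[str, float]:
    """Summarize repeated runs so that load-dependent spread stays visible."""
    return {
        "min": min(speeds),
        "median": statistics.median(speeds),
        "max": max(speeds),
    }
```

For example, a run that produced 604 tokens in 2.0 seconds works out to 302 tok/s. Keeping a spread rather than a single average is one way to reflect that speeds shift with provider load.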

Every provider and model now has a dedicated landing page with narrative insights, SEO-friendly metadata, and structured data for search engines. Click any provider or model in the table to explore performance in depth.

I am working daily to add more providers and models, looking anywhere that does not require purchasing dedicated endpoints for hosting (which is why some models may appear to be missing). If you have any suggestions, let me know on GitHub! 😊

Pick A Path In 10 Seconds

Quick recommendations from the latest 7-day benchmark slice. Use one path, jump into full results, then drill into provider/model pages.

Fastest Models Right Now (updated <24h)

#  Model          Provider  Speed
1  llama-3.1-8b   groq      302 tok/s
2  qwen-3-32b     groq      206 tok/s
3  llama-3.3-70b  groq      188 tok/s
4  llama-4-scout  groq      186 tok/s
5  llama-3.1-8b   cerebras  173 tok/s
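The "updated <24h" filter explains why this list skips some faster entries from the full results, such as groq llama-4-maverick (204 tok/s, last updated 13d ago). A minimal sketch of that selection, assuming the ranking is simply "newest 24 hours, sorted by speed", follows. The page's real logic is not shown; `age_minutes` and `fastest_recent` are hypothetical names, and the rows are copied from the full results table.

```python
import re

# (provider, model, updated, tok/s), copied from the full results table.
ROWS = [
    ("groq", "llama-3.1-8b", "3h ago", 302.0),
    ("groq", "qwen-3-32b", "3h ago", 206.0),
    ("groq", "llama-4-maverick", "13d ago", 204.0),
    ("cerebras", "gpt-oss-120b", "10d ago", 188.0),
    ("groq", "llama-3.3-70b", "3h ago", 188.0),
    ("groq", "llama-4-scout", "6h ago", 186.0),
    ("cerebras", "llama-3.1-8b", "3h ago", 173.0),
    ("together", "llama-3.1-8b", "8d ago", 143.0),
]

# Minutes per unit used in the relative "updated" labels.
_UNIT_MINUTES = {"m": 1, "h": 60, "d": 24 * 60}


def age_minutes(updated: str) -> int:
    """Convert a label such as '30m ago' or '13d ago' into minutes."""
    match = re.fullmatch(r"(\d+)([mhd]) ago", updated)
    if match is None:
        raise ValueError(f"unrecognized age label: {updated!r}")
    return int(match.group(1)) * _UNIT_MINUTES[match.group(2)]


def fastest_recent(rows, max_age_minutes=24 * 60, limit=5):
    """Keep rows updated within the window, then rank by speed, fastest first."""
    fresh = [row for row in rows if age_minutes(row[2]) < max_age_minutes]
    return sorted(fresh, key=lambda row: row[3], reverse=True)[:limit]


for rank, (provider, model, _, speed) in enumerate(fastest_recent(ROWS), start=1):
    print(rank, model, provider, f"{speed:.0f} tok/s")
```

With these rows, the sketch yields the same five provider/model pairs, in the same order, as the list above.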

📊 Speed Distribution 📊

📚 Full Results 📚

Showing 96 of 96 models
Flagged statuses: likely_deprecated, deprecated, failing, stale, never_succeeded, disabled
Provider   Model                    Status                      Updated  Tok/s   Other metrics (unsplit)
groq       llama-3.1-8b             Active                      3h ago   302.00  130459100.00
groq       qwen-3-32b               Active                      3h ago   206.00  15287200.00
groq       llama-4-maverick         Active                      13d ago  204.00  1307790.00
cerebras   gpt-oss-120b             Active                      10d ago  188.00  13801210.00
groq       llama-3.3-70b            Active                      3h ago   188.00  68340150.00
groq       llama-4-scout            Active                      6h ago   186.00  38335260.00
cerebras   llama-3.1-8b             Active                      3h ago   173.00  13531320.00
together   llama-3.1-8b             Active                      8d ago   143.00  3228360.00
groq       kimi-k2                  Active                      6h ago   133.00  12215340.00
bedrock    nova-micro               Active                      30m ago  123.00  65152260.00
openai     o3-mini-2025-01-31       Active                      3h ago   114.00  151600.00
openai     o3 Mini                  Never Succeeded (Medium)    3h ago   110.00  81690.00
bedrock    llama-4-maverick         Active                      30m ago  106.00  1145440.00
bedrock    nova-lite                Active                      30m ago  100.00  22132300.00
bedrock    llama-4-scout            Active                      30m ago  100.00  3130290.00
bedrock    llama-3.3-70b            Active                      30m ago  96.70   3136310.00
openai     o1                       Active                      3h ago   94.00   411470.00
together   qwen-2.5-7b              Active                      3h ago   93.00   1145500.00
openai     GPT-5.4-nano             Active                      3h ago   92.80   43129410.00
openai     GPT-5.4-nano-2026-03-17  Active                      3h ago   91.40   36125450.00
deepinfra  mistral-7b               Stale (Medium)              3h ago   86.40   5148550.00
openai     GPT-5.1-codex-max        Active                      3h ago   84.40   111181250.00
bedrock    nova-pro                 Active                      30m ago  83.20   19121370.00
deepinfra  devstral-small           Never Succeeded (Medium)    3h ago   82.00   9140550.00
openai     GPT-5.4-mini             Active                      3h ago   81.30   16111550.00
openai     GPT-5.4-mini-2026-03-17  Active                      3h ago   78.80   9119610.00
together   llama-3.1-70b            Active                      25d ago  76.00   15129310.00
openai     gpt-3.5-turbo            Active                      3h ago   74.40   4126520.00
google     gemini-2.5-flash-lite    Active                      3h ago   73.10   10117540.00
openai     gpt-4.1-nano             Active                      3h ago   71.40   9149480.00
fireworks  mixtral-8x22b            Active                      3h ago   71.00   29111370.00
together   mistral-7b               Active                      25d ago  69.00   690400.00
openai     gpt-4o                   Active                      3h ago   66.60   81421520.00
google     gemini-2.5-flash         Never Succeeded (Medium)    3h ago   65.20   51051000.00
together   mixtral-8x7b             Active                      3h ago   61.70   14114170.00
together   deepseek-r1              Active                      3h ago   56.80   1113720.00
together   llama-3.2-3b             Active                      16d ago  55.50   51211440.00
fireworks  llama-3.3-70b            Active                      3h ago   53.90   11081650.00
openai     o4-mini-2025-04-16       Active                      3h ago   53.60   28740.00
openai     GPT-5-chat-latest        Active                      3h ago   52.70   1782520.00
openai     gpt-4.1-mini             Active                      3h ago   52.50   15109430.00
together   llama-3.3-70b            Active                      3h ago   51.30   11211240.00
anthropic  claude-haiku-4.5         Active                      3h ago   50.30   373640.00
openai     o4 Mini                  Never Succeeded (Medium)    3h ago   49.70   4770.00
bedrock    llama-3.2-90b            Active                      30m ago  46.60   250370.00
deepinfra  llama-3-8b               Stale (Medium)              3h ago   44.90   1869320.00
deepinfra  llama-3.1-8b             Stale (Medium)              3h ago   42.60   381690.00
openai     gpt-4.1                  Active                      3h ago   41.80   1583520.00
bedrock    mistral-large            Active                      30m ago  40.70   247550.00
google     gemini-2.5-pro           Never Succeeded (Medium)    3h ago   40.40   2721720.00
bedrock    claude-haiku-4.5         Active                      30m ago  40.20   3621130.00
openai     GPT-5.1-2025-11-13       Active                      3h ago   40.10   1962700.00
openai     gpt-4o-mini              Active                      3h ago   40.00   764390.00
deepinfra  llama-3.2-1b             Stale (Medium)              3h ago   39.90   3100790.00
openai     o3-2025-04-16            Active                      3h ago   39.60   13680.00
deepinfra  llama-3.2-3b             Stale (Medium)              3h ago   38.90   299830.00
openai     o3                       Active                      3h ago   38.40   12630.00
deepinfra  Qwen 2.5 Coder 32B       Never Succeeded (Medium)    3h ago   36.20   1843500.00
deepinfra  llama-3.2-90b            Stale (Medium)              3h ago   34.70   382880.00
deepinfra  llama-2-70b              Stale (Medium)              3h ago   34.40   357600.00
deepinfra  llama-3-70b              Stale (Medium)              3h ago   34.00   255640.00
bedrock    claude-3-5-sonnet        Active                      30m ago  32.90   146590.00
openai     gpt-4-turbo              Active                      3h ago   32.50   152530.00
deepinfra  qwen-2.5-72b             Stale (Medium)              3h ago   32.20   150780.00
bedrock    claude-3-7-sonnet        Active                      30m ago  32.20   242770.00
bedrock    claude-3-5-haiku         Active                      30m ago  32.10   938640.00
openai     GPT-5.1                  Active                      3h ago   31.50   2641080.00
openai     GPT-5.4-2026-03-05       Active                      3h ago   30.90   1840690.00
openai     GPT-5.2-2025-12-11       Active                      3h ago   30.70   2240660.00
openai     GPT-5.4                  Active                      3h ago   29.00   1540860.00
openai     GPT-5.2                  Active                      3h ago   27.90   447930.00
openai     GPT-5.1-codex            Active                      3h ago   27.40   1521200.00
openai     gpt-4                    Active                      3h ago   27.20   847630.00
openai     GPT-5.1-chat-latest      Active                      3h ago   27.10   1341990.00
openai     GPT-5.1-codex-mini       Active                      3h ago   25.60   1521170.00
deepinfra  llama-3.1-405b           Stale (Medium)              3h ago   24.00   139880.00
deepinfra  llama-3.1-70b            Stale (Medium)              3h ago   23.40   1421100.00
openai     GPT-5.3-codex            Active                      3h ago   23.40   7361020.00
bedrock    claude-sonnet-4.5        Active                      30m ago  21.80   1281680.00
anthropic  claude-opus-4.5          Active                      3h ago   20.70   2331770.00
bedrock    claude-opus-4.5          Active                      30m ago  19.60   1271990.00
anthropic  claude-4-sonnet          Active                      3h ago   19.20   6321950.00
anthropic  Claude Opus 4.1          Active                      3h ago   17.50   7271540.00
deepinfra  llama-3.3-70b            Never Succeeded (Medium)    3h ago   17.50   1462670.00
anthropic  claude-4-opus            Active                      3h ago   17.10   5221360.00
openai     gpt-5.2-codex            Active                      3h ago   14.90   1351630.00
deepinfra  llama-3.2-11b            Stale (Medium)              3h ago   11.50   1602240.00
openai     o1-pro                   Likely Deprecated (Medium)  3h ago   9.89    118550.00
openai     GPT-5.2-chat-latest      Active                      3h ago   9.73    2241640.00
openai     GPT-5.2-pro              Active                      3h ago   8.75    4144820.00
openai     GPT-5-codex              Active                      6h ago   7.72    1141580.00
deepinfra  qwen-3-235b              Never Succeeded (Medium)    9h ago   7.71    1535370.00
openai     o3-pro                   Active                      3h ago   6.59    213800.00
openai     o3-pro-2025-06-10        Active                      3h ago   6.24    3111570.00
openai     GPT-5-pro                Active                      3h ago   3.84    160.00
openai     GPT-5.2-pro-2025-12-11   Active                      3h ago   1.86    147710.00
Lifecycle snapshot

📈 Time Series 📈