Cloud BenchmarksLocal Benchmarks
API Status

☁️ Cloud Benchmarks ☁️

I run cron jobs to periodically test the token generation speed of different cloud LLM providers. The chart helps visualize the distributions of different speeds, as they can vary somewhat depending on the loads. For readability not all models are shown, but you can see the full results in the table below.

Every provider and model now has a dedicated landing page with narrative insights, SEO-friendly metadata, and structured data for search engines. Click any provider or model in the table to explore performance in depth.

I am working daily to add more providers and models, looking anywhere that does not require purchasing dedicated endpoints for hosting (why some models may appear to be missing). If you have any more suggestions let me know on GitHub!! 😊

Pick A Path In 10 Seconds

Quick recommendations from the latest 7-day benchmark slice. Use one path, jump into full results, then drill into provider/model pages.

Loading 7-day recommendations…

Fastest Models Right Now (updated <24h)

#ModelProviderSpeed
1llama-3.1-8bgroq287 tok/s
2qwen-3-32bgroq202 tok/s
3llama-4-scoutgroq187 tok/s
4llama-3.1-8bcerebras182 tok/s
5llama-3.3-70bgroq177 tok/s

πŸ“Š Speed Distribution πŸ“Š

πŸ“š Full Results πŸ“š

Showing 94 of 94 modelsFlagged statuses: likely_deprecated, deprecated, failing, stale, never_succeeded, disabled
Status
groqllama-3.1-8bActive3h ago287.00130424110.00
groqllama-4-maverickActive22d ago222.0013021190.00
groqqwen-3-32bActive3h ago202.0011287210.00
groqllama-4-scoutActive3h ago187.0043333250.00
cerebrasllama-3.1-8bActive3h ago182.0013531260.00
groqllama-3.3-70bActive3h ago177.0040340170.00
cerebrasgpt-oss-120bActive19d ago170.0013801570.00
togetherllama-3.1-8bActive17d ago144.0018228250.00
groqkimi-k2Active3h ago126.0012215400.00
bedrocknova-microActive39m ago123.0069152260.00
openaio3 MiniNever Succeeded(Medium)3h ago111.0081690.00
openaio3-mini-2025-01-31Active3h ago106.00151600.00
bedrockllama-4-maverickActive39m ago105.001145530.00
bedrocknova-liteActive39m ago99.3020132300.00
bedrockllama-4-scoutActive39m ago98.303130310.00
bedrockllama-3.3-70bActive39m ago94.502128320.00
openaiGPT-5.4-nanoActive3h ago90.8042129400.00
deepinframistral-7bStale(Medium)3h ago90.4010148480.00
openaiGPT-5.1-codex-maxActive3h ago89.50141171170.00
togetherqwen-2.5-7bActive3h ago88.901141540.00
openaiGPT-5.4-nano-2026-03-17Active3h ago86.6036125460.00
openaio1Active3h ago82.302114740.00
deepinfradevstral-smallNever Succeeded(Medium)3h ago78.309140520.00
bedrocknova-proActive39m ago78.0019118390.00
openaiGPT-5.4-miniActive3h ago77.2016111490.00
fireworksmixtral-8x22bActive3h ago74.9028111320.00
openaiGPT-5.4-mini-2026-03-17Active3h ago74.909119520.00
googlegemini-2.5-flash-liteActive3h ago74.7010117530.00
openaigpt-4.1-nanoActive3h ago74.5020139420.00
openaigpt-3.5-turboActive3h ago73.104125520.00
googlegemini-2.5-flashNever Succeeded(Medium)3h ago64.3061051030.00
togetherdeepseek-r1Active3h ago62.301113750.00
openaigpt-4oActive3h ago59.8051421600.00
togethermixtral-8x7bActive3h ago58.608114190.00
fireworksllama-3.3-70bActive3h ago57.0011081310.00
togetherllama-3.3-70bActive3h ago53.4011211150.00
openaigpt-4.1-miniActive3h ago53.0018109400.00
openaio4-mini-2025-04-16Active3h ago52.2028770.00
openaiGPT-5-chat-latestActive3h ago51.701382550.00
openaio4 MiniNever Succeeded(Medium)3h ago50.804770.00
anthropicclaude-haiku-4.5Active3h ago48.40373670.00
togetherllama-3.2-3bActive25d ago47.3051181860.00
bedrockllama-3.2-90bActive39m ago46.60250380.00
deepinfrallama-3-8bStale(Medium)3h ago45.601869320.00
deepinfraQwen 2.5 Coder 32BNever Succeeded(Medium)3h ago43.501842680.00
openaigpt-4.1Active3h ago42.701585510.00
deepinfrallama-3.2-1bStale(Medium)3h ago42.403100750.00
deepinfrallama-3.2-3bStale(Medium)3h ago41.30399770.00
bedrockmistral-largeActive39m ago40.80247540.00
openaio3-2025-04-16Active3h ago40.009710.00
googlegemini-2.5-proNever Succeeded(Medium)3h ago39.702651660.00
bedrockclaude-haiku-4.5Active40m ago39.103651190.00
openaigpt-4o-miniActive3h ago39.00764420.00
openaio3Active3h ago38.8010690.00
deepinfrallama-3.1-8bStale(Medium)3h ago35.50278770.00
openaiGPT-5.1-2025-11-13Active3h ago33.901062830.00
openaiGPT-5.1Active3h ago33.30264960.00
openaigpt-4-turboActive6d ago32.70149520.00
deepinfrallama-3.2-90bStale(Medium)3h ago32.50482850.00
bedrockclaude-3-5-haikuActive40m ago32.40538660.00
bedrockclaude-3-5-sonnetActive19h ago32.00146790.00
bedrockclaude-3-7-sonnetActive40m ago31.80242800.00
openaiGPT-5.4-2026-03-05Active3h ago30.40842690.00
openaiGPT-5.4Active3h ago30.10945770.00
deepinfrallama-3-70bStale(Medium)3h ago29.90551560.00
deepinfrallama-2-70bStale(Medium)3h ago29.90452600.00
openaiGPT-5.2Active3h ago29.50447830.00
openaiGPT-5.2-2025-12-11Active3h ago29.501643750.00
deepinfraqwen-2.5-72bStale(Medium)3h ago28.601462530.00
openaiGPT-5.1-chat-latestActive3h ago28.50352960.00
openaiGPT-5.1-codexActive3h ago28.401521170.00
openaiGPT-5.1-codex-miniActive3h ago26.301511140.00
openaigpt-4Active3h ago25.90446640.00
openaiGPT-5.3-codexActive3h ago25.40740860.00
deepinfrallama-3.1-405bStale(Medium)3h ago22.101391080.00
deepinfrallama-3.2-11bStale(Medium)3h ago21.701811570.00
anthropicclaude-opus-4.5Active3h ago21.102331680.00
deepinfrallama-3.1-70bStale(Medium)3h ago21.101421080.00
bedrockclaude-sonnet-4.5Active40m ago21.001291810.00
anthropicclaude-4-sonnetActive3h ago18.907321940.00
deepinfrallama-3.3-70bNever Succeeded(Medium)3h ago18.701432260.00
bedrockclaude-opus-4.5Active40m ago18.501272410.00
anthropicClaude Opus 4.1Active3h ago17.707251470.00
anthropicclaude-4-opusActive3h ago17.108221340.00
openaigpt-5.2-codexActive3h ago16.901371450.00
openaiGPT-5.2-chat-latestActive6h ago10.501271530.00
openaio1-proLikely Deprecated(Medium)3h ago9.98118560.00
openaiGPT-5.2-proActive3h ago9.094144640.00
openaiGPT-5-codexActive6h ago8.081171920.00
openaio3-proActive3h ago7.60115280.00
openaio3-pro-2025-06-10Active3h ago7.32214540.00
deepinfraqwen-3-235bNever Succeeded(Medium)6h ago6.851535960.00
openaiGPT-5-proActive3h ago4.00180.00
openaiGPT-5.2-pro-2025-12-11Active3h ago1.92148050.00
Lifecycle snapshot
Loading status summary…

πŸ“ˆ Time Series πŸ“ˆ