☁️ Cloud Benchmarks ☁️

I run cron jobs that periodically test the token generation speed of different cloud LLM providers. The chart visualizes the distribution of speeds, which can vary somewhat with provider load. For readability, not all models are shown in the chart, but the full results are in the table below.
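
As a rough illustration of what one benchmark run might measure, here is a minimal Python sketch. It is not the code behind this site: `measure_tokens_per_second`, `fake_provider`, and `summarize` are hypothetical names, and the stand-in generator replaces what would be a real streaming API call to a provider.

```python
import statistics
import time
from typing import Callable, Iterable


def measure_tokens_per_second(
    stream_tokens: Callable[[], Iterable[str]],
    clock: Callable[[], float] = time.perf_counter,
) -> float:
    """Count the tokens from one streamed completion and divide by elapsed time."""
    start = clock()
    count = 0
    for _ in stream_tokens():
        count += 1
    elapsed = clock() - start
    if elapsed <= 0:
        raise ValueError("elapsed time must be positive")
    return count / elapsed


def fake_provider() -> Iterable[str]:
    # Assumption: stand-in for a real streaming call; yields 50 tokens.
    for i in range(50):
        yield f"tok{i}"


def summarize(speeds: list[float]) -> dict[str, float]:
    """Summarize repeated runs, since speeds vary with provider load."""
    return {
        "min": min(speeds),
        "median": statistics.median(speeds),
        "max": max(speeds),
    }
```

Calling `measure_tokens_per_second(fake_provider)` on a schedule and passing the collected values to `summarize` would give the kind of spread the chart is meant to show. The `clock` parameter exists only so the timing can be replaced in tests.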

Every provider and model now has a dedicated landing page with narrative insights, SEO-friendly metadata, and structured data for search engines. Click any provider or model in the table to explore performance in depth.

I am working daily to add more providers and models, looking at any service that does not require purchasing dedicated endpoints for hosting, which is why some models may appear to be missing. If you have suggestions, let me know on GitHub! 😊

Pick A Path In 10 Seconds

Quick recommendations from the latest 7-day benchmark slice. Use one path, jump into full results, then drill into provider/model pages.

Fastest Models Right Now (updated <24h)

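| # | Model | Provider | Speed |
|---|-------|----------|-------|
| 1 | llama-3.1-8b | groq | 287 tok/s |
| 2 | qwen-3-32b | groq | 202 tok/s |
| 3 | llama-3.1-8b | cerebras | 191 tok/s |
| 4 | llama-4-scout | groq | 186 tok/s |
| 5 | llama-3.3-70b | groq | 175 tok/s |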
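
The ranking above appears to skip entries whose latest result is older than a day: in the full results, groq llama-4-maverick (224 tok/s, 24d ago) and cerebras gpt-oss-120b (175 tok/s, 21d ago) are absent from the top five. A small Python sketch of that selection follows. The 24-hour cutoff is an assumption inferred from the "(updated <24h)" heading, and `fastest_fresh` is a hypothetical helper, not the site's actual logic.

```python
from datetime import timedelta

# (provider, model, speed in tok/s, age of latest result), copied from the results table
ROWS = [
    ("groq", "llama-3.1-8b", 287.0, timedelta(minutes=35)),
    ("groq", "llama-4-maverick", 224.0, timedelta(days=24)),
    ("groq", "qwen-3-32b", 202.0, timedelta(minutes=35)),
    ("cerebras", "llama-3.1-8b", 191.0, timedelta(minutes=36)),
    ("groq", "llama-4-scout", 186.0, timedelta(minutes=35)),
    ("groq", "llama-3.3-70b", 175.0, timedelta(minutes=35)),
    ("cerebras", "gpt-oss-120b", 175.0, timedelta(days=21)),
]


def fastest_fresh(rows, max_age=timedelta(hours=24), limit=5):
    """Keep rows updated within max_age, then return the fastest `limit` of them."""
    fresh = [row for row in rows if row[3] < max_age]
    # sorted() is stable, so rows with equal speed keep their input order.
    return sorted(fresh, key=lambda row: row[2], reverse=True)[:limit]


top = fastest_fresh(ROWS)
```

With these seven rows, `top` holds the same five provider/model pairs as the table, in the same order.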

📊 Speed Distribution 📊

📚 Full Results 📚

Showing 94 of 94 models

Flagged statuses: likely_deprecated, deprecated, failing, stale, never_succeeded, disabled
| Provider | Model | Status | Last run | Speed (tok/s) | Other columns (unsplit) |
|----------|-------|--------|----------|---------------|-------------------------|
| groq | llama-3.1-8b | Active | 35m ago | 287.00 | 130424100.00 |
| groq | llama-4-maverick | Active | 24d ago | 224.00 | 13021560.00 |
| groq | qwen-3-32b | Active | 35m ago | 202.00 | 11284210.00 |
| cerebras | llama-3.1-8b | Active | 36m ago | 191.00 | 13531020.00 |
| groq | llama-4-scout | Active | 35m ago | 186.00 | 7333290.00 |
| groq | llama-3.3-70b | Active | 35m ago | 175.00 | 40340180.00 |
| cerebras | gpt-oss-120b | Active | 21d ago | 175.00 | 13481740.00 |
| together | llama-3.1-8b | Active | 19d ago | 148.00 | 41228220.00 |
| groq | kimi-k2 | Active | 35m ago | 129.00 | 12211340.00 |
| bedrock | nova-micro | Active | 16m ago | 124.00 | 64152260.00 |
| openai | o3 Mini | Never Succeeded (Medium) | 32m ago | 110.00 | 81690.00 |
| bedrock | llama-4-maverick | Active | 16m ago | 105.00 | 1145530.00 |
| openai | o3-mini-2025-01-31 | Active | 33m ago | 105.00 | 151600.00 |
| bedrock | nova-lite | Active | 16m ago | 98.80 | 20132300.00 |
| bedrock | llama-4-scout | Active | 16m ago | 98.20 | 3130310.00 |
| bedrock | llama-3.3-70b | Active | 16m ago | 94.10 | 2128310.00 |
| openai | GPT-5.4-nano | Active | 33m ago | 91.60 | 42134400.00 |
| openai | GPT-5.1-codex-max | Active | 33m ago | 91.00 | 141171110.00 |
| deepinfra | mistral-7b | Stale (Medium) | 35m ago | 89.10 | 10148490.00 |
| together | qwen-2.5-7b | Active | 31m ago | 88.10 | 1139530.00 |
| openai | GPT-5.4-nano-2026-03-17 | Active | 34m ago | 86.90 | 36125450.00 |
| openai | o1 | Active | 33m ago | 81.90 | 2114740.00 |
| deepinfra | devstral-small | Never Succeeded (Medium) | 36m ago | 77.30 | 9140530.00 |
| bedrock | nova-pro | Active | 16m ago | 76.70 | 19118390.00 |
| openai | GPT-5.4-mini | Active | 33m ago | 76.60 | 16111480.00 |
| google | gemini-2.5-flash-lite | Active | 31m ago | 74.80 | 10117530.00 |
| openai | gpt-4.1-nano | Active | 32m ago | 74.70 | 18139420.00 |
| fireworks | mixtral-8x22b | Active | 35m ago | 74.70 | 28111330.00 |
| openai | GPT-5.4-mini-2026-03-17 | Active | 34m ago | 73.80 | 9119520.00 |
| openai | gpt-3.5-turbo | Active | 32m ago | 73.10 | 4125520.00 |
| together | deepseek-r1 | Active | 32m ago | 64.70 | 5113570.00 |
| google | gemini-2.5-flash | Never Succeeded (Medium) | 31m ago | 64.70 | 61051020.00 |
| openai | gpt-4o | Active | 32m ago | 59.30 | 51421570.00 |
| together | mixtral-8x7b | Active | 31m ago | 58.30 | 8114190.00 |
| fireworks | llama-3.3-70b | Active | 35m ago | 57.40 | 11081270.00 |
| openai | gpt-4.1-mini | Active | 32m ago | 53.50 | 18109390.00 |
| openai | GPT-5-chat-latest | Active | 35m ago | 52.40 | 1383550.00 |
| openai | o4-mini-2025-04-16 | Active | 33m ago | 52.20 | 28770.00 |
| together | llama-3.3-70b | Active | 32m ago | 52.00 | 2121920.00 |
| openai | o4 Mini | Never Succeeded (Medium) | 32m ago | 50.60 | 4770.00 |
| together | llama-3.2-3b | Active | 27d ago | 50.50 | 101091480.00 |
| anthropic | claude-haiku-4.5 | Active | 36m ago | 48.20 | 373670.00 |
| bedrock | llama-3.2-90b | Active | 16m ago | 46.60 | 250380.00 |
| deepinfra | Qwen 2.5 Coder 32B | Never Succeeded (Medium) | 36m ago | 46.00 | 1842110.00 |
| deepinfra | llama-3-8b | Stale (Medium) | 35m ago | 45.50 | 1869320.00 |
| openai | gpt-4.1 | Active | 32m ago | 43.60 | 1585510.00 |
| bedrock | mistral-large | Active | 16m ago | 41.00 | 247520.00 |
| google | gemini-2.5-pro | Never Succeeded (Medium) | 31m ago | 40.50 | 7651520.00 |
| deepinfra | llama-3.2-1b | Stale (Medium) | 35m ago | 40.20 | 3100750.00 |
| openai | o3-2025-04-16 | Active | 33m ago | 40.00 | 9710.00 |
| deepinfra | llama-3.2-3b | Stale (Medium) | 35m ago | 39.50 | 399750.00 |
| openai | gpt-4o-mini | Active | 32m ago | 39.00 | 764430.00 |
| openai | o3 | Active | 33m ago | 38.80 | 9690.00 |
| bedrock | claude-haiku-4.5 | Active | 17m ago | 38.70 | 3651210.00 |
| deepinfra | llama-3.1-8b | Stale (Medium) | 35m ago | 35.70 | 278760.00 |
| openai | GPT-5.1 | Active | 32m ago | 33.90 | 264940.00 |
| openai | GPT-5.1-2025-11-13 | Active | 34m ago | 33.80 | 1062840.00 |
| openai | gpt-4-turbo | Active | 8d ago | 33.10 | 149520.00 |
| bedrock | claude-3-5-haiku | Active | 17m ago | 32.50 | 538660.00 |
| bedrock | claude-3-5-sonnet | Active | 2d ago | 31.90 | 146800.00 |
| bedrock | claude-3-7-sonnet | Active | 17m ago | 31.80 | 242800.00 |
| deepinfra | llama-3.2-90b | Stale (Medium) | 35m ago | 31.30 | 482820.00 |
| openai | GPT-5.4 | Active | 33m ago | 30.10 | 945760.00 |
| openai | GPT-5.4-2026-03-05 | Active | 34m ago | 30.10 | 842700.00 |
| openai | GPT-5.2 | Active | 33m ago | 29.70 | 447820.00 |
| openai | GPT-5.2-2025-12-11 | Active | 34m ago | 29.50 | 1643770.00 |
| deepinfra | llama-3-70b | Stale (Medium) | 35m ago | 28.90 | 451570.00 |
| deepinfra | llama-2-70b | Stale (Medium) | 35m ago | 28.80 | 452630.00 |
| openai | GPT-5.1-codex | Active | 32m ago | 28.80 | 1521160.00 |
| deepinfra | qwen-2.5-72b | Stale (Medium) | 35m ago | 28.70 | 1462470.00 |
| openai | GPT-5.1-chat-latest | Active | 34m ago | 28.40 | 352970.00 |
| openai | GPT-5.1-codex-mini | Active | 33m ago | 26.00 | 1511210.00 |
| openai | gpt-4 | Active | 32m ago | 25.80 | 446650.00 |
| openai | GPT-5.3-codex | Active | 33m ago | 25.70 | 740840.00 |
| deepinfra | llama-3.2-11b | Stale (Medium) | 35m ago | 25.10 | 1811440.00 |
| deepinfra | llama-3.1-405b | Stale (Medium) | 35m ago | 21.70 | 1391120.00 |
| anthropic | claude-opus-4.5 | Active | 36m ago | 21.30 | 4311560.00 |
| bedrock | claude-sonnet-4.5 | Active | 17m ago | 20.90 | 1291810.00 |
| deepinfra | llama-3.1-70b | Stale (Medium) | 35m ago | 20.60 | 1421080.00 |
| anthropic | claude-4-sonnet | Active | 36m ago | 18.90 | 7321940.00 |
| deepinfra | llama-3.3-70b | Never Succeeded (Medium) | 36m ago | 18.70 | 1432210.00 |
| bedrock | claude-opus-4.5 | Active | 17m ago | 18.20 | 1272460.00 |
| anthropic | Claude Opus 4.1 | Active | 36m ago | 17.80 | 7251460.00 |
| openai | gpt-5.2-codex | Active | 33m ago | 17.20 | 1371420.00 |
| anthropic | claude-4-opus | Active | 36m ago | 17.10 | 8231330.00 |
| openai | GPT-5.2-chat-latest | Active | 34m ago | 10.70 | 1271520.00 |
| openai | o1-pro | Likely Deprecated (Medium) | 32m ago | 9.90 | 118700.00 |
| openai | GPT-5.2-pro | Active | 33m ago | 9.11 | 4144540.00 |
| openai | GPT-5-codex | Active | 15h ago | 8.13 | 1231940.00 |
| openai | o3-pro | Active | 33m ago | 7.55 | 115360.00 |
| openai | o3-pro-2025-06-10 | Active | 33m ago | 7.46 | 214470.00 |
| deepinfra | qwen-3-235b | Never Succeeded (Medium) | 36m ago | 7.05 | 1535820.00 |
| openai | GPT-5-pro | Active | 35m ago | 4.04 | 180.00 |
| openai | GPT-5.2-pro-2025-12-11 | Active | 6h ago | 1.90 | 148040.00 |

📈 Time Series 📈