Cloud BenchmarksLocal Benchmarks
API Status

☁️ Cloud Benchmarks ☁️

I run cron jobs to periodically test the token generation speed of different cloud LLM providers. The chart helps visualize the distributions of different speeds, as they can vary somewhat depending on the loads. For readability not all models are shown, but you can see the full results in the table below.

Every provider and model now has a dedicated landing page with narrative insights, SEO-friendly metadata, and structured data for search engines. Click any provider or model in the table to explore performance in depth.

I am working daily to add more providers and models, looking anywhere that does not require purchasing dedicated endpoints for hosting (why some models may appear to be missing). If you have any more suggestions let me know on GitHub!! 😊

Pick A Path In 10 Seconds

Quick recommendations from the latest 7-day benchmark slice. Use one path, jump into full results, then drill into provider/model pages.

Loading 7-day recommendations…

Fastest Models Right Now (updated <24h)

#ModelProviderSpeed
1llama-3.1-8bgroq290 tok/s
2qwen-3-32bgroq203 tok/s
3llama-3.1-8bcerebras197 tok/s
4llama-4-scoutgroq189 tok/s
5llama-3.3-70bgroq170 tok/s

πŸ“Š Speed Distribution πŸ“Š

πŸ“š Full Results πŸ“š

Showing 93 of 93 modelsFlagged statuses: likely_deprecated, deprecated, failing, stale, never_succeeded, disabled
Status
groqllama-3.1-8bActive3h ago290.00130424100.00
groqllama-4-maverickActive27d ago246.0073302170.00
groqqwen-3-32bActive3h ago203.0011284200.00
cerebrasgpt-oss-120bActive24d ago197.0013482140.00
cerebrasllama-3.1-8bActive3h ago197.003353760.00
groqllama-4-scoutActive3h ago189.007333280.00
groqllama-3.3-70bActive3h ago170.0040340190.00
togetherllama-3.1-8bActive22d ago149.0041228220.00
groqkimi-k2Active3h ago136.0012211290.00
bedrocknova-microActive36m ago124.0064154260.00
openaio3 MiniNever Succeeded(Medium)3d ago111.0081690.00
bedrockllama-4-maverickActive36m ago105.001145530.00
openaio3-mini-2025-01-31Active3d ago105.00151600.00
bedrocknova-liteActive36m ago98.0020132310.00
bedrockllama-4-scoutActive36m ago98.003130320.00
bedrockllama-3.3-70bActive36m ago93.902128300.00
openaiGPT-5.1-codex-maxActive3d ago91.90141171100.00
openaiGPT-5.4-nanoActive3d ago91.6042134400.00
togetherqwen-2.5-7bActive3h ago87.601138530.00
deepinframistral-7bStale(Medium)3h ago87.005148550.00
openaiGPT-5.4-nano-2026-03-17Active3d ago86.9036125450.00
openaio1Active3d ago81.902114740.00
openaiGPT-5.4-miniActive3d ago76.6016111480.00
bedrocknova-proActive36m ago76.0019118390.00
openaigpt-4.1-nanoActive3d ago75.8018139390.00
deepinfradevstral-smallNever Succeeded(Medium)3h ago74.5010140550.00
fireworksmixtral-8x22bActive3h ago74.5028111330.00
googlegemini-2.5-flash-liteActive3h ago73.8018117520.00
openaiGPT-5.4-mini-2026-03-17Active3d ago73.809119520.00
openaigpt-3.5-turboActive3d ago72.804125520.00
togetherdeepseek-r1Active3h ago65.505109590.00
googlegemini-2.5-flashNever Succeeded(Medium)3h ago65.3061051010.00
togethermixtral-8x7bActive3h ago60.408114190.00
openaigpt-4oActive3d ago58.7051421610.00
fireworksllama-3.3-70bActive3h ago57.9011081180.00
openaigpt-4.1-miniActive3d ago53.9018109390.00
openaiGPT-5-chat-latestActive3d ago52.401383550.00
openaio4-mini-2025-04-16Active3d ago52.2028770.00
togetherllama-3.3-70bActive3h ago51.602118910.00
openaio4 MiniNever Succeeded(Medium)3d ago51.004770.00
deepinfraQwen 2.5 Coder 32BNever Succeeded(Medium)3h ago49.801841720.00
anthropicclaude-haiku-4.5Active3h ago48.00373680.00
bedrockllama-3.2-90bActive36m ago46.60250380.00
deepinfrallama-3-8bStale(Medium)3h ago45.901869320.00
openaigpt-4.1Active3d ago44.001585510.00
bedrockmistral-largeActive36m ago41.10247510.00
googlegemini-2.5-proNever Succeeded(Medium)3h ago40.407651520.00
openaio3-2025-04-16Active3d ago40.009710.00
bedrockclaude-haiku-4.5Active36m ago39.103651190.00
openaigpt-4o-miniActive3d ago39.10764430.00
openaio3Active3d ago38.809690.00
deepinfrallama-3.2-1bStale(Medium)3h ago37.10399740.00
deepinfrallama-3.1-8bStale(Medium)3h ago37.00278660.00
deepinfrallama-3.2-3bStale(Medium)3h ago36.50399750.00
openaiGPT-5.1Active3d ago34.20264930.00
openaiGPT-5.1-2025-11-13Active3d ago33.801062840.00
openaigpt-4-turboActive11d ago33.00149520.00
bedrockclaude-3-5-haikuActive36m ago32.60342650.00
bedrockclaude-3-7-sonnetActive36m ago32.10242790.00
bedrockclaude-3-5-sonnetActive5d ago31.80146830.00
deepinfrallama-3.2-11bStale(Medium)3h ago30.201811210.00
deepinfrallama-3.2-90bStale(Medium)3h ago30.10382830.00
openaiGPT-5.4Active3d ago30.10945760.00
openaiGPT-5.4-2026-03-05Active3d ago30.10842700.00
openaiGPT-5.2Active3d ago29.90447820.00
openaiGPT-5.2-2025-12-11Active3d ago29.501643770.00
deepinfraqwen-2.5-72bStale(Medium)3h ago29.101462450.00
openaiGPT-5.1-codexActive3d ago29.101521150.00
deepinfrallama-3-70bStale(Medium)3h ago28.80351590.00
deepinfrallama-2-70bStale(Medium)3h ago28.70452600.00
openaiGPT-5.1-chat-latestActive3d ago28.40352970.00
openaiGPT-5.1-codex-miniActive3d ago26.102511210.00
openaiGPT-5.3-codexActive3d ago25.70740840.00
openaigpt-4Active3d ago25.60446650.00
anthropicclaude-opus-4.5Active3h ago21.804371510.00
deepinfrallama-3.1-70bStale(Medium)3h ago21.80142700.00
bedrockclaude-sonnet-4.5Active36m ago21.001291780.00
deepinfrallama-3.1-405bStale(Medium)3h ago20.901391160.00
deepinfrallama-3.3-70bNever Succeeded(Medium)3h ago19.101421740.00
anthropicclaude-4-sonnetActive3h ago18.807321970.00
bedrockclaude-opus-4.5Active36m ago18.101272480.00
anthropicClaude Opus 4.1Active3h ago18.007251430.00
openaigpt-5.2-codexActive3d ago17.602371370.00
anthropicclaude-4-opusActive3h ago17.108241320.00
openaiGPT-5.2-chat-latestActive3d ago10.701271520.00
openaio1-proLikely Deprecated(Medium)3d ago9.92118780.00
openaiGPT-5.2-proActive3d ago9.105144790.00
openaiGPT-5-codexActive3d ago8.131231940.00
deepinfraqwen-3-235bNever Succeeded(Medium)3h ago7.761535280.00
openaio3-proActive3d ago7.55115360.00
openaio3-pro-2025-06-10Active3d ago7.46214470.00
openaiGPT-5-proActive3d ago4.04180.00
openaiGPT-5.2-pro-2025-12-11Active3d ago1.90148040.00
Lifecycle snapshot
Loading status summary…

πŸ“ˆ Time Series πŸ“ˆ