Cloud BenchmarksLocal Benchmarks
API Status

☁️ Cloud Benchmarks ☁️

I run cron jobs to periodically test the token generation speed of different cloud LLM providers. The chart helps visualize the distributions of different speeds, as they can vary somewhat depending on the loads. For readability not all models are shown, but you can see the full results in the table below.

Every provider and model now has a dedicated landing page with narrative insights, SEO-friendly metadata, and structured data for search engines. Click any provider or model in the table to explore performance in depth.

I am working daily to add more providers and models, looking anywhere that does not require purchasing dedicated endpoints for hosting (why some models may appear to be missing). If you have any more suggestions let me know on GitHub!! 😊

Pick A Path In 10 Seconds

Quick recommendations from the latest 7-day benchmark slice. Use one path, jump into full results, then drill into provider/model pages.

Loading 7-day recommendations…

Fastest Models Right Now (updated <24h)

#ModelProviderSpeed
1llama-3.1-8bgroq291 tok/s
2qwen-3-32bgroq205 tok/s
3llama-4-scoutgroq189 tok/s
4llama-3.3-70bgroq181 tok/s
5llama-3.1-8bcerebras177 tok/s

πŸ“Š Speed Distribution πŸ“Š

πŸ“š Full Results πŸ“š

Showing 94 of 94 modelsFlagged statuses: likely_deprecated, deprecated, failing, stale, never_succeeded, disabled
Status
groqllama-3.1-8bActive38m ago291.00130433100.00
groqllama-4-maverickActive19d ago225.001307920.00
groqqwen-3-32bActive38m ago205.0015287210.00
groqllama-4-scoutActive38m ago189.0055333240.00
groqllama-3.3-70bActive38m ago181.0040340170.00
cerebrasgpt-oss-120bActive16d ago179.0013801390.00
cerebrasllama-3.1-8bActive39m ago177.0013531330.00
togetherllama-3.1-8bActive14d ago145.003228410.00
groqkimi-k2Active38m ago127.0012215370.00
bedrocknova-microActive28m ago124.0069152260.00
openaio3 MiniNever Succeeded(Medium)36m ago111.0081690.00
openaio3-mini-2025-01-31Active36m ago106.00151600.00
bedrockllama-4-maverickActive28m ago105.001145520.00
bedrocknova-liteActive28m ago100.0020132300.00
bedrockllama-4-scoutActive28m ago99.003130300.00
bedrockllama-3.3-70bActive28m ago95.902134310.00
openaiGPT-5.4-nanoActive36m ago91.5042129410.00
togetherqwen-2.5-7bActive35m ago89.501141530.00
deepinframistral-7bStale(Medium)38m ago89.3010148490.00
openaiGPT-5.1-codex-maxActive36m ago88.10141171170.00
openaiGPT-5.4-nano-2026-03-17Active37m ago87.2036125460.00
openaio1Active36m ago84.30211470.00
deepinfradevstral-smallNever Succeeded(Medium)39m ago80.509140530.00
bedrocknova-proActive28m ago80.3019118380.00
openaiGPT-5.4-miniActive36m ago78.4016111500.00
googlegemini-2.5-flash-liteActive35m ago74.8010117530.00
openaiGPT-5.4-mini-2026-03-17Active37m ago74.609119540.00
fireworksmixtral-8x22bActive38m ago73.8029111330.00
openaigpt-3.5-turboActive35m ago73.604125520.00
openaigpt-4.1-nanoActive35m ago73.3020139440.00
googlegemini-2.5-flashNever Succeeded(Medium)35m ago64.3061051040.00
openaigpt-4oActive35m ago62.0051421600.00
togetherdeepseek-r1Active35m ago60.501113720.00
togethermixtral-8x7bActive35m ago59.208114200.00
fireworksllama-3.3-70bActive38m ago56.8011081350.00
togetherllama-3.3-70bActive35m ago53.5011211070.00
openaigpt-4.1-miniActive35m ago52.9018109410.00
openaiGPT-5-chat-latestActive38m ago52.401382540.00
openaio4-mini-2025-04-16Active36m ago52.1028770.00
openaio4 MiniNever Succeeded(Medium)36m ago50.404770.00
togetherllama-3.2-3bActive22d ago50.3051181730.00
anthropicclaude-haiku-4.5Active39m ago49.40373640.00
bedrockllama-3.2-90bActive28m ago46.60250380.00
deepinfrallama-3-8bStale(Medium)38m ago45.501869320.00
openaigpt-4.1Active35m ago42.301585530.00
bedrockmistral-largeActive28m ago40.80247530.00
deepinfrallama-3.2-1bStale(Medium)38m ago40.703100750.00
openaio3-2025-04-16Active36m ago40.2013680.00
bedrockclaude-haiku-4.5Active28m ago40.103651140.00
deepinfraQwen 2.5 Coder 32BNever Succeeded(Medium)39m ago39.801843290.00
openaigpt-4o-miniActive35m ago39.70764410.00
googlegemini-2.5-proNever Succeeded(Medium)35m ago39.602651700.00
deepinfrallama-3.2-3bStale(Medium)38m ago39.40399780.00
deepinfrallama-3.1-8bStale(Medium)38m ago38.20278780.00
openaio3Active36m ago37.4012630.00
openaiGPT-5.1-2025-11-13Active37m ago35.401262790.00
deepinfrallama-3.2-90bStale(Medium)38m ago33.50482860.00
openaiGPT-5.1Active36m ago32.80264990.00
openaigpt-4-turboActive3d ago32.70151520.00
bedrockclaude-3-5-haikuActive28m ago32.30538660.00
bedrockclaude-3-5-sonnetActive28m ago32.20146780.00
bedrockclaude-3-7-sonnetActive28m ago31.90242800.00
deepinfrallama-2-70bStale(Medium)38m ago31.60356690.00
deepinfrallama-3-70bStale(Medium)38m ago31.30255670.00
openaiGPT-5.4-2026-03-05Active37m ago30.601842680.00
openaiGPT-5.2-2025-12-11Active37m ago30.101843710.00
deepinfraqwen-2.5-72bStale(Medium)38m ago30.001461930.00
openaiGPT-5.4Active36m ago30.001545780.00
openaiGPT-5.1-chat-latestActive37m ago29.501352940.00
openaiGPT-5.2Active36m ago28.80447880.00
openaiGPT-5.1-codexActive36m ago27.401521230.00
openaigpt-4Active35m ago26.50447640.00
openaiGPT-5.1-codex-miniActive36m ago26.001511150.00
openaiGPT-5.3-codexActive36m ago24.90740900.00
deepinfrallama-3.1-405bStale(Medium)38m ago22.501391030.00
deepinfrallama-3.1-70bStale(Medium)38m ago22.101421080.00
bedrockclaude-sonnet-4.5Active28m ago21.401291730.00
anthropicclaude-opus-4.5Active39m ago20.902331720.00
bedrockclaude-opus-4.5Active28m ago19.001272280.00
anthropicclaude-4-sonnetActive39m ago19.006321980.00
deepinfrallama-3.3-70bNever Succeeded(Medium)39m ago18.501432390.00
anthropicClaude Opus 4.1Active39m ago17.707271490.00
deepinfrallama-3.2-11bStale(Medium)38m ago17.601611740.00
anthropicclaude-4-opusActive39m ago17.105221340.00
openaigpt-5.2-codexActive36m ago16.001351520.00
openaiGPT-5.2-chat-latestActive3h ago11.101271520.00
openaio1-proLikely Deprecated(Medium)35m ago10.00118640.00
openaiGPT-5.2-proActive36m ago9.014144730.00
openaiGPT-5-codexActive6h ago8.131171810.00
openaio3-proActive36m ago7.21115350.00
openaio3-pro-2025-06-10Active36m ago7.07214690.00
deepinfraqwen-3-235bNever Succeeded(Medium)39m ago6.721535540.00
openaiGPT-5-proActive38m ago3.91170.00
openaiGPT-5.2-pro-2025-12-11Active37m ago1.94148110.00
Lifecycle snapshot
Loading status summary…

πŸ“ˆ Time Series πŸ“ˆ