Cloud BenchmarksLocal Benchmarks
API Status

☁️ Cloud Benchmarks ☁️

I run cron jobs to periodically test the token generation speed of different cloud LLM providers. The chart helps visualize the distributions of different speeds, as they can vary somewhat depending on the loads. For readability not all models are shown, but you can see the full results in the table below.

Every provider and model now has a dedicated landing page with narrative insights, SEO-friendly metadata, and structured data for search engines. Click any provider or model in the table to explore performance in depth.

I am working daily to add more providers and models, looking anywhere that does not require purchasing dedicated endpoints for hosting (why some models may appear to be missing). If you have any more suggestions let me know on GitHub!! 😊

Pick A Path In 10 Seconds

Quick recommendations from the latest 7-day benchmark slice. Use one path, jump into full results, then drill into provider/model pages.

Loading 7-day recommendations…

Fastest Models Right Now (updated <24h)

#ModelProviderSpeed
1llama-3.1-8bgroq293 tok/s
2qwen-3-32bgroq205 tok/s
3llama-4-scoutgroq187 tok/s
4llama-3.3-70bgroq181 tok/s
5llama-3.1-8bcerebras171 tok/s

πŸ“Š Speed Distribution πŸ“Š

πŸ“š Full Results πŸ“š

Showing 96 of 96 modelsFlagged statuses: likely_deprecated, deprecated, failing, stale, never_succeeded, disabled
Status
groqllama-3.1-8bActive3h ago293.00130450100.00
groqllama-4-maverickActive17d ago214.001307880.00
groqqwen-3-32bActive3h ago205.0015287210.00
groqllama-4-scoutActive3h ago187.0051333250.00
groqllama-3.3-70bActive3h ago181.0040340170.00
cerebrasgpt-oss-120bActive14d ago179.0013801340.00
cerebrasllama-3.1-8bActive6h ago171.0013531370.00
togetherllama-3.1-8bActive12d ago143.003228390.00
groqkimi-k2Active3h ago127.0012215370.00
bedrocknova-microActive29m ago124.0065152260.00
openaio3 MiniNever Succeeded(Medium)3h ago109.0081690.00
openaio3-mini-2025-01-31Active3h ago106.00151600.00
bedrockllama-4-maverickActive29m ago105.001145520.00
bedrocknova-liteActive29m ago100.0020132300.00
bedrockllama-4-scoutActive29m ago99.103130300.00
bedrockllama-3.3-70bActive29m ago95.703134320.00
togetherqwen-2.5-7bActive3h ago90.501141530.00
openaiGPT-5.4-nanoActive3h ago90.5043129410.00
deepinframistral-7bStale(Medium)3h ago88.405148520.00
openaio1Active3h ago87.10211470.00
openaiGPT-5.4-nano-2026-03-17Active3h ago86.9036125480.00
openaiGPT-5.1-codex-maxActive3h ago85.90141181160.00
togetherllama-3.1-70bActive29d ago82.8049107240.00
deepinfradevstral-smallNever Succeeded(Medium)3h ago81.409140540.00
bedrocknova-proActive29m ago80.5019121380.00
openaiGPT-5.4-miniActive3h ago78.5016111520.00
openaiGPT-5.4-mini-2026-03-17Active3h ago75.209119580.00
togethermistral-7bActive29d ago74.203987190.00
googlegemini-2.5-flash-liteActive3h ago74.2010117530.00
fireworksmixtral-8x22bActive3h ago73.9029111340.00
openaigpt-3.5-turboActive3h ago73.404126520.00
openaigpt-4.1-nanoActive3h ago70.8018149460.00
googlegemini-2.5-flashNever Succeeded(Medium)3h ago64.0061051040.00
openaigpt-4oActive3h ago62.9051421590.00
togethermixtral-8x7bActive3h ago60.108114200.00
togetherdeepseek-r1Active3h ago58.401113730.00
fireworksllama-3.3-70bActive3h ago55.1011081470.00
togetherllama-3.3-70bActive3h ago52.5011211190.00
openaio4-mini-2025-04-16Active3h ago52.1028740.00
openaigpt-4.1-miniActive3h ago51.9015109430.00
togetherllama-3.2-3bActive20d ago51.6051211700.00
openaiGPT-5-chat-latestActive3h ago51.601382540.00
anthropicclaude-haiku-4.5Active3h ago49.80373640.00
openaio4 MiniNever Succeeded(Medium)3h ago49.804770.00
bedrockllama-3.2-90bActive29m ago46.60250380.00
deepinfrallama-3-8bStale(Medium)3h ago45.001869330.00
openaigpt-4.1Active3h ago41.201583540.00
bedrockmistral-largeActive29m ago40.70247540.00
deepinfrallama-3.1-8bStale(Medium)3h ago40.00278780.00
deepinfrallama-3.2-1bStale(Medium)3h ago39.903100790.00
openaigpt-4o-miniActive3h ago39.60764410.00
bedrockclaude-haiku-4.5Active30m ago39.503621150.00
googlegemini-2.5-proNever Succeeded(Medium)3h ago39.302651730.00
deepinfrallama-3.2-3bStale(Medium)3h ago38.70299850.00
deepinfraQwen 2.5 Coder 32BNever Succeeded(Medium)3h ago38.101843520.00
openaio3-2025-04-16Active3h ago37.8013680.00
openaiGPT-5.1-2025-11-13Active3h ago36.201262750.00
openaio3Active3h ago35.9012630.00
deepinfrallama-3.2-90bStale(Medium)3h ago34.30482870.00
deepinfrallama-2-70bStale(Medium)3h ago32.90356670.00
deepinfrallama-3-70bStale(Medium)3h ago32.60255660.00
openaigpt-4-turboActive1d ago32.50151520.00
openaiGPT-5.1Active3h ago32.302641030.00
bedrockclaude-3-5-haikuActive30m ago32.20538660.00
bedrockclaude-3-5-sonnetActive30m ago32.20146770.00
bedrockclaude-3-7-sonnetActive30m ago31.80242800.00
deepinfraqwen-2.5-72bStale(Medium)6h ago30.801461420.00
openaiGPT-5.4-2026-03-05Active3h ago30.301842690.00
openaiGPT-5.2-2025-12-11Active3h ago29.801840690.00
openaiGPT-5.4Active3h ago29.501541810.00
openaiGPT-5.1-chat-latestActive3h ago29.301348940.00
openaiGPT-5.2Active3h ago28.20447910.00
openaiGPT-5.1-codexActive3h ago26.901521250.00
openaigpt-4Active3h ago26.40447630.00
openaiGPT-5.1-codex-miniActive3h ago25.701501160.00
openaiGPT-5.3-codexActive3h ago24.10737930.00
deepinfrallama-3.1-70bStale(Medium)3h ago22.901421080.00
deepinfrallama-3.1-405bStale(Medium)3h ago22.801391000.00
bedrockclaude-sonnet-4.5Active29m ago21.201281760.00
anthropicclaude-opus-4.5Active3h ago20.502331790.00
anthropicclaude-4-sonnetActive3h ago19.406321900.00
bedrockclaude-opus-4.5Active30m ago19.301272170.00
deepinfrallama-3.3-70bNever Succeeded(Medium)3h ago18.001432660.00
anthropicClaude Opus 4.1Active3h ago17.707271500.00
anthropicclaude-4-opusActive3h ago17.205221350.00
openaigpt-5.2-codexActive3h ago15.401351600.00
deepinfrallama-3.2-11bStale(Medium)3h ago15.301611870.00
openaiGPT-5.2-chat-latestActive6h ago10.901241560.00
openaio1-proLikely Deprecated(Medium)3h ago9.86118640.00
openaiGPT-5.2-proActive3h ago8.874144790.00
openaiGPT-5-codexActive15h ago8.301171800.00
deepinfraqwen-3-235bNever Succeeded(Medium)6h ago7.111535360.00
openaio3-proActive3h ago6.64113440.00
openaio3-pro-2025-06-10Active3h ago6.55211850.00
openaiGPT-5-proActive3h ago3.79160.00
openaiGPT-5.2-pro-2025-12-11Active9h ago1.89148070.00
Lifecycle snapshot
Loading status summary…

πŸ“ˆ Time Series πŸ“ˆ