
☁️ Cloud Benchmarks ☁️

I run cron jobs that periodically test the token generation speed of different cloud LLM providers. The chart shows the distribution of speeds for each model, since throughput varies with provider load. For readability, not all models are shown in the chart, but the full results are in the table below.
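A minimal sketch of how a periodic job could measure token generation speed and summarize its spread. This is an illustration only, not the site's actual benchmark code: `measure_tokens_per_second`, `summarize`, and the `generate` callable (anything that yields tokens as they arrive) are hypothetical names.

```python
import statistics
import time
from typing import Callable, Iterable


def measure_tokens_per_second(generate: Callable[[], Iterable[str]]) -> float:
    """Time one generation and return tokens per second.

    `generate` is a hypothetical callable that yields tokens as they arrive.
    """
    start = time.perf_counter()
    token_count = sum(1 for _ in generate())
    elapsed = time.perf_counter() - start
    return token_count / elapsed if elapsed > 0 else 0.0


def summarize(speeds: list[float]) -> dict[str, float]:
    """Summarize a sample of speeds (needs at least two values)."""
    ordered = sorted(speeds)
    # n=10 yields nine cut points: index 0 is the 10th percentile,
    # index 8 is the 90th percentile.
    deciles = statistics.quantiles(ordered, n=10, method="inclusive")
    return {
        "min": ordered[0],
        "p10": deciles[0],
        "median": statistics.median(ordered),
        "p90": deciles[8],
        "max": ordered[-1],
    }
```

Repeating the measurement on a schedule and feeding the collected speeds to `summarize` gives the kind of spread a distribution chart visualizes, rather than a single point estimate.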

Every provider and model now has a dedicated landing page with narrative insights, SEO-friendly metadata, and structured data for search engines. Click any provider or model in the table to explore performance in depth.

I am working daily to add more providers and models. I include any provider that does not require purchasing a dedicated endpoint for hosting, which is why some models may appear to be missing. If you have any suggestions, let me know on GitHub! 😊

Pick A Path In 10 Seconds

Quick recommendations from the latest 7-day benchmark slice. Use one path, jump into full results, then drill into provider/model pages.


Fastest Models Right Now (updated <24h)

| # | Model | Provider | Speed |
|---|---|---|---|
| 1 | llama-3.1-8b | groq | 306 tok/s |
| 2 | qwen-3-32b | groq | 211 tok/s |
| 3 | llama-3.3-70b | groq | 192 tok/s |
| 4 | llama-4-scout | groq | 191 tok/s |
| 5 | llama-3.1-8b | cerebras | 169 tok/s |
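A small sketch of how a "fastest right now" ranking with a freshness cutoff could be computed. The `Result` record, the `fastest_recent` function, and the hour-based age field are assumptions made for illustration; the sample speeds and ages are taken from the full results table on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Result:
    provider: str
    model: str
    tokens_per_second: float
    age_hours: float  # hypothetical field: hours since the last update


def fastest_recent(results, max_age_hours=24.0, limit=5):
    """Return the fastest results updated within the cutoff, fastest first."""
    fresh = [r for r in results if r.age_hours < max_age_hours]
    return sorted(fresh, key=lambda r: r.tokens_per_second, reverse=True)[:limit]


SAMPLE = [
    Result("groq", "llama-3.1-8b", 306.0, 3),
    Result("groq", "qwen-3-32b", 211.0, 3),
    Result("groq", "llama-4-maverick", 203.0, 10 * 24),  # 10d ago
    Result("groq", "llama-3.3-70b", 192.0, 3),
    Result("groq", "llama-4-scout", 191.0, 3),
    Result("cerebras", "gpt-oss-120b", 184.0, 7 * 24),  # 7d ago
    Result("cerebras", "llama-3.1-8b", 169.0, 3),
]

for rank, r in enumerate(fastest_recent(SAMPLE), start=1):
    print(rank, r.model, r.provider, f"{r.tokens_per_second:.0f} tok/s")
```

Under this assumed cutoff, llama-4-maverick on groq (203 tok/s, 10d ago) and gpt-oss-120b on cerebras (184 tok/s, 7d ago) drop out, which is consistent with the five rows shown above.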

📊 Speed Distribution 📊

📚 Full Results 📚

Showing 98 of 98 models
Flagged statuses: likely_deprecated, deprecated, failing, stale, never_succeeded, disabled
| Provider | Model | Status | Updated | Speed (tok/s) | Remaining columns (fused in source) |
|---|---|---|---|---|---|
| groq | llama-3.1-8b | Active | 3h ago | 306.00 | 130459100.00 |
| groq | qwen-3-32b | Active | 3h ago | 211.00 | 2368320.00 |
| groq | llama-4-maverick | Active | 10d ago | 203.00 | 1307720.00 |
| groq | llama-3.3-70b | Active | 3h ago | 192.00 | 68340150.00 |
| groq | llama-4-scout | Active | 3h ago | 191.00 | 38335250.00 |
| cerebras | gpt-oss-120b | Active | 7d ago | 184.00 | 13801210.00 |
| cerebras | llama-3.1-8b | Active | 3h ago | 169.00 | 13531320.00 |
| together | llama-3.1-8b | Active | 5d ago | 141.00 | 3228340.00 |
| groq | kimi-k2 | Active | 3h ago | 136.00 | 12215330.00 |
| bedrock | nova-micro | Active | 54m ago | 122.00 | 65152270.00 |
| openai | o3 Mini | Never Succeeded(Medium) | 3h ago | 109.00 | 81640.00 |
| openai | o3-mini-2025-01-31 | Active | 3h ago | 108.00 | 151550.00 |
| bedrock | llama-4-maverick | Active | 54m ago | 107.00 | 3139290.00 |
| bedrock | nova-lite | Active | 54m ago | 100.00 | 22132300.00 |
| bedrock | llama-4-scout | Active | 54m ago | 100.00 | 3130290.00 |
| bedrock | llama-3.3-70b | Active | 54m ago | 96.40 | 3136300.00 |
| together | qwen-2.5-7b | Active | 3h ago | 93.50 | 1145500.00 |
| openai | GPT-5.4-nano | Active | 3h ago | 91.00 | 64120400.00 |
| openai | GPT-5.4-nano-2026-03-17 | Active | 3h ago | 89.70 | 36120530.00 |
| openai | o1 | Active | 3h ago | 84.40 | 411410.00 |
| bedrock | nova-pro | Active | 54m ago | 84.30 | 19121370.00 |
| openai | GPT-5.1-codex-max | Active | 3h ago | 82.30 | 111181280.00 |
| openai | GPT-5.4-mini-2026-03-17 | Active | 3h ago | 81.90 | 42119470.00 |
| deepinfra | mistral-7b | Stale(Medium) | 6h ago | 80.20 | 5148610.00 |
| openai | GPT-5.4-mini | Active | 3h ago | 77.00 | 16111730.00 |
| deepinfra | devstral-small | Never Succeeded(Medium) | 3h ago | 76.60 | 9140590.00 |
| together | llama-3.1-70b | Active | 22d ago | 74.60 | 15129330.00 |
| openai | gpt-3.5-turbo | Active | 3h ago | 74.50 | 13126520.00 |
| google | gemini-2.5-flash-lite | Active | 3h ago | 72.40 | 10117550.00 |
| fireworks | mixtral-8x22b | Active | 3h ago | 70.10 | 29111390.00 |
| openai | gpt-4.1-nano | Active | 3h ago | 70.10 | 9149480.00 |
| together | mistral-7b | Active | 22d ago | 69.80 | 690410.00 |
| openai | gpt-4o | Active | 3h ago | 67.50 | 81731500.00 |
| google | gemini-2.5-flash | Never Succeeded(Medium) | 3h ago | 65.80 | 5105990.00 |
| together | mixtral-8x7b | Active | 3h ago | 60.90 | 14114170.00 |
| together | deepseek-r1 | Active | 3h ago | 55.20 | 1113730.00 |
| together | llama-3.2-3b | Active | 13d ago | 55.00 | 51211460.00 |
| deepinfra | mixtral-8x22b | Stale(Medium) | 29d ago | 54.40 | 5257540.00 |
| fireworks | llama-3.3-70b | Active | 3h ago | 54.40 | 11081650.00 |
| together | llama-3.3-70b | Active | 3h ago | 52.00 | 11461220.00 |
| openai | GPT-5-chat-latest | Active | 3h ago | 51.50 | 1776700.00 |
| openai | gpt-4.1-mini | Active | 3h ago | 51.20 | 15109440.00 |
| anthropic | claude-haiku-4.5 | Active | 3h ago | 51.10 | 373630.00 |
| openai | o4-mini-2025-04-16 | Active | 3h ago | 50.60 | 28680.00 |
| openai | o4 Mini | Never Succeeded(Medium) | 3h ago | 49.10 | 4720.00 |
| bedrock | llama-3.2-90b | Active | 54m ago | 46.60 | 250370.00 |
| deepinfra | llama-3-8b | Stale(Medium) | 3h ago | 45.10 | 1869320.00 |
| deepinfra | llama-3.1-8b | Stale(Medium) | 3h ago | 44.60 | 385690.00 |
| openai | gpt-4.1 | Active | 3h ago | 41.50 | 1583510.00 |
| bedrock | mistral-large | Active | 53m ago | 40.60 | 247570.00 |
| google | gemini-2.5-pro | Never Succeeded(Medium) | 3h ago | 40.50 | 2721730.00 |
| deepinfra | llama-3.2-1b | Stale(Medium) | 3h ago | 40.10 | 1100860.00 |
| bedrock | claude-haiku-4.5 | Active | 54m ago | 39.50 | 3621170.00 |
| openai | gpt-4o-mini | Active | 3h ago | 39.50 | 764400.00 |
| deepinfra | llama-3.2-3b | Stale(Medium) | 3h ago | 39.10 | 299840.00 |
| openai | GPT-5.1-2025-11-13 | Active | 3h ago | 35.70 | 2148740.00 |
| openai | o3-2025-04-16 | Active | 3h ago | 35.40 | 13610.00 |
| openai | o3 | Active | 3h ago | 35.20 | 19630.00 |
| deepinfra | llama-2-70b | Stale(Medium) | 3h ago | 34.80 | 357600.00 |
| deepinfra | llama-3.2-90b | Stale(Medium) | 6h ago | 34.50 | 382810.00 |
| deepinfra | llama-3-70b | Stale(Medium) | 3h ago | 34.10 | 255650.00 |
| deepinfra | Qwen 2.5 Coder 32B | Never Succeeded(Medium) | 3h ago | 33.40 | 1823520.00 |
| openai | GPT-5.4-2026-03-05 | Active | 3h ago | 32.80 | 2539620.00 |
| bedrock | claude-3-5-sonnet | Active | 54m ago | 32.70 | 146650.00 |
| deepinfra | qwen-2.5-72b | Stale(Medium) | 3h ago | 32.50 | 150780.00 |
| openai | gpt-4-turbo | Active | 3h ago | 32.30 | 752530.00 |
| bedrock | claude-3-7-sonnet | Active | 54m ago | 32.10 | 242770.00 |
| bedrock | claude-3-5-haiku | Active | 54m ago | 31.80 | 938640.00 |
| openai | GPT-5.2-2025-12-11 | Active | 3h ago | 30.80 | 2240670.00 |
| openai | GPT-5.1 | Active | 3h ago | 29.90 | 2571100.00 |
| openai | GPT-5.4 | Active | 3h ago | 28.10 | 1540950.00 |
| openai | GPT-5.2 | Active | 3h ago | 27.40 | 440950.00 |
| openai | GPT-5.1-chat-latest | Active | 3h ago | 27.40 | 17411040.00 |
| openai | gpt-4 | Active | 3h ago | 27.20 | 847630.00 |
| openai | GPT-5.1-codex | Active | 3h ago | 26.40 | 1481250.00 |
| openai | GPT-5.1-codex-mini | Active | 3h ago | 25.50 | 1521190.00 |
| deepinfra | llama-3.1-405b | Stale(Medium) | 3h ago | 24.70 | 139850.00 |
| deepinfra | llama-3.1-70b | Stale(Medium) | 3h ago | 23.30 | 1441100.00 |
| bedrock | claude-sonnet-4.5 | Active | 54m ago | 21.90 | 1281690.00 |
| openai | GPT-5.3-codex | Active | 3h ago | 21.60 | 7321170.00 |
| anthropic | claude-opus-4.5 | Active | 3h ago | 20.60 | 2331790.00 |
| anthropic | claude-4-sonnet | Active | 3h ago | 19.40 | 6311910.00 |
| bedrock | claude-opus-4.5 | Active | 54m ago | 19.30 | 1271980.00 |
| bedrock | claude-3-opus | Active | 28d ago | 19.20 | 821930.00 |
| anthropic | Claude Opus 4.1 | Active | 3h ago | 17.50 | 7271570.00 |
| deepinfra | llama-3.3-70b | Never Succeeded(Medium) | 3h ago | 17.40 | 1462590.00 |
| anthropic | claude-4-opus | Active | 3h ago | 17.30 | 5221350.00 |
| openai | gpt-5.2-codex | Active | 6h ago | 13.80 | 1271700.00 |
| openai | o1-pro | Likely Deprecated(Medium) | 3h ago | 9.70 | 118670.00 |
| deepinfra | llama-3.2-11b | Stale(Medium) | 3h ago | 9.06 | 1602420.00 |
| openai | GPT-5.2-pro | Active | 3h ago | 8.56 | 4144950.00 |
| deepinfra | qwen-3-235b | Never Succeeded(Medium) | 3h ago | 8.29 | 1535570.00 |
| openai | GPT-5-codex | Active | 9h ago | 7.55 | 1141570.00 |
| openai | GPT-5.2-chat-latest | Active | 3h ago | 7.27 | 3171790.00 |
| openai | o3-pro | Active | 3h ago | 6.43 | 310620.00 |
| openai | o3-pro-2025-06-10 | Active | 3h ago | 6.00 | 3113250.00 |
| openai | GPT-5-pro | Active | 3h ago | 3.68 | 250.00 |
| openai | GPT-5.2-pro-2025-12-11 | Active | 3h ago | 1.90 | 138330.00 |
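A rough sketch of how the status labels in the table could be mapped onto the flagged status identifiers listed above it. The parsing rule is an assumption: it treats the parenthesized part (for example "Medium") as an optional qualifier whose meaning the page does not explain, and `parse_status` and `is_flagged` are hypothetical helper names.

```python
import re
from typing import Optional

# Identifiers copied from the "Flagged statuses" line on this page.
FLAGGED_STATUSES = frozenset(
    {"likely_deprecated", "deprecated", "failing", "stale", "never_succeeded", "disabled"}
)

# A display name, optionally followed by a parenthesized qualifier.
_LABEL = re.compile(r"^(?P<name>[^()]+?)\s*(?:\((?P<qualifier>[^()]*)\))?$")


def parse_status(label: str) -> tuple[str, Optional[str]]:
    """Split a label such as 'Never Succeeded(Medium)' into (identifier, qualifier)."""
    match = _LABEL.match(label.strip())
    if match is None:
        raise ValueError(f"unrecognized status label: {label!r}")
    identifier = "_".join(match.group("name").lower().split())
    qualifier = match.group("qualifier")
    return identifier, qualifier.lower() if qualifier else None


def is_flagged(label: str) -> bool:
    """Report whether a display label maps to one of the flagged statuses."""
    return parse_status(label)[0] in FLAGGED_STATUSES
```

With this mapping, rows labeled "Stale(Medium)" or "Never Succeeded(Medium)" count as flagged, while "Active" rows do not.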
Lifecycle snapshot

📈 Time Series 📈