☁️ Cloud Benchmarks ☁️

I run cron jobs to periodically test the token generation speed of different cloud LLM providers. The chart visualizes the distribution of speeds for each model, since throughput varies with provider load. For readability, not all models are shown in the chart, but the full results are in the table below.
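As a rough illustration of what one benchmark run reduces to, the sketch below turns token counts and wall-clock times into tok/s and summarizes the spread across runs. The function names and the sample runs are hypothetical and are not taken from the site's actual benchmark code.

```python
import statistics


def tokens_per_second(token_count: int, elapsed_seconds: float) -> float:
    """Throughput of a single run: output tokens divided by wall-clock time."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return token_count / elapsed_seconds


def summarize(speeds: list[float]) -> dict[str, float]:
    """Basic distribution summary for a list of per-run speeds."""
    ordered = sorted(speeds)
    return {
        "min": ordered[0],
        "median": statistics.median(ordered),
        "mean": statistics.fmean(ordered),
        "max": ordered[-1],
    }


# Hypothetical runs for one model: (output tokens, seconds elapsed).
runs = [(256, 2.0), (256, 2.5), (256, 1.6), (256, 4.0)]
speeds = [tokens_per_second(tokens, seconds) for tokens, seconds in runs]
summary = summarize(speeds)
```

The same prompt can land anywhere between the minimum and maximum depending on load, which is why a distribution is more informative than a single number.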

Every provider and model now has a dedicated landing page with narrative insights, SEO-friendly metadata, and structured data for search engines. Click any provider or model in the table to explore performance in depth.

I am working daily to add more providers and models, looking at any provider that does not require purchasing a dedicated endpoint for hosting (which is why some models may appear to be missing). If you have any suggestions, let me know on GitHub! 😊

Pick A Path In 10 Seconds

Quick recommendations from the latest 7-day benchmark slice. Use one path, jump into full results, then drill into provider/model pages.

Fastest Models Right Now (updated <24h)

#	Model	Provider	Speed
1	llama-3.1-8b	groq	229 tok/s
2	qwen3.6-27b	groq	222 tok/s
3	Google: Nano Banana (Gemini 2.5 Flash Image)	google	163 tok/s
4	llama-3.3-70b	groq	154 tok/s
5	nova-micro	bedrock	125 tok/s
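The ranking above can be approximated from the full results table. The sketch below assumes that the fourth and fifth columns of that table are the age of the last run and the speed in tok/s; the row tuples are copied from the table, while `parse_age` and `fastest_recent` are hypothetical helpers, not the site's own code.

```python
import re
from datetime import timedelta

_UNITS = {"m": "minutes", "h": "hours", "d": "days"}


def parse_age(text: str) -> timedelta:
    """Convert a relative age such as '40m ago' or '16d ago' to a timedelta."""
    match = re.fullmatch(r"(\d+)([mhd]) ago", text)
    if match is None:
        raise ValueError(f"unrecognized age: {text!r}")
    value, unit = match.groups()
    return timedelta(**{_UNITS[unit]: int(value)})


def fastest_recent(rows, limit=5, max_age=timedelta(hours=24)):
    """Keep rows updated within max_age, then rank by speed, fastest first."""
    recent = [row for row in rows if parse_age(row[2]) < max_age]
    return sorted(recent, key=lambda row: row[3], reverse=True)[:limit]


# (provider, model, last run, speed in tok/s), taken from the full results table.
rows = [
    ("groq", "GPT-oss-safeguard-20b", "16d ago", 299.0),
    ("groq", "llama-3.1-8b", "40m ago", 229.0),
    ("groq", "qwen3.6-27b", "39m ago", 222.0),
    ("cerebras", "gemma-4-31b", "23d ago", 183.0),
    ("groq", "qwen-3-32b", "15d ago", 169.0),
    ("google", "Google: Nano Banana (Gemini 2.5 Flash Image)", "39m ago", 163.0),
    ("groq", "llama-3.3-70b", "39m ago", 154.0),
    ("bedrock", "nova-micro", "25m ago", 125.0),
    ("openai", "o3 Mini", "26d ago", 118.0),
]

top = fastest_recent(rows)
```

Under that assumption, models with a higher recorded speed but a last run more than 24 hours old are skipped, which would explain why they are absent from the list.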

📊 Speed Distribution 📊

📚 Full Results 📚

Showing 76 of 76 models.

Flagged statuses: likely_deprecated, deprecated, failing, stale, never_succeeded, disabled.

Table throughput uses visible output tokens when available; generated average and charts show total generated work, including reasoning tokens.
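The difference between the two throughput figures can be sketched as follows. This is a minimal illustration that assumes each run records a total token count, an optional visible token count, and the elapsed time; the `Run` type and both function names are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Run:
    total_tokens: int              # all generated tokens, reasoning included
    visible_tokens: Optional[int]  # None when no visible/reasoning split is reported
    elapsed_seconds: float


def table_throughput(run: Run) -> float:
    """Visible output tokens per second, falling back to the total if unavailable."""
    tokens = run.visible_tokens if run.visible_tokens is not None else run.total_tokens
    return tokens / run.elapsed_seconds


def generated_throughput(run: Run) -> float:
    """All generated tokens per second, including reasoning tokens."""
    return run.total_tokens / run.elapsed_seconds


# Hypothetical reasoning-model run: 300 visible + 200 reasoning tokens in 5 s.
reasoning_run = Run(total_tokens=500, visible_tokens=300, elapsed_seconds=5.0)
# Hypothetical run where no split is reported.
plain_run = Run(total_tokens=500, visible_tokens=None, elapsed_seconds=5.0)
```

For `reasoning_run` the two figures diverge (60 vs. 100 tok/s), while for `plain_run` they coincide, so a gap between the two numbers for a model indicates tokens that were generated but not shown.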

Provider	Model	Status	Updated	Speed (tok/s)
groq	GPT-oss-safeguard-20b	Active	16d ago	299.00	643.00	Visible	7	632	0.00
groq	llama-3.1-8b	Active	40m ago	229.00	229.00	Visible	1	363	0.00
groq	qwen3.6-27b	Active	39m ago	222.00	215.00	Visible	1	321	0.00
cerebras	gemma-4-31b	Active	23d ago	183.00	183.00	Visible	3	217	380.00
groq	qwen-3-32b	Active	15d ago	169.00	167.00	Visible	1	241	0.00
google	Google: Nano Banana (Gemini 2.5 Flash Image)	Active	39m ago	163.00	163.00	Visible	9	249	920.00
groq	llama-3.3-70b	Active	39m ago	154.00	154.00	Visible	1	252	0.00
bedrock	nova-micro	Active	25m ago	125.00	125.00	Visible	42	173	290.00
openai	o3 Mini	Never Succeeded (Medium)	26d ago	118.00	154.00	Visible	32	230	1430.00
groq	llama-4-scout	Active	15d ago	112.00	112.00	Visible	3	200	0.00
together	LFM2.5-8B-A1B	Active	36m ago	108.00	106.00	Visible	33	138	0.00
bedrock	llama-4-maverick	Active	25m ago	105.00	105.00	Visible	23	136	290.00
fireworks	GPT-oss-120b	Active	55m ago	105.00	179.00	Visible	7	211	0.00
bedrock	llama-3.1-8b	Active	25m ago	95.80	95.80	Visible	7	112	380.00
bedrock	llama-4-scout	Active	25m ago	93.30	93.30	Visible	9	130	310.00
bedrock	nova-pro	Active	25m ago	93.20	93.20	Visible	37	132	380.00
bedrock	nova-lite	Active	25m ago	91.90	91.90	Visible	1	129	370.00
openai	o1	Active	27d ago	87.90	113.00	Visible	33	148	1870.00
openai	GPT-5 Nano	Never Succeeded (Medium)	42m ago	85.50	121.00	Visible	14	158	1890.00
bedrock	llama-3.3-70b	Active	25m ago	84.40	84.40	Visible	4	120	420.00
openai	GPT-5.1-codex-mini	Active	12d ago	84.40	84.50	Visible	7	106	1010.00
bedrock	mistral-7b	Active	25m ago	82.20	82.20	Visible	18	87	190.00
together	Kimi-K2.7-Code	Active	20m ago	78.00	147.00	Visible	2	199	0.00
bedrock	llama-3-8b	Active	25m ago	77.90	77.90	Visible	1	87	260.00
google	gemini-2.5-flash-lite	Active	1h ago	75.40	75.40	Visible	23	120	530.00
bedrock	mixtral-8x7b	Active	25m ago	73.40	73.40	Visible	19	81	230.00
openai	GPT-5.1-codex	Active	12d ago	68.70	68.80	Visible	8	84	1030.00
fireworks	minimax-m2p7	Active	30m ago	68.00	66.00	Visible	8	136	0.00
together	GLM-5.2	Active	19d ago	66.90	87.90	Visible	1	145	0.00
openai	GPT-5.1-codex-max	Active	12d ago	60.70	107.00	Visible	6	122	2910.00
openai	GPT-5-chat-latest	Active	12d ago	60.60	60.60	Visible	12	78	1110.00
openai	GPT-5-codex	Active	12d ago	57.50	59.10	Visible	1	80	1510.00
bedrock	mistral-small	Active	25m ago	56.30	56.30	Visible	23	60	210.00
openai	gpt-4.1-nano	Active	35m ago	51.60	51.60	Visible	3	88	900.00
openai	GPT-5.1-chat-latest	Active	12d ago	50.90	55.00	Visible	7	72	1700.00
openai	gpt-5.2-codex	Active	12d ago	50.30	50.50	Visible	2	62	1360.00
openai	gpt-4.1	Active	33m ago	49.80	49.80	Visible	4	79	740.00
openai	o4 Mini	Never Succeeded (Medium)	19m ago	49.00	74.40	Visible	3	167	1690.00
bedrock	llama-3.2-90b	Active	17d ago	46.90	46.90	Visible	13	50	350.00
bedrock	llama-3.1-70b	Active	25m ago	44.70	44.70	Visible	2	74	520.00
openai	GPT-5.4-mini	Active	25m ago	44.00	62.40	Visible	5	77	1230.00
anthropic	claude-haiku-4.5	Active	32m ago	43.90	43.90	Visible	6	67	900.00
bedrock	claude-haiku-4.5	Active	26m ago	43.20	43.20	Visible	1	60	930.00
openai	gpt-3.5-turbo	Active	23m ago	42.30	42.30	Visible	4	76	1170.00
bedrock	mistral-large	Active	25m ago	42.10	42.10	Visible	5	47	260.00
fireworks	kimi-k2p6	Active	59m ago	42.00	41.70	Visible	1	80	0.00
openai	GPT-5.2-chat-latest	Active	19m ago	39.60	41.20	Visible	6	53	1970.00
openai	gpt-4o	Active	24m ago	38.90	38.90	Visible	5	64	930.00
openai	gpt-4o-mini	Active	25m ago	37.20	37.20	Visible	9	57	970.00
openai	gpt-4.1-mini	Active	25m ago	37.00	37.00	Visible	1	58	1120.00
openai	GPT-5	Never Succeeded (Medium)	58m ago	36.50	55.80	Visible	3	74	4190.00
bedrock	llama-3-70b	Active	25m ago	36.50	36.50	Visible	1	42	490.00
fireworks	glm-5p2	Active	23m ago	32.30	32.60	Visible	2	109	0.00
openai	GPT-5.1	Active	21m ago	31.70	43.20	Visible	3	59	1680.00
anthropic	claude-sonnet-5	Active	30m ago	31.40	31.40	Visible	10	43	960.00
fireworks	glm-5p1	Active	12d ago	28.20	28.20	Visible	2	64	0.00
bedrock	claude-sonnet-4.6	Active	25m ago	27.40	27.40	Visible	6	34	950.00
bedrock	claude-opus-4.7	Active	25m ago	27.20	27.20	Visible	1	39	1870.00
openai	GPT-5.6-terra	Active	22m ago	26.30	42.60	Visible	8	37	1850.00
openai	GPT-5.6-luna	Active	22m ago	26.10	46.80	Visible	7	47	2310.00
openai	GPT-5.4	Active	19m ago	25.90	41.50	Visible	3	43	1820.00
anthropic	claude-sonnet-4.6	Active	57m ago	24.10	24.10	Visible	7	34	1200.00
openai	GPT-5.2	Active	18m ago	23.50	33.80	Visible	4	38	1830.00
openai	gpt-4	Active	19m ago	23.30	23.30	Visible	3	37	1340.00
openai	GPT-5.5	Active	1h ago	22.20	27.90	Visible	4	32	1950.00
bedrock	claude-sonnet-4.5	Active	25m ago	22.20	22.20	Visible	1	30	1490.00
bedrock	claude-opus-4.6	Active	25m ago	21.10	21.10	Visible	3	25	1700.00
bedrock	claude-opus-4.5	Active	25m ago	20.10	20.10	Visible	2	24	1660.00
together	Inkling	Active	3d ago	19.90	106.00	Visible	14	31	0.00
openai	GPT-5.6-sol	Active	21m ago	17.80	30.30	Visible	4	28	2450.00
openai	o3-pro	Active	46m ago	17.70	23.20	Visible	3	37	25100.00
openai	GPT-5.3-codex	Active	19m ago	17.30	41.20	Visible	1	37	2260.00
bedrock	llama-3.1-405b	Active	24d ago	17.00	17.00	Visible	8	19	3750.00
openai	GPT-5.2-pro	Active	30m ago	14.60	17.60	Visible	5	23	29900.00
openai	GPT-5.5-pro	Active	40m ago	11.70	24.50	Visible	0	36	25300.00
bedrock	claude-opus-4.1	Active	26m ago	8.50	8.50	Visible	1	16	3490.00

Lifecycle snapshot

📈 Time Series 📈