Cloudflare

AGGREGATED · INFERENCE

N/A

Uptime

N/A

Rating

100%

2026-05-22 to 2026-06-20

IBM: Granite 4.0 Micro · 445ms TTFT · 29 TPS

Meta: Llama 3.2 1B Instruct · 280ms TTFT · 95 TPS

Meta: Llama 3.2 3B Instruct · 212ms TTFT · 200 TPS

Z.ai: GLM 4.7 Flash · 468ms TTFT · 28 TPS

Google: Gemma 4 26B A4B · 471ms TTFT · 61 TPS

DeepSeek: DeepSeek V4 Flash · 767ms TTFT · 23 TPS

Mistral: Mistral 7B Instruct v0.1 · 309ms TTFT · 15 TPS

Meta: Llama 3.1 8B Instruct · 415ms TTFT · 18 TPS

Meta: Llama 3.3 70B Instruct · 398ms TTFT · 44 TPS

Mistral: Mistral Small 3.1 24B · 438ms TTFT · 35 TPS

Qwen2.5 Coder 32B Instruct · 538ms TTFT · 30 TPS

MoonshotAI: Kimi K2.6 · 834ms TTFT · 34 TPS

MoonshotAI: Kimi K2.7 Code · 1029ms TTFT · 53 TPS

Z.ai: GLM 5.2 · 1643ms TTFT · 27 TPS

Inference Models

Model	Input $/M	Output $/M	TTFT	TPS
IBM: Granite 4.0 Micro	$0.02	$0.11	445ms	29
Meta: Llama 3.2 1B Instruct	$0.03	$0.20	280ms	95
Meta: Llama 3.2 3B Instruct	$0.05	$0.34	212ms	200
Z.ai: GLM 4.7 Flash	$0.06	$0.40	468ms	28
Google: Gemma 4 26B A4B	$0.10	$0.30	471ms	61
DeepSeek: DeepSeek V4 Flash	$0.10	$0.20	767ms	23
Mistral: Mistral 7B Instruct v0.1	$0.11	$0.19	309ms	15
Meta: Llama 3.1 8B Instruct	$0.15	$0.29	415ms	18
Meta: Llama 3.3 70B Instruct	$0.29	$2.25	398ms	44
Mistral: Mistral Small 3.1 24B	$0.35	$0.55	438ms	35
Llama Guard 3 8B	$0.48	$0.03	—	—
Qwen2.5 Coder 32B Instruct	$0.66	$1.00	538ms	30
MoonshotAI: Kimi K2.6	$0.74	$3.50	834ms	34
MoonshotAI: Kimi K2.7 Code	$0.95	$4.00	1029ms	53
Z.ai: GLM 5.2	$1.40	$4.40	1643ms	27
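
To make the table's columns easier to compare, here is a minimal sketch of how per-request cost and a rough response time could be estimated from them. It assumes that "$/M" means US dollars per million tokens, that TTFT is time to first token, and that TPS is output tokens per second; the listing does not define these. The names `ModelStats`, `request_cost_usd`, and `estimated_latency_s` are hypothetical, and the latency formula (TTFT plus output tokens divided by TPS) is a simplification, not a figure published by the provider.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelStats:
    """One row of the table. Units are assumptions, see the text above."""
    input_usd_per_m: float   # "Input $/M": assumed USD per 1M input tokens
    output_usd_per_m: float  # "Output $/M": assumed USD per 1M output tokens
    ttft_ms: float           # "TTFT": assumed time to first token, in ms
    tps: float               # "TPS": assumed output tokens per second


# Two rows copied from the table; other rows can be added the same way.
MODELS = {
    "Meta: Llama 3.2 3B Instruct": ModelStats(0.05, 0.34, 212, 200),
    "Meta: Llama 3.3 70B Instruct": ModelStats(0.29, 2.25, 398, 44),
}


def request_cost_usd(stats: ModelStats, input_tokens: int, output_tokens: int) -> float:
    """Cost of one request: tokens times the per-million price, per direction."""
    total = (input_tokens * stats.input_usd_per_m
             + output_tokens * stats.output_usd_per_m)
    return total / 1_000_000


def estimated_latency_s(stats: ModelStats, output_tokens: int) -> float:
    """Rough wall-clock estimate: wait for the first token, then stream the rest."""
    return stats.ttft_ms / 1000 + output_tokens / stats.tps


if __name__ == "__main__":
    # Example workload (hypothetical): 2,000 input tokens, 500 output tokens.
    for name, stats in MODELS.items():
        cost = request_cost_usd(stats, 2_000, 500)
        latency = estimated_latency_s(stats, 500)
        print(f"{name}: ${cost:.6f} per request, ~{latency:.1f}s")
```

Under these assumptions, the cheaper model in the table is not automatically the slower one, so it can be worth comparing cost and estimated latency together for a given workload.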

4.5 ★★★★★ (2 reviews)

clouduser42

★★★★★ 2025-06-15

Reliable service, great API documentation.

mlresearcher

★★★★☆ 2025-06-10

Good performance but support could be faster.