LIVE · Cheapest H100: $2.49/hr · Updated: 04:27 PM

Cloudflare

Aggregated · Inference
Uptime: N/A
Rating: N/A

30-Day Uptime

100%
2026-05-22 to 2026-06-20

Inference Latency

IBM: Granite 4.0 Micro · 445ms TTFT · 29 TPS
Meta: Llama 3.2 1B Instruct · 280ms TTFT · 95 TPS
Meta: Llama 3.2 3B Instruct · 212ms TTFT · 200 TPS
Z.ai: GLM 4.7 Flash · 468ms TTFT · 28 TPS
Google: Gemma 4 26B A4B · 471ms TTFT · 61 TPS
DeepSeek: DeepSeek V4 Flash · 767ms TTFT · 23 TPS
Mistral: Mistral 7B Instruct v0.1 · 309ms TTFT · 15 TPS
Meta: Llama 3.1 8B Instruct · 415ms TTFT · 18 TPS
Meta: Llama 3.3 70B Instruct · 398ms TTFT · 44 TPS
Mistral: Mistral Small 3.1 24B · 438ms TTFT · 35 TPS
Qwen2.5 Coder 32B Instruct · 538ms TTFT · 30 TPS
MoonshotAI: Kimi K2.6 · 834ms TTFT · 34 TPS
MoonshotAI: Kimi K2.7 Code · 1029ms TTFT · 53 TPS
Z.ai: GLM 5.2 · 1643ms TTFT · 27 TPS
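
The two figures per model can be combined into a rough end-to-end estimate for a single response. The sketch below assumes TTFT is time to first token, TPS is steady output tokens per second once streaming starts, and that both hold constant for the whole response; the helper name `estimate_response_seconds` and the 500-token response length are illustrative, not part of the listing.

```python
def estimate_response_seconds(ttft_ms: float, tps: float, output_tokens: int) -> float:
    """Rough wall-clock estimate: time to first token plus steady decode time.

    Assumes the listed TTFT and TPS stay constant for the whole response,
    which real traffic will not guarantee.
    """
    if tps <= 0:
        raise ValueError("tps must be positive")
    return ttft_ms / 1000.0 + output_tokens / tps


# Figures copied from the latency list above; 500 output tokens is an assumed size.
llama_3b = estimate_response_seconds(ttft_ms=212, tps=200, output_tokens=500)
glm_52 = estimate_response_seconds(ttft_ms=1643, tps=27, output_tokens=500)

print(f"Llama 3.2 3B Instruct: {llama_3b:.2f}s")  # 2.71s
print(f"GLM 5.2: {glm_52:.2f}s")                  # 20.16s
```

For long responses the TPS term dominates, so a model with a higher TTFT but faster decoding can still finish sooner.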

Inference Models

| Model | Input $/M tokens | Output $/M tokens | TTFT | TPS |
|---|---|---|---|---|
| IBM: Granite 4.0 Micro | $0.02 | $0.11 | 445ms | 29 |
| Meta: Llama 3.2 1B Instruct | $0.03 | $0.20 | 280ms | 95 |
| Meta: Llama 3.2 3B Instruct | $0.05 | $0.34 | 212ms | 200 |
| Z.ai: GLM 4.7 Flash | $0.06 | $0.40 | 468ms | 28 |
| Google: Gemma 4 26B A4B | $0.10 | $0.30 | 471ms | 61 |
| DeepSeek: DeepSeek V4 Flash | $0.10 | $0.20 | 767ms | 23 |
| Mistral: Mistral 7B Instruct v0.1 | $0.11 | $0.19 | 309ms | 15 |
| Meta: Llama 3.1 8B Instruct | $0.15 | $0.29 | 415ms | 18 |
| Meta: Llama 3.3 70B Instruct | $0.29 | $2.25 | 398ms | 44 |
| Mistral: Mistral Small 3.1 24B | $0.35 | $0.55 | 438ms | 35 |
| Llama Guard 3 8B | $0.48 | $0.03 | | |
| Qwen2.5 Coder 32B Instruct | $0.66 | $1.00 | 538ms | 30 |
| MoonshotAI: Kimi K2.6 | $0.74 | $3.50 | 834ms | 34 |
| MoonshotAI: Kimi K2.7 Code | $0.95 | $4.00 | 1029ms | 53 |
| Z.ai: GLM 5.2 | $1.40 | $4.40 | 1643ms | 27 |
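
As a worked example of reading the price columns, the sketch below computes the cost of one request. It assumes "$/M" means US dollars per one million tokens, billed separately for input and output; the function name `request_cost_usd` and the token counts are hypothetical.

```python
def request_cost_usd(
    input_tokens: int,
    output_tokens: int,
    input_per_m: float,
    output_per_m: float,
) -> float:
    """Cost of one request, assuming prices are USD per 1,000,000 tokens."""
    return (input_tokens * input_per_m + output_tokens * output_per_m) / 1_000_000


# Meta: Llama 3.3 70B Instruct row: $0.29 input, $2.25 output.
# 10,000 input tokens and 2,000 output tokens are assumed request sizes.
cost = request_cost_usd(10_000, 2_000, input_per_m=0.29, output_per_m=2.25)
print(f"${cost:.4f}")  # $0.0074
```

Because output is priced well above input for most rows, the output token count usually drives the bill.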

Community Reviews

4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★ 2025-06-10

Good performance but support could be faster.