C
Cloudflare
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
100%2026-05-222026-06-20
Inference Latency
IBM: Granite 4.0 Micro445ms TTFT · 29 TPS
Meta: Llama 3.2 1B Instruct280ms TTFT · 95 TPS
Meta: Llama 3.2 3B Instruct212ms TTFT · 200 TPS
Z.ai: GLM 4.7 Flash468ms TTFT · 28 TPS
Google: Gemma 4 26B A4B 471ms TTFT · 61 TPS
DeepSeek: DeepSeek V4 Flash767ms TTFT · 23 TPS
Mistral: Mistral 7B Instruct v0.1309ms TTFT · 15 TPS
Meta: Llama 3.1 8B Instruct415ms TTFT · 18 TPS
Meta: Llama 3.3 70B Instruct398ms TTFT · 44 TPS
Mistral: Mistral Small 3.1 24B438ms TTFT · 35 TPS
Qwen2.5 Coder 32B Instruct538ms TTFT · 30 TPS
MoonshotAI: Kimi K2.6834ms TTFT · 34 TPS
MoonshotAI: Kimi K2.7 Code1029ms TTFT · 53 TPS
Z.ai: GLM 5.21643ms TTFT · 27 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| IBM: Granite 4.0 Micro | $0.02 | $0.11 | 445ms | 29 |
| Meta: Llama 3.2 1B Instruct | $0.03 | $0.20 | 280ms | 95 |
| Meta: Llama 3.2 3B Instruct | $0.05 | $0.34 | 212ms | 200 |
| Z.ai: GLM 4.7 Flash | $0.06 | $0.40 | 468ms | 28 |
| Google: Gemma 4 26B A4B | $0.10 | $0.30 | 471ms | 61 |
| DeepSeek: DeepSeek V4 Flash | $0.10 | $0.20 | 767ms | 23 |
| Mistral: Mistral 7B Instruct v0.1 | $0.11 | $0.19 | 309ms | 15 |
| Meta: Llama 3.1 8B Instruct | $0.15 | $0.29 | 415ms | 18 |
| Meta: Llama 3.3 70B Instruct | $0.29 | $2.25 | 398ms | 44 |
| Mistral: Mistral Small 3.1 24B | $0.35 | $0.55 | 438ms | 35 |
| Llama Guard 3 8B | $0.48 | $0.03 | — | — |
| Qwen2.5 Coder 32B Instruct | $0.66 | $1.00 | 538ms | 30 |
| MoonshotAI: Kimi K2.6 | $0.74 | $3.50 | 834ms | 34 |
| MoonshotAI: Kimi K2.7 Code | $0.95 | $4.00 | 1029ms | 53 |
| Z.ai: GLM 5.2 | $1.40 | $4.40 | 1643ms | 27 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.