C
Chutes
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
93.3%2026-05-222026-06-20
Inference Latency
Google: Gemma 4 31B2357ms TTFT · 6 TPS
MiniMax: MiniMax M2.51297ms TTFT · 31 TPS
TNG: DeepSeek R1T2 Chimera2087ms TTFT · 20 TPS
Qwen: Qwen3.6 27B2614ms TTFT · 25 TPS
MoonshotAI: Kimi K2.52055ms TTFT · 33 TPS
Qwen: Qwen3.5 397B A17B4097ms TTFT · 41 TPS
MoonshotAI: Kimi K2.62411ms TTFT · 33 TPS
Z.ai: GLM 53088ms TTFT · 35 TPS
Z.ai: GLM 5.13753ms TTFT · 34 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| Google: Gemma 4 31B | $0.15 | $0.42 | 2357ms | 6 |
| MiniMax: MiniMax M2.5 | $0.15 | $1.20 | 1297ms | 31 |
| OpenGVLab: InternVL3 78B | $0.15 | $0.60 | — | — |
| TNG: DeepSeek R1T2 Chimera | $0.30 | $1.10 | 2087ms | 20 |
| Qwen: Qwen3.6 27B | $0.30 | $2.00 | 2614ms | 25 |
| MoonshotAI: Kimi K2.5 | $0.44 | $2.00 | 2055ms | 33 |
| Qwen: Qwen3.5 397B A17B | $0.45 | $3.00 | 4097ms | 41 |
| MoonshotAI: Kimi K2.6 | $0.74 | $3.50 | 2411ms | 33 |
| Z.ai: GLM 5 | $0.95 | $2.55 | 3088ms | 35 |
| Z.ai: GLM 5.1 | $0.98 | $3.08 | 3753ms | 34 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.