C
Chutes
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
100%2026-03-232026-04-21
Inference Latency
Qwen: Qwen3 32B5373ms TTFT · 2 TPS
Xiaomi: MiMo-V2-Flash4019ms TTFT · 5 TPS
MiniMax: MiniMax M2.51912ms TTFT · 32 TPS
DeepSeek: DeepSeek V3.12292ms TTFT · 11 TPS
DeepSeek: DeepSeek V3.22448ms TTFT · 5 TPS
TNG: DeepSeek R1T2 Chimera1311ms TTFT · 25 TPS
Qwen: Qwen3.5 397B A17B2102ms TTFT · 22 TPS
Z.ai: GLM 4.72407ms TTFT · 14 TPS
MoonshotAI: Kimi K2.53667ms TTFT · 13 TPS
Z.ai: GLM 5.12747ms TTFT · 27 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| Qwen: Qwen3 32B | $0.08 | $0.24 | 5373ms | 2 |
| Xiaomi: MiMo-V2-Flash | $0.09 | $0.29 | 4019ms | 5 |
| MiniMax: MiniMax M2.5 | $0.15 | $1.20 | 1912ms | 32 |
| OpenGVLab: InternVL3 78B | $0.15 | $0.60 | — | — |
| DeepSeek: DeepSeek V3.1 | $0.27 | $1.00 | 2292ms | 11 |
| DeepSeek: DeepSeek V3.2 | $0.28 | $0.42 | 2448ms | 5 |
| TNG: DeepSeek R1T2 Chimera | $0.30 | $1.10 | 1311ms | 25 |
| Qwen: Qwen3.5 397B A17B | $0.39 | $2.34 | 2102ms | 22 |
| Z.ai: GLM 4.7 | $0.39 | $1.75 | 2407ms | 14 |
| MoonshotAI: Kimi K2.5 | $0.44 | $2.00 | 3667ms | 13 |
| Z.ai: GLM 5 | $0.95 | $2.55 | — | — |
| Z.ai: GLM 5.1 | $1.05 | $3.50 | 2747ms | 27 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.