LIVE
Models: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 05:23 PMModels: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 05:23 PM
Marketplace
Providers Models
C

Chutes

AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating

30-Day Uptime

100%
2026-03-232026-04-21

Inference Latency

Qwen: Qwen3 32B5373ms TTFT · 2 TPS
Xiaomi: MiMo-V2-Flash4019ms TTFT · 5 TPS
MiniMax: MiniMax M2.51912ms TTFT · 32 TPS
DeepSeek: DeepSeek V3.12292ms TTFT · 11 TPS
DeepSeek: DeepSeek V3.22448ms TTFT · 5 TPS
TNG: DeepSeek R1T2 Chimera1311ms TTFT · 25 TPS
Qwen: Qwen3.5 397B A17B2102ms TTFT · 22 TPS
Z.ai: GLM 4.72407ms TTFT · 14 TPS
MoonshotAI: Kimi K2.53667ms TTFT · 13 TPS
Z.ai: GLM 5.12747ms TTFT · 27 TPS

Inference Models

ModelInput $/MOutput $/MTTFTTPS
Qwen: Qwen3 32B$0.08$0.245373ms2
Xiaomi: MiMo-V2-Flash$0.09$0.294019ms5
MiniMax: MiniMax M2.5$0.15$1.201912ms32
OpenGVLab: InternVL3 78B$0.15$0.60
DeepSeek: DeepSeek V3.1$0.27$1.002292ms11
DeepSeek: DeepSeek V3.2$0.28$0.422448ms5
TNG: DeepSeek R1T2 Chimera$0.30$1.101311ms25
Qwen: Qwen3.5 397B A17B$0.39$2.342102ms22
Z.ai: GLM 4.7$0.39$1.752407ms14
MoonshotAI: Kimi K2.5$0.44$2.003667ms13
Z.ai: GLM 5$0.95$2.55
Z.ai: GLM 5.1$1.05$3.502747ms27

Community Reviews

4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★2025-06-10

Good performance but support could be faster.

Chutes — Provider Scorecard — NexusGPU | NexusGPU