LIVE
Models: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 04:31 PMModels: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 04:31 PM
Marketplace
Providers Models
C

Chutes

AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating

30-Day Uptime

93.3%
2026-05-222026-06-20

Inference Latency

Google: Gemma 4 31B2357ms TTFT · 6 TPS
MiniMax: MiniMax M2.51297ms TTFT · 31 TPS
TNG: DeepSeek R1T2 Chimera2087ms TTFT · 20 TPS
Qwen: Qwen3.6 27B2614ms TTFT · 25 TPS
MoonshotAI: Kimi K2.52055ms TTFT · 33 TPS
Qwen: Qwen3.5 397B A17B4097ms TTFT · 41 TPS
MoonshotAI: Kimi K2.62411ms TTFT · 33 TPS
Z.ai: GLM 53088ms TTFT · 35 TPS
Z.ai: GLM 5.13753ms TTFT · 34 TPS

Inference Models

ModelInput $/MOutput $/MTTFTTPS
Google: Gemma 4 31B$0.15$0.422357ms6
MiniMax: MiniMax M2.5$0.15$1.201297ms31
OpenGVLab: InternVL3 78B$0.15$0.60
TNG: DeepSeek R1T2 Chimera$0.30$1.102087ms20
Qwen: Qwen3.6 27B$0.30$2.002614ms25
MoonshotAI: Kimi K2.5$0.44$2.002055ms33
Qwen: Qwen3.5 397B A17B$0.45$3.004097ms41
MoonshotAI: Kimi K2.6$0.74$3.502411ms33
Z.ai: GLM 5$0.95$2.553088ms35
Z.ai: GLM 5.1$0.98$3.083753ms34

Community Reviews

4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★2025-06-10

Good performance but support could be faster.