Models: —+ · Providers: —+ · Cheapest H100: $2.49/hr · Updated: 04:24 PM

Wafer

AGGREGATED · INFERENCE
Uptime: N/A
Rating: N/A

30-Day Uptime

93.3% (2026-05-22 to 2026-06-20)

Inference Latency

DeepSeek: DeepSeek V4 Flash  986ms TTFT · 12 TPS
Z.ai: GLM 5.1                1179ms TTFT · 71 TPS
Z.ai: GLM 5.2                1823ms TTFT · 30 TPS

Inference Models

Model                        Input $/M  Output $/M  TTFT    TPS
DeepSeek: DeepSeek V4 Flash  $0.09      $0.18       986ms   12
Z.ai: GLM 5.1                $1.00      $3.20       1179ms  71
Z.ai: GLM 5.2                $1.20      $4.10       1823ms  30

Community Reviews

4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★ 2025-06-10

Good performance but support could be faster.

Wafer: Provider Scorecard | NexusGPU