Wafer

AGGREGATED INFERENCE

N/A

Uptime

N/A

Rating

93.3%

2026-05-22 to 2026-06-20

DeepSeek: DeepSeek V4 Flash · 986ms TTFT · 12 TPS

Z.ai: GLM 5.1 · 1179ms TTFT · 71 TPS

Z.ai: GLM 5.2 · 1823ms TTFT · 30 TPS

Inference Models

Model	Input $/M	Output $/M	TTFT	TPS
DeepSeek: DeepSeek V4 Flash	$0.09	$0.18	986ms	12
Z.ai: GLM 5.1	$1.00	$3.20	1179ms	71
Z.ai: GLM 5.2	$1.20	$4.10	1823ms	30
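
As a rough illustration of how to read the table above, the sketch below estimates per-request cost and response time from the listed figures. The function name `estimate`, the example token counts, and the latency approximation (TTFT plus output tokens divided by TPS) are assumptions made for illustration; they are not part of the provider's API or its billing rules.

```python
# Values copied from the Inference Models table.
# Prices are USD per million tokens; TTFT is in milliseconds; TPS is tokens per second.
MODELS = {
    "DeepSeek: DeepSeek V4 Flash": {"input": 0.09, "output": 0.18, "ttft_ms": 986, "tps": 12},
    "Z.ai: GLM 5.1": {"input": 1.00, "output": 3.20, "ttft_ms": 1179, "tps": 71},
    "Z.ai: GLM 5.2": {"input": 1.20, "output": 4.10, "ttft_ms": 1823, "tps": 30},
}


def estimate(model, input_tokens, output_tokens):
    """Return (cost in USD, approximate seconds) for one request.

    Hypothetical helper: assumes cost scales linearly with token counts and
    that total time is roughly TTFT plus generation time at the listed TPS.
    """
    m = MODELS[model]
    cost = (input_tokens * m["input"] + output_tokens * m["output"]) / 1_000_000
    seconds = m["ttft_ms"] / 1000 + output_tokens / m["tps"]
    return cost, seconds


if __name__ == "__main__":
    # Example request size chosen arbitrarily: 2,000 input and 500 output tokens.
    for name in MODELS:
        cost, seconds = estimate(name, input_tokens=2_000, output_tokens=500)
        print(f"{name}: ${cost:.6f}, ~{seconds:.1f}s")
```

Under these assumptions, the cheapest model per token is not necessarily the fastest to finish a long response, since the listed TPS dominates total time once the output is more than a few dozen tokens.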

4.5 ★★★★★ (2 reviews)

clouduser42

★★★★★ 2025-06-15

Reliable service, great API documentation.

mlresearcher

★★★★☆ 2025-06-10

Good performance but support could be faster.