LIVE
Models: —+ · Providers: —+ · Cheapest H100: $2.49/hr · Updated: 05:19 PM

Z.AI

AGGREGATED · INFERENCE
Uptime: N/A
Rating: N/A

30-Day Uptime

96.7%
2026-03-23 to 2026-04-21
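
As a rough illustration of what the 30-day figure implies, the sketch below converts an uptime percentage into hours of downtime. It assumes the window is exactly 30 days and that uptime is measured against wall-clock time; `downtime_hours` is a hypothetical helper, not part of any Z.AI or marketplace API.

```python
def downtime_hours(uptime_percent: float, window_days: int = 30) -> float:
    """Hours of downtime implied by an uptime percentage.

    Assumption: uptime is measured over the full wall-clock window.
    """
    return (1 - uptime_percent / 100) * window_days * 24


# 96.7% is the 30-day uptime listed above.
print(f"{downtime_hours(96.7):.2f} hours of downtime")
```

Under these assumptions, 96.7% over 30 days works out to a little under 24 hours of downtime.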

Inference Latency

Z.ai: GLM 4.5 Air (free) · 10686ms TTFT · 12 TPS
Z.ai: GLM 4.7 Flash · 1944ms TTFT · 74 TPS
Z.ai: GLM 4 32B · 1092ms TTFT · 3 TPS
Z.ai: GLM 4.5 Air · 2193ms TTFT · 42 TPS
Z.ai: GLM 4.5 · 2625ms TTFT · 43 TPS
Z.ai: GLM 4.6 · 3660ms TTFT · 34 TPS
Z.ai: GLM 4.6 (exacto) · 2519ms TTFT · 78 TPS
Z.ai: GLM 4.5V · 1956ms TTFT · 55 TPS
Z.ai: GLM 4.7 · 4559ms TTFT · 28 TPS
Z.ai: GLM 5 · 4944ms TTFT · 30 TPS
Z.ai: GLM 5 Turbo · 2784ms TTFT · 34 TPS
Z.ai: GLM 5V Turbo · 7325ms TTFT · 23 TPS
Z.ai: GLM 5.1 · 6280ms TTFT · 18 TPS
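
TTFT and TPS can be combined into a rough end-to-end estimate for a response of a given length. The sketch below assumes TTFT is time to first token in milliseconds, TPS is a steady output-token rate after the first token, and network overhead is ignored; `estimated_response_seconds` is a hypothetical helper name.

```python
def estimated_response_seconds(ttft_ms: float, tps: float, output_tokens: int) -> float:
    """Approximate wall-clock time for a full response.

    Assumption: time = time to first token + output tokens / tokens per second.
    """
    return ttft_ms / 1000 + output_tokens / tps


# Figures taken from the latency list above; 500 output tokens is an arbitrary example size.
for name, ttft_ms, tps in [
    ("GLM 4.7 Flash", 1944, 74),
    ("GLM 4 32B", 1092, 3),
]:
    seconds = estimated_response_seconds(ttft_ms, tps, output_tokens=500)
    print(f"{name}: {seconds:.1f} s")
```

On these assumptions, the model with the lowest TTFT in the list (GLM 4 32B) would still be far slower for a 500-token reply, because its throughput is 3 TPS.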

Inference Models

Model | Input $/M | Output $/M | TTFT | TPS
Z.ai: GLM 4.5 Air (free) | $0.00 | $0.00 | 10686ms | 12
Z.ai: GLM 4.7 Flash | $0.07 | $0.40 | 1944ms | 74
Z.ai: GLM 4 32B | $0.10 | $0.10 | 1092ms | 3
Z.ai: GLM 4.5 Air | $0.20 | $1.10 | 2193ms | 42
Z.ai: GLM 4.6V | $0.30 | $0.90 | |
Z.ai: GLM 4.5 | $0.60 | $2.20 | 2625ms | 43
Z.ai: GLM 4.6 | $0.60 | $2.20 | 3660ms | 34
Z.ai: GLM 4.6 (exacto) | $0.60 | $2.20 | 2519ms | 78
Z.ai: GLM 4.5V | $0.60 | $1.80 | 1956ms | 55
Z.ai: GLM 4.7 | $0.60 | $2.20 | 4559ms | 28
Z.ai: GLM 5 | $1.00 | $3.20 | 4944ms | 30
Z.ai: GLM 5 Turbo | $1.20 | $4.00 | 2784ms | 34
Z.ai: GLM 5V Turbo | $1.20 | $4.00 | 7325ms | 23
Z.ai: GLM 5.1 | $1.40 | $4.40 | 6280ms | 18
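
The listed prices can be turned into a per-request cost estimate. The sketch below assumes "$/M" means US dollars per one million tokens and that input and output tokens are billed separately at the listed rates; `request_cost_usd` is a hypothetical helper, and the token counts are arbitrary examples.

```python
def request_cost_usd(
    input_price_per_m: float,
    output_price_per_m: float,
    input_tokens: int,
    output_tokens: int,
) -> float:
    """Estimated cost of one request.

    Assumption: prices are USD per 1,000,000 tokens, billed separately
    for input and output.
    """
    return (
        input_tokens / 1_000_000 * input_price_per_m
        + output_tokens / 1_000_000 * output_price_per_m
    )


# GLM 4.6 is listed at $0.60 input and $2.20 output.
cost = request_cost_usd(0.60, 2.20, input_tokens=10_000, output_tokens=2_000)
print(f"Estimated cost: ${cost:.4f}")
```

With these example sizes, the input side contributes $0.0060 and the output side $0.0044, so output pricing matters even when responses are much shorter than prompts.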

Community Reviews

4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★ 2025-06-10

Good performance but support could be faster.