Z.AI
AGGREGATED · INFERENCE
Uptime: N/A
Rating: N/A
30-Day Uptime
96.7% (2026-03-23 to 2026-04-21)
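To put the 30-day figure in perspective, the short sketch below converts an uptime percentage over a date window into approximate hours of downtime. It assumes the percentage is a fraction of wall-clock time and that both end dates are inclusive; the page does not state how uptime is measured (it could equally be a share of successful checks), and the function name `downtime_hours` is hypothetical.

```python
from datetime import date


def downtime_hours(uptime_pct: float, start: date, end: date) -> float:
    """Approximate downtime in hours for an uptime percentage over [start, end].

    Assumption: uptime is a share of wall-clock time and both dates are inclusive.
    """
    days = (end - start).days + 1          # inclusive window length in days
    total_hours = days * 24
    return (1 - uptime_pct / 100) * total_hours


# Window and percentage taken from the 30-Day Uptime line above.
window_start = date(2026, 3, 23)
window_end = date(2026, 4, 21)
hours_down = downtime_hours(96.7, window_start, window_end)
print(f"Approximate downtime: {hours_down:.2f} hours")
```

Under these assumptions the window spans 30 days (720 hours), so 96.7% uptime corresponds to a little under one full day of unavailability.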
Inference Latency
Z.ai: GLM 4.5 Air (free): 10686ms TTFT · 12 TPS
Z.ai: GLM 4.7 Flash: 1944ms TTFT · 74 TPS
Z.ai: GLM 4 32B: 1092ms TTFT · 3 TPS
Z.ai: GLM 4.5 Air: 2193ms TTFT · 42 TPS
Z.ai: GLM 4.5: 2625ms TTFT · 43 TPS
Z.ai: GLM 4.6: 3660ms TTFT · 34 TPS
Z.ai: GLM 4.6 (exacto): 2519ms TTFT · 78 TPS
Z.ai: GLM 4.5V: 1956ms TTFT · 55 TPS
Z.ai: GLM 4.7: 4559ms TTFT · 28 TPS
Z.ai: GLM 5: 4944ms TTFT · 30 TPS
Z.ai: GLM 5 Turbo: 2784ms TTFT · 34 TPS
Z.ai: GLM 5V Turbo: 7325ms TTFT · 23 TPS
Z.ai: GLM 5.1: 6280ms TTFT · 18 TPS
Inference Models
| Model | Input ($/M tokens) | Output ($/M tokens) | TTFT | TPS |
|---|---|---|---|---|
| Z.ai: GLM 4.5 Air (free) | $0.00 | $0.00 | 10686ms | 12 |
| Z.ai: GLM 4.7 Flash | $0.07 | $0.40 | 1944ms | 74 |
| Z.ai: GLM 4 32B | $0.10 | $0.10 | 1092ms | 3 |
| Z.ai: GLM 4.5 Air | $0.20 | $1.10 | 2193ms | 42 |
| Z.ai: GLM 4.6V | $0.30 | $0.90 | — | — |
| Z.ai: GLM 4.5 | $0.60 | $2.20 | 2625ms | 43 |
| Z.ai: GLM 4.6 | $0.60 | $2.20 | 3660ms | 34 |
| Z.ai: GLM 4.6 (exacto) | $0.60 | $2.20 | 2519ms | 78 |
| Z.ai: GLM 4.5V | $0.60 | $1.80 | 1956ms | 55 |
| Z.ai: GLM 4.7 | $0.60 | $2.20 | 4559ms | 28 |
| Z.ai: GLM 5 | $1.00 | $3.20 | 4944ms | 30 |
| Z.ai: GLM 5 Turbo | $1.20 | $4.00 | 2784ms | 34 |
| Z.ai: GLM 5V Turbo | $1.20 | $4.00 | 7325ms | 23 |
| Z.ai: GLM 5.1 | $1.40 | $4.40 | 6280ms | 18 |
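
The figures in the table can be combined into a rough per-request estimate. The sketch below is illustrative only: it assumes "$/M" means US dollars per million tokens, that TPS is output tokens generated per second after the first token, and that the listed TTFT and TPS values are typical rather than guaranteed. `ModelStats`, `estimate_cost_usd`, and `estimate_latency_s` are hypothetical names, not part of any Z.ai API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelStats:
    """One row of the table above (hypothetical container for illustration)."""
    name: str
    input_usd_per_m: float    # assumed: USD per 1M input tokens
    output_usd_per_m: float   # assumed: USD per 1M output tokens
    ttft_ms: float            # time to first token, in milliseconds
    tps: float                # assumed: output tokens per second


def estimate_cost_usd(m: ModelStats, input_tokens: int, output_tokens: int) -> float:
    """Cost = tokens * price per million tokens, summed over input and output."""
    return (input_tokens * m.input_usd_per_m
            + output_tokens * m.output_usd_per_m) / 1_000_000


def estimate_latency_s(m: ModelStats, output_tokens: int) -> float:
    """Rough wall-clock time: wait for the first token, then stream the rest."""
    return m.ttft_ms / 1000 + output_tokens / m.tps


# Two rows copied from the table above.
GLM_4_7_FLASH = ModelStats("Z.ai: GLM 4.7 Flash", 0.07, 0.40, 1944, 74)
GLM_5_1 = ModelStats("Z.ai: GLM 5.1", 1.40, 4.40, 6280, 18)

for model in (GLM_4_7_FLASH, GLM_5_1):
    cost = estimate_cost_usd(model, input_tokens=10_000, output_tokens=1_000)
    secs = estimate_latency_s(model, output_tokens=1_000)
    print(f"{model.name}: ~${cost:.4f}, ~{secs:.1f} s")
```

For a request with 10,000 input tokens and 1,000 output tokens, this puts GLM 4.7 Flash at roughly a tenth of a cent and about 15 seconds, against roughly 1.8 cents and about a minute for GLM 5.1. Real latency will vary with load and prompt length.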
Community Reviews
4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆ 2025-06-10
Good performance but support could be faster.