F
Friendli
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
96.7%2026-05-222026-06-20
Inference Latency
Qwen: Qwen3 235B A22B Instruct 2507406ms TTFT · 26 TPS
MiniMax: MiniMax M2.5376ms TTFT · 113 TPS
DeepSeek: DeepSeek V3.2533ms TTFT · 31 TPS
Z.ai: GLM 5665ms TTFT · 85 TPS
Z.ai: GLM 5.1444ms TTFT · 67 TPS
Z.ai: GLM 5.24189ms TTFT · 25 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| Qwen: Qwen3 235B A22B Instruct 2507 | $0.20 | $0.80 | 406ms | 26 |
| MiniMax: MiniMax M2.5 | $0.30 | $1.20 | 376ms | 113 |
| DeepSeek: DeepSeek V3.2 | $0.50 | $1.50 | 533ms | 31 |
| Z.ai: GLM 5 | $1.00 | $3.20 | 665ms | 85 |
| Z.ai: GLM 5.1 | $1.40 | $4.40 | 444ms | 67 |
| Z.ai: GLM 5.2 | $1.40 | $4.40 | 4189ms | 25 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.