LIVE
Models: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 04:34 PMModels: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 04:34 PM
Marketplace
Providers Models
F

Friendli

AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating

30-Day Uptime

96.7%
2026-05-222026-06-20

Inference Latency

Qwen: Qwen3 235B A22B Instruct 2507406ms TTFT · 26 TPS
MiniMax: MiniMax M2.5376ms TTFT · 113 TPS
DeepSeek: DeepSeek V3.2533ms TTFT · 31 TPS
Z.ai: GLM 5665ms TTFT · 85 TPS
Z.ai: GLM 5.1444ms TTFT · 67 TPS
Z.ai: GLM 5.24189ms TTFT · 25 TPS

Inference Models

ModelInput $/MOutput $/MTTFTTPS
Qwen: Qwen3 235B A22B Instruct 2507$0.20$0.80406ms26
MiniMax: MiniMax M2.5$0.30$1.20376ms113
DeepSeek: DeepSeek V3.2$0.50$1.50533ms31
Z.ai: GLM 5$1.00$3.20665ms85
Z.ai: GLM 5.1$1.40$4.40444ms67
Z.ai: GLM 5.2$1.40$4.404189ms25

Community Reviews

4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★2025-06-10

Good performance but support could be faster.