LIVE
Models: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 05:19 PMModels: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 05:19 PM
Marketplace
Providers Models
F

Friendli

AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating

30-Day Uptime

96.7%
2026-03-232026-04-21

Inference Latency

Meta: Llama 3.1 8B Instruct104ms TTFT · 172 TPS
Qwen: Qwen3 235B A22B Instruct 2507278ms TTFT · 32 TPS
MiniMax: MiniMax M2.5431ms TTFT · 54 TPS
DeepSeek: DeepSeek V3.22962ms TTFT · 22 TPS
Meta: Llama 3.3 70B Instruct203ms TTFT · 81 TPS
Z.ai: GLM 51993ms TTFT · 45 TPS
Z.ai: GLM 5.11114ms TTFT · 62 TPS

Inference Models

ModelInput $/MOutput $/MTTFTTPS
Meta: Llama 3.1 8B Instruct$0.10$0.10104ms172
Qwen: Qwen3 235B A22B Instruct 2507$0.20$0.80278ms32
MiniMax: MiniMax M2.5$0.30$1.20431ms54
DeepSeek: DeepSeek V3.2$0.50$1.502962ms22
Meta: Llama 3.3 70B Instruct$0.60$0.60203ms81
Z.ai: GLM 5$1.00$3.201993ms45
Z.ai: GLM 5.1$1.40$4.401114ms62

Community Reviews

4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★2025-06-10

Good performance but support could be faster.