Friendli
AGGREGATED · INFERENCE
Uptime: N/A
Rating: N/A
30-Day Uptime
96.7% (2026-03-23 to 2026-04-21)
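To put the uptime figure in concrete terms, the following minimal sketch converts a percentage into implied hours of downtime. It assumes the window is inclusive of both dates and that uptime is measured as a fraction of wall-clock time; the page does not state the methodology, and a request-success-rate metric would need a different reading. The helper name `downtime_hours` is hypothetical.

```python
from datetime import date

def downtime_hours(uptime_pct: float, start: date, end: date) -> float:
    """Hours of downtime implied by an uptime percentage over an inclusive date window."""
    days = (end - start).days + 1  # assumption: both endpoints count as full days
    return (1 - uptime_pct / 100) * days * 24

# Figures taken from the 30-day uptime line above.
hours = downtime_hours(96.7, date(2026, 3, 23), date(2026, 4, 21))
print(f"{hours:.2f} hours of implied downtime")
```

Under those assumptions, 96.7% over a 30-day window corresponds to a little under one full day of unavailability.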
Inference Latency
Meta: Llama 3.1 8B Instruct (104ms TTFT · 172 TPS)
Qwen: Qwen3 235B A22B Instruct 2507 (278ms TTFT · 32 TPS)
MiniMax: MiniMax M2.5 (431ms TTFT · 54 TPS)
DeepSeek: DeepSeek V3.2 (2962ms TTFT · 22 TPS)
Meta: Llama 3.3 70B Instruct (203ms TTFT · 81 TPS)
Z.ai: GLM 5 (1993ms TTFT · 45 TPS)
Z.ai: GLM 5.1 (1114ms TTFT · 62 TPS)
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| Meta: Llama 3.1 8B Instruct | $0.10 | $0.10 | 104ms | 172 |
| Qwen: Qwen3 235B A22B Instruct 2507 | $0.20 | $0.80 | 278ms | 32 |
| MiniMax: MiniMax M2.5 | $0.30 | $1.20 | 431ms | 54 |
| DeepSeek: DeepSeek V3.2 | $0.50 | $1.50 | 2962ms | 22 |
| Meta: Llama 3.3 70B Instruct | $0.60 | $0.60 | 203ms | 81 |
| Z.ai: GLM 5 | $1.00 | $3.20 | 1993ms | 45 |
| Z.ai: GLM 5.1 | $1.40 | $4.40 | 1114ms | 62 |
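
The table's columns can be combined into a rough per-request estimate: cost is token counts times the per-million prices, and response time is TTFT plus output tokens divided by TPS. The sketch below assumes TPS is a steady output-token rate and that TTFT already covers prompt processing; neither is confirmed by the page, and real latency varies with load and prompt length. The `estimate` function and `MODELS` mapping are hypothetical names for illustration only.

```python
# (input $/M tokens, output $/M tokens, TTFT in ms, output tokens per second),
# copied from the table above.
MODELS = {
    "Meta: Llama 3.1 8B Instruct": (0.10, 0.10, 104, 172),
    "Qwen: Qwen3 235B A22B Instruct 2507": (0.20, 0.80, 278, 32),
    "MiniMax: MiniMax M2.5": (0.30, 1.20, 431, 54),
    "DeepSeek: DeepSeek V3.2": (0.50, 1.50, 2962, 22),
    "Meta: Llama 3.3 70B Instruct": (0.60, 0.60, 203, 81),
    "Z.ai: GLM 5": (1.00, 3.20, 1993, 45),
    "Z.ai: GLM 5.1": (1.40, 4.40, 1114, 62),
}

def estimate(model: str, input_tokens: int, output_tokens: int) -> tuple[float, float]:
    """Return (cost in USD, estimated seconds) for one request."""
    in_price, out_price, ttft_ms, tps = MODELS[model]
    # Prices are quoted per million tokens.
    cost = (input_tokens * in_price + output_tokens * out_price) / 1_000_000
    # Assumption: streaming runs at a constant TPS once the first token arrives.
    seconds = ttft_ms / 1000 + output_tokens / tps
    return cost, seconds

# Example: a 2,000-token prompt with a 500-token reply on every listed model.
for name in MODELS:
    cost, seconds = estimate(name, 2000, 500)
    print(f"{name}: ${cost:.5f}, ~{seconds:.1f}s")
```

For short replies the TTFT term dominates, so a model with high TTFT can feel slower than one with lower TPS; for long replies the throughput term takes over.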
Community Reviews
4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆ 2025-06-10
Good performance but support could be faster.