F
Fireworks
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
96.7%2026-03-232026-04-21
Inference Latency
OpenAI: gpt-oss-20b545ms TTFT · 78 TPS
OpenAI: gpt-oss-120b661ms TTFT · 71 TPS
MiniMax: MiniMax M2.75858ms TTFT · 24 TPS
DeepSeek: DeepSeek V3.11405ms TTFT · 23 TPS
MoonshotAI: Kimi K2.5453ms TTFT · 53 TPS
Z.ai: GLM 54235ms TTFT · 48 TPS
Z.ai: GLM 5.11434ms TTFT · 24 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| OpenAI: gpt-oss-20b | $0.07 | $0.30 | 545ms | 78 |
| OpenAI: gpt-oss-120b | $0.15 | $0.60 | 661ms | 71 |
| MiniMax: MiniMax M2.7 | $0.30 | $1.20 | 5858ms | 24 |
| DeepSeek: DeepSeek V3.1 | $0.56 | $1.68 | 1405ms | 23 |
| MoonshotAI: Kimi K2.5 | $0.60 | $3.00 | 453ms | 53 |
| Z.ai: GLM 5 | $1.00 | $3.20 | 4235ms | 48 |
| Z.ai: GLM 5.1 | $1.40 | $4.40 | 1434ms | 24 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.