F
Fireworks
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
93.3%2026-05-222026-06-20
Inference Latency
OpenAI: gpt-oss-20b728ms TTFT · 43 TPS
DeepSeek: DeepSeek V4 Flash2482ms TTFT · 10 TPS
MiniMax: MiniMax M2.7857ms TTFT · 129 TPS
MoonshotAI: Kimi K2.6613ms TTFT · 125 TPS
Z.ai: GLM 5.21593ms TTFT · 43 TPS
Z.ai: GLM 5.11042ms TTFT · 66 TPS
DeepSeek: DeepSeek V4 Pro1316ms TTFT · 35 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| OpenAI: gpt-oss-20b | $0.07 | $0.30 | 728ms | 43 |
| DeepSeek: DeepSeek V4 Flash | $0.14 | $0.28 | 2482ms | 10 |
| MiniMax: MiniMax M2.7 | $0.30 | $1.20 | 857ms | 129 |
| MoonshotAI: Kimi K2.6 | $0.95 | $4.00 | 613ms | 125 |
| Z.ai: GLM 5.2 | $1.40 | $4.40 | 1593ms | 43 |
| Z.ai: GLM 5.1 | $1.40 | $4.40 | 1042ms | 66 |
| DeepSeek: DeepSeek V4 Pro | $1.74 | $3.48 | 1316ms | 35 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.