S
SambaNova
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
100%2026-03-232026-04-21
Inference Latency
OpenAI: gpt-oss-120b1255ms TTFT · 231 TPS
DeepSeek: DeepSeek V3.12446ms TTFT · 11 TPS
MiniMax: MiniMax M2.51611ms TTFT · 120 TPS
Google: Gemma 3 12B1314ms TTFT · 64 TPS
Meta: Llama 3.3 70B Instruct3003ms TTFT · 23 TPS
Meta: Llama 3.3 70B Instruct2641ms TTFT · 23 TPS
Meta: Llama 4 Maverick1544ms TTFT · 116 TPS
DeepSeek: DeepSeek V3.13461ms TTFT · 25 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| OpenAI: gpt-oss-120b | $0.14 | $0.95 | 1255ms | 231 |
| DeepSeek: DeepSeek V3.1 | $0.15 | $0.75 | 2446ms | 11 |
| MiniMax: MiniMax M2.5 | $0.22 | $0.59 | 1611ms | 120 |
| Google: Gemma 3 12B | $0.22 | $0.59 | 1314ms | 64 |
| Meta: Llama 3.3 70B Instruct | $0.45 | $0.90 | 3003ms | 23 |
| Meta: Llama 3.3 70B Instruct | $0.60 | $1.20 | 2641ms | 23 |
| Meta: Llama 4 Maverick | $0.63 | $1.80 | 1544ms | 116 |
| DeepSeek: DeepSeek V3.1 | $0.65 | $1.50 | 3461ms | 25 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.