S
SambaNova
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
96.7%2026-05-222026-06-20
Inference Latency
OpenAI: gpt-oss-120b2156ms TTFT · 122 TPS
Meta: Llama 3.3 70B Instruct925ms TTFT · 85 TPS
Meta: Llama 3.3 70B Instruct1366ms TTFT · 95 TPS
MiniMax: MiniMax M2.7909ms TTFT · 202 TPS
DeepSeek: DeepSeek V3.11351ms TTFT · 56 TPS
MiniMax: MiniMax M2.7826ms TTFT · 229 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| OpenAI: gpt-oss-120b | $0.14 | $0.95 | 2156ms | 122 |
| Meta: Llama 3.3 70B Instruct | $0.45 | $0.90 | 925ms | 85 |
| Meta: Llama 3.3 70B Instruct | $0.60 | $1.20 | 1366ms | 95 |
| MiniMax: MiniMax M2.7 | $0.60 | $2.40 | 909ms | 202 |
| DeepSeek: DeepSeek V3.1 | $0.65 | $1.50 | 1351ms | 56 |
| MiniMax: MiniMax M2.7 | $1.60 | $6.40 | 826ms | 229 |
| DeepSeek: DeepSeek V3.2 | $3.00 | $4.50 | — | — |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.