SambaNova

AGGREGATEDINFERENCE

N/A

Uptime

N/A

Rating

96.7%

2026-05-222026-06-20

OpenAI: gpt-oss-120b2156ms TTFT · 122 TPS

Meta: Llama 3.3 70B Instruct925ms TTFT · 85 TPS

Meta: Llama 3.3 70B Instruct1366ms TTFT · 95 TPS

MiniMax: MiniMax M2.7909ms TTFT · 202 TPS

DeepSeek: DeepSeek V3.11351ms TTFT · 56 TPS

MiniMax: MiniMax M2.7826ms TTFT · 229 TPS

Inference Models

Model	Input $/M	Output $/M	TTFT	TPS
OpenAI: gpt-oss-120b	$0.14	$0.95	2156ms	122
Meta: Llama 3.3 70B Instruct	$0.45	$0.90	925ms	85
Meta: Llama 3.3 70B Instruct	$0.60	$1.20	1366ms	95
MiniMax: MiniMax M2.7	$0.60	$2.40	909ms	202
DeepSeek: DeepSeek V3.1	$0.65	$1.50	1351ms	56
MiniMax: MiniMax M2.7	$1.60	$6.40	826ms	229
DeepSeek: DeepSeek V3.2	$3.00	$4.50	—	—

4.5★★★★★(2 reviews)

clouduser42

★★★★★2025-06-15

Reliable service, great API documentation.

mlresearcher

★★★★☆2025-06-10

Good performance but support could be faster.