Nvidia

AGGREGATEDINFERENCE

N/A

Uptime

N/A

Rating

96.7%

2026-05-222026-06-20

NVIDIA: Nemotron 3.5 Content Safety (free)328ms TTFT · 76 TPS

NVIDIA: Nemotron 3 Nano 30B A3B (free)597ms TTFT · 191 TPS

NVIDIA: Nemotron 3 Nano Omni (free)418ms TTFT · 181 TPS

NVIDIA: Nemotron 3 Ultra (free)35997ms TTFT · 3 TPS

NVIDIA: Nemotron Nano 9B V2 (free)1029ms TTFT · 47 TPS

Meta: Llama Guard 4 12B (free)290ms TTFT · 6 TPS

NVIDIA: Nemotron 3 Super (free)3248ms TTFT · 21 TPS

NVIDIA: Nemotron Nano 12B 2 VL (free)1317ms TTFT · 25 TPS

Inference Models

Model	Input $/M	Output $/M	TTFT	TPS
NVIDIA: Nemotron 3.5 Content Safety (free)	$0.00	$0.00	328ms	76
NVIDIA: Nemotron 3 Nano 30B A3B (free)	$0.00	$0.00	597ms	191
NVIDIA: Nemotron 3 Nano Omni (free)	$0.00	$0.00	418ms	181
NVIDIA: Nemotron 3 Ultra (free)	$0.00	$0.00	35997ms	3
NVIDIA: Nemotron Nano 9B V2 (free)	$0.00	$0.00	1029ms	47
Meta: Llama Guard 4 12B (free)	$0.00	$0.00	290ms	6
NVIDIA: Nemotron 3 Super (free)	$0.00	$0.00	3248ms	21
NVIDIA: Nemotron Nano 12B 2 VL (free)	$0.00	$0.00	1317ms	25

4.5★★★★★(2 reviews)

clouduser42

★★★★★2025-06-15

Reliable service, great API documentation.

mlresearcher

★★★★☆2025-06-10

Good performance but support could be faster.