LIVE
Models: —+ | Providers: —+ | Cheapest H100: $2.49/hr | Updated: 04:26 PM

Nvidia

AGGREGATED | INFERENCE
Uptime: N/A
Rating: N/A

30-Day Uptime

96.7%
2026-05-22 to 2026-06-20

Inference Latency

NVIDIA: Nemotron 3.5 Content Safety (free)   328 ms TTFT · 76 TPS
NVIDIA: Nemotron 3 Nano 30B A3B (free)       597 ms TTFT · 191 TPS
NVIDIA: Nemotron 3 Nano Omni (free)          418 ms TTFT · 181 TPS
NVIDIA: Nemotron 3 Ultra (free)              35997 ms TTFT · 3 TPS
NVIDIA: Nemotron Nano 9B V2 (free)           1029 ms TTFT · 47 TPS
Meta: Llama Guard 4 12B (free)               290 ms TTFT · 6 TPS
NVIDIA: Nemotron 3 Super (free)              3248 ms TTFT · 21 TPS
NVIDIA: Nemotron Nano 12B 2 VL (free)        1317 ms TTFT · 25 TPS

Inference Models

Model                                        Input $/M  Output $/M  TTFT      TPS
NVIDIA: Nemotron 3.5 Content Safety (free)   $0.00      $0.00       328 ms    76
NVIDIA: Nemotron 3 Nano 30B A3B (free)       $0.00      $0.00       597 ms    191
NVIDIA: Nemotron 3 Nano Omni (free)          $0.00      $0.00       418 ms    181
NVIDIA: Nemotron 3 Ultra (free)              $0.00      $0.00       35997 ms  3
NVIDIA: Nemotron Nano 9B V2 (free)           $0.00      $0.00       1029 ms   47
Meta: Llama Guard 4 12B (free)               $0.00      $0.00       290 ms    6
NVIDIA: Nemotron 3 Super (free)              $0.00      $0.00       3248 ms   21
NVIDIA: Nemotron Nano 12B 2 VL (free)        $0.00      $0.00       1317 ms   25
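
The TTFT and TPS figures listed above can be combined into a rough end-to-end estimate for a response of a given length. The sketch below assumes TPS means output tokens per second once streaming has started and that it stays constant for the whole response; the page does not define either metric, so treat the result as a ballpark only. The function name `estimated_seconds` and the 500-token response length are illustrative choices, not part of the listing.

```python
def estimated_seconds(ttft_ms: float, tps: float, output_tokens: int) -> float:
    """Rough response time: first-token delay plus steady-rate streaming.

    Assumes (not stated on the page) that TPS is a constant output rate
    that applies after the first token arrives.
    """
    if tps <= 0:
        raise ValueError("tps must be positive")
    return ttft_ms / 1000.0 + output_tokens / tps


# (TTFT in ms, TPS) pairs copied from the listing above.
listings = {
    "NVIDIA: Nemotron 3 Nano 30B A3B (free)": (597, 191),
    "NVIDIA: Nemotron 3 Ultra (free)": (35997, 3),
    "Meta: Llama Guard 4 12B (free)": (290, 6),
}

for name, (ttft_ms, tps) in listings.items():
    seconds = estimated_seconds(ttft_ms, tps, output_tokens=500)
    print(f"{name}: about {seconds:.1f} s for 500 output tokens")
```

Under these assumptions, a low TTFT does not by itself make a listing fast: the Llama Guard entry has the shortest first-token delay but one of the lowest TPS values, so for long outputs the streaming term dominates its estimate.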

Community Reviews

4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★ 2025-06-10

Good performance but support could be faster.