LIVE
Models: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 05:16 PMModels: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 05:16 PM
Marketplace
Providers Models
N

Nvidia

AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating

30-Day Uptime

96.7%
2026-03-232026-04-21

Inference Latency

NVIDIA: Nemotron 3 Super (free)12143ms TTFT · 23 TPS
NVIDIA: Nemotron Nano 9B V2 (free)2605ms TTFT · 28 TPS
NVIDIA: Nemotron 3 Nano 30B A3B (free)676ms TTFT · 123 TPS
Meta: Llama Guard 4 12B (free)290ms TTFT · 6 TPS
NVIDIA: Nemotron Nano 12B 2 VL (free)2595ms TTFT · 30 TPS

Inference Models

ModelInput $/MOutput $/MTTFTTPS
NVIDIA: Nemotron 3 Super (free)$0.00$0.0012143ms23
NVIDIA: Nemotron Nano 9B V2 (free)$0.00$0.002605ms28
NVIDIA: Nemotron 3 Nano 30B A3B (free)$0.00$0.00676ms123
Meta: Llama Guard 4 12B (free)$0.00$0.00290ms6
NVIDIA: Nemotron Nano 12B 2 VL (free)$0.00$0.002595ms30

Community Reviews

4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★2025-06-10

Good performance but support could be faster.