Nvidia
AGGREGATED · INFERENCE
Uptime: N/A
Rating: N/A
30-Day Uptime: 96.7% (2026-05-22 to 2026-06-20)
Inference Latency
NVIDIA: Nemotron 3.5 Content Safety (free) · 328ms TTFT · 76 TPS
NVIDIA: Nemotron 3 Nano 30B A3B (free) · 597ms TTFT · 191 TPS
NVIDIA: Nemotron 3 Nano Omni (free) · 418ms TTFT · 181 TPS
NVIDIA: Nemotron 3 Ultra (free) · 35997ms TTFT · 3 TPS
NVIDIA: Nemotron Nano 9B V2 (free) · 1029ms TTFT · 47 TPS
Meta: Llama Guard 4 12B (free) · 290ms TTFT · 6 TPS
NVIDIA: Nemotron 3 Super (free) · 3248ms TTFT · 21 TPS
NVIDIA: Nemotron Nano 12B 2 VL (free) · 1317ms TTFT · 25 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| NVIDIA: Nemotron 3.5 Content Safety (free) | $0.00 | $0.00 | 328ms | 76 |
| NVIDIA: Nemotron 3 Nano 30B A3B (free) | $0.00 | $0.00 | 597ms | 191 |
| NVIDIA: Nemotron 3 Nano Omni (free) | $0.00 | $0.00 | 418ms | 181 |
| NVIDIA: Nemotron 3 Ultra (free) | $0.00 | $0.00 | 35997ms | 3 |
| NVIDIA: Nemotron Nano 9B V2 (free) | $0.00 | $0.00 | 1029ms | 47 |
| Meta: Llama Guard 4 12B (free) | $0.00 | $0.00 | 290ms | 6 |
| NVIDIA: Nemotron 3 Super (free) | $0.00 | $0.00 | 3248ms | 21 |
| NVIDIA: Nemotron Nano 12B 2 VL (free) | $0.00 | $0.00 | 1317ms | 25 |
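The TTFT and TPS columns can be combined into a rough end-to-end estimate for a response of a given length. The sketch below assumes that TTFT is the wait before the first token and that TPS is a steady output rate after that; the page does not define either metric, so treat the result as a ballpark figure. The function names `estimated_seconds` and `rank_by_latency` are illustrative and not part of any provider API.

```python
# (TTFT in ms, TPS) copied from the Inference Models table.
MODELS = {
    "NVIDIA: Nemotron 3.5 Content Safety (free)": (328, 76),
    "NVIDIA: Nemotron 3 Nano 30B A3B (free)": (597, 191),
    "NVIDIA: Nemotron 3 Nano Omni (free)": (418, 181),
    "NVIDIA: Nemotron 3 Ultra (free)": (35997, 3),
    "NVIDIA: Nemotron Nano 9B V2 (free)": (1029, 47),
    "Meta: Llama Guard 4 12B (free)": (290, 6),
    "NVIDIA: Nemotron 3 Super (free)": (3248, 21),
    "NVIDIA: Nemotron Nano 12B 2 VL (free)": (1317, 25),
}


def estimated_seconds(ttft_ms: float, tps: float, output_tokens: int) -> float:
    """Assumed model: wait for the first token, then stream at a constant rate."""
    return ttft_ms / 1000 + output_tokens / tps


def rank_by_latency(output_tokens: int) -> list[tuple[str, float]]:
    """Return (model, estimated seconds) pairs, fastest first."""
    estimates = [
        (name, estimated_seconds(ttft_ms, tps, output_tokens))
        for name, (ttft_ms, tps) in MODELS.items()
    ]
    return sorted(estimates, key=lambda pair: pair[1])


if __name__ == "__main__":
    for name, seconds in rank_by_latency(1000):
        print(f"{seconds:8.1f} s  {name}")
```

Under this assumption a low TTFT matters most for short replies, while TPS dominates long ones, so the ranking changes with the requested output length.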
Community Reviews
4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆ 2025-06-10
Good performance but support could be faster.