N
Nvidia
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
96.7%2026-03-232026-04-21
Inference Latency
NVIDIA: Nemotron 3 Super (free)12143ms TTFT · 23 TPS
NVIDIA: Nemotron Nano 9B V2 (free)2605ms TTFT · 28 TPS
NVIDIA: Nemotron 3 Nano 30B A3B (free)676ms TTFT · 123 TPS
Meta: Llama Guard 4 12B (free)290ms TTFT · 6 TPS
NVIDIA: Nemotron Nano 12B 2 VL (free)2595ms TTFT · 30 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| NVIDIA: Nemotron 3 Super (free) | $0.00 | $0.00 | 12143ms | 23 |
| NVIDIA: Nemotron Nano 9B V2 (free) | $0.00 | $0.00 | 2605ms | 28 |
| NVIDIA: Nemotron 3 Nano 30B A3B (free) | $0.00 | $0.00 | 676ms | 123 |
| Meta: Llama Guard 4 12B (free) | $0.00 | $0.00 | 290ms | 6 |
| NVIDIA: Nemotron Nano 12B 2 VL (free) | $0.00 | $0.00 | 2595ms | 30 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.