LIVE
Models: —+ · Providers: —+ · Cheapest H100: $2.49/hr · Updated: 04:32 PM

OpenInference

AGGREGATED INFERENCE
Uptime: N/A
Rating: N/A

30-Day Uptime

100%
2026-05-22 to 2026-06-20

Inference Latency

Google: Gemma 4 31B (free) · 1002ms TTFT · 36 TPS
OpenAI: gpt-oss-20b (free) · 529ms TTFT · 31 TPS
OpenAI: gpt-oss-120b (free) · 852ms TTFT · 23 TPS

Inference Models

Model                          Input $/M   Output $/M   TTFT     TPS
Google: Gemma 4 31B (free)     $0.00       $0.00        1002ms   36
OpenAI: gpt-oss-20b (free)     $0.00       $0.00        529ms    31
OpenAI: gpt-oss-120b (free)    $0.00       $0.00        852ms    23
MiniMax: MiniMax M2.5 (free)   $0.00       $0.00

Community Reviews

4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★ 2025-06-10

Good performance but support could be faster.