LIVE
Models: —+ · Providers: —+ · Cheapest H100: $2.49/hr · Updated: 04:30 PM

BaseTen

AGGREGATED · INFERENCE
Uptime: N/A
Rating: N/A

30-Day Uptime

96.7%
2026-05-22 to 2026-06-20

Inference Latency

OpenAI: gpt-oss-120b · 165ms TTFT · 346 TPS
MoonshotAI: Kimi K2.6 · 2454ms TTFT · 117 TPS
Z.ai: GLM 5.1 · 510ms TTFT · 111 TPS

Inference Models

Model                   Input $/M   Output $/M   TTFT     TPS
OpenAI: gpt-oss-120b    $0.10       $0.50        165ms    346
MoonshotAI: Kimi K2.6   $0.95       $4.00        2454ms   117
Z.ai: GLM 5.1           $1.30       $4.30        510ms    111

Community Reviews

4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★ 2025-06-10

Good performance but support could be faster.