LIVE
Models: N/A · Providers: N/A · Cheapest H100: $2.49/hr · Updated: 05:14 PM

Cerebras

AGGREGATED · INFERENCE
Uptime: N/A
Rating: N/A

30-Day Uptime

96.7% (2026-03-23 to 2026-04-21)

Inference Latency

Meta: Llama 3.1 8B Instruct: 200ms TTFT · 60 TPS
OpenAI: gpt-oss-120b: 791ms TTFT · 301 TPS
Qwen: Qwen3 235B A22B Instruct 2507: 837ms TTFT · 3 TPS
Z.ai: GLM 4.7: 450ms TTFT · 352 TPS
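
As a rough way to read these figures together, the sketch below estimates wall-clock time for a response as TTFT plus output tokens divided by TPS. It assumes that TTFT means time to first token, that TPS is steady-state output tokens per second, and that both stay constant for the whole response; none of this is stated on the page, and the function name is hypothetical.

```python
def estimated_response_seconds(ttft_ms: float, tps: float, output_tokens: int) -> float:
    """Rough estimate: time to first token plus steady-state decoding time."""
    if tps <= 0:
        raise ValueError("tps must be positive")
    return ttft_ms / 1000.0 + output_tokens / tps


# (TTFT in ms, TPS) copied from the listing above.
LATENCY = {
    "Meta: Llama 3.1 8B Instruct": (200, 60),
    "OpenAI: gpt-oss-120b": (791, 301),
    "Qwen: Qwen3 235B A22B Instruct 2507": (837, 3),
    "Z.ai: GLM 4.7": (450, 352),
}

if __name__ == "__main__":
    for model, (ttft_ms, tps) in LATENCY.items():
        secs = estimated_response_seconds(ttft_ms, tps, output_tokens=500)
        print(f"{model}: ~{secs:.1f}s for 500 output tokens")
```

Under these assumptions, a model with a low TTFT but low TPS can still be much slower on long responses than one with a higher TTFT and high TPS.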

Inference Models

Model                                Input $/M  Output $/M  TTFT   TPS
Meta: Llama 3.1 8B Instruct          $0.10      $0.10       200ms  60
OpenAI: gpt-oss-120b                 $0.35      $0.75       791ms  301
Qwen: Qwen3 235B A22B Instruct 2507  $0.60      $1.20       837ms  3
Z.ai: GLM 4.7                        $2.25      $2.75       450ms  352
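
To turn the listed prices into a per-request figure, the following is a minimal sketch. It assumes "$/M" means US dollars per one million tokens and that input and output tokens are billed separately at the listed rates; the page does not spell out either point. The function name `request_cost_usd` is hypothetical.

```python
from decimal import Decimal

# (input $/M, output $/M) copied from the table above.
PRICES = {
    "Meta: Llama 3.1 8B Instruct": (Decimal("0.10"), Decimal("0.10")),
    "OpenAI: gpt-oss-120b": (Decimal("0.35"), Decimal("0.75")),
    "Qwen: Qwen3 235B A22B Instruct 2507": (Decimal("0.60"), Decimal("1.20")),
    "Z.ai: GLM 4.7": (Decimal("2.25"), Decimal("2.75")),
}

TOKENS_PER_MILLION = Decimal(1_000_000)


def request_cost_usd(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Cost of one request, assuming prices are USD per million tokens."""
    input_price, output_price = PRICES[model]
    total = input_price * input_tokens + output_price * output_tokens
    return total / TOKENS_PER_MILLION


if __name__ == "__main__":
    # Example: 2,000 prompt tokens and 500 completion tokens per request.
    for model in PRICES:
        print(f"{model}: ${request_cost_usd(model, 2_000, 500)}")
```

`Decimal` is used instead of floats so that small per-request amounts add up without rounding drift.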

Community Reviews

4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★ 2025-06-10

Good performance but support could be faster.