Models: —+ | Providers: —+ | Cheapest H100: $2.49/hr | Updated: 05:20 PM

Groq

AGGREGATED · INFERENCE
Uptime: N/A
Rating: N/A

30-Day Uptime

96.7% (2026-03-23 to 2026-04-21)

Inference Latency

Meta: Llama 3.1 8B Instruct        122ms TTFT · 237 TPS
OpenAI: gpt-oss-safeguard-20b      188ms TTFT · 546 TPS
OpenAI: gpt-oss-20b                196ms TTFT · 566 TPS
Meta: Llama 4 Scout                675ms TTFT · 106 TPS
OpenAI: gpt-oss-120b (exacto)      183ms TTFT · 439 TPS
OpenAI: gpt-oss-120b               305ms TTFT · 319 TPS
Qwen: Qwen3 32B                    371ms TTFT · 373 TPS
Meta: Llama 3.3 70B Instruct       334ms TTFT · 126 TPS
MoonshotAI: Kimi K2 0905           192ms TTFT · 207 TPS
MoonshotAI: Kimi K2 0905 (exacto)  114ms TTFT · 192 TPS

Inference Models

Model                              Input $/M  Output $/M  TTFT   TPS
Meta: Llama 3.1 8B Instruct        $0.05      $0.08       122ms  237
OpenAI: gpt-oss-safeguard-20b      $0.08      $0.30       188ms  546
OpenAI: gpt-oss-20b                $0.08      $0.30       196ms  566
Meta: Llama 4 Scout                $0.11      $0.34       675ms  106
OpenAI: gpt-oss-120b (exacto)      $0.15      $0.60       183ms  439
OpenAI: gpt-oss-120b               $0.15      $0.60       305ms  319
Qwen: Qwen3 32B                    $0.29      $0.59       371ms  373
Meta: Llama 3.3 70B Instruct       $0.59      $0.79       334ms  126
MoonshotAI: Kimi K2 0905           $1.00      $3.00       192ms  207
MoonshotAI: Kimi K2 0905 (exacto)  $1.00      $3.00       114ms  192
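
As a rough guide to reading the table above, the sketch below estimates the cost and response time of a single request from one row. It rests on assumptions not stated on this page: that "$/M" means US dollars per million tokens, that TTFT covers the wait before the first output token, and that TPS is a steady output rate. The function name `estimate_request` and the token counts are hypothetical, and real latency will also include network and queueing time.

```python
def estimate_request(input_per_m, output_per_m, ttft_ms, tps,
                     input_tokens, output_tokens):
    """Return (cost_usd, latency_s) for one request.

    Assumes prices are USD per million tokens and that output
    tokens stream at a constant `tps` after the first token.
    """
    cost = (input_tokens / 1_000_000) * input_per_m \
         + (output_tokens / 1_000_000) * output_per_m
    latency = ttft_ms / 1000 + output_tokens / tps
    return cost, latency


# Values taken from the "Meta: Llama 3.3 70B Instruct" row;
# the token counts are illustrative only.
cost, latency = estimate_request(
    input_per_m=0.59, output_per_m=0.79, ttft_ms=334, tps=126,
    input_tokens=2000, output_tokens=500,
)
print(f"estimated cost: ${cost:.6f}, estimated latency: {latency:.2f} s")
```

Running the same function over several rows gives a quick way to compare a cheap, slower model against a pricier, faster one for a given prompt and response size.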

Community Reviews

4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★ 2025-06-10

Good performance but support could be faster.