LIVE
Models: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 04:31 PMModels: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 04:31 PM
Marketplace
Providers Models
G

Groq

AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating

30-Day Uptime

96.7%
2026-05-222026-06-20

Inference Latency

Meta: Llama 3.1 8B Instruct233ms TTFT · 133 TPS
OpenAI: gpt-oss-safeguard-20b221ms TTFT · 620 TPS
OpenAI: gpt-oss-20b331ms TTFT · 411 TPS
Meta: Llama 4 Scout267ms TTFT · 141 TPS
OpenAI: gpt-oss-120b149ms TTFT · 404 TPS
OpenAI: gpt-oss-120b (exacto)183ms TTFT · 439 TPS
Qwen: Qwen3 32B299ms TTFT · 306 TPS
Meta: Llama 3.3 70B Instruct236ms TTFT · 205 TPS
MoonshotAI: Kimi K2 0905206ms TTFT · 207 TPS
MoonshotAI: Kimi K2 0905 (exacto)114ms TTFT · 192 TPS

Inference Models

ModelInput $/MOutput $/MTTFTTPS
Meta: Llama 3.1 8B Instruct$0.05$0.08233ms133
OpenAI: gpt-oss-safeguard-20b$0.08$0.30221ms620
OpenAI: gpt-oss-20b$0.08$0.30331ms411
Meta: Llama 4 Scout$0.11$0.34267ms141
OpenAI: gpt-oss-120b$0.15$0.60149ms404
OpenAI: gpt-oss-120b (exacto)$0.15$0.60183ms439
Qwen: Qwen3 32B$0.29$0.59299ms306
Meta: Llama 3.3 70B Instruct$0.59$0.79236ms205
MoonshotAI: Kimi K2 0905$1.00$3.00206ms207
MoonshotAI: Kimi K2 0905 (exacto)$1.00$3.00114ms192

Community Reviews

4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★2025-06-10

Good performance but support could be faster.