Groq
AGGREGATED · INFERENCE
Uptime: N/A
Rating: N/A
30-Day Uptime
96.7% (2026-03-23 to 2026-04-21)
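As a rough illustration of what this uptime figure implies, the sketch below converts the percentage into hours of downtime over the reporting window. It assumes the window is inclusive of both dates and that uptime is measured by wall-clock time rather than by request success rate; neither is stated on this page, and `downtime_hours` is a hypothetical helper name.

```python
from datetime import date


def downtime_hours(uptime_pct: float, start: date, end: date) -> float:
    """Hours of downtime implied by an uptime percentage over a date window.

    Assumption: both endpoints are counted, and uptime is time-based.
    """
    days = (end - start).days + 1  # inclusive of both endpoints
    total_hours = days * 24
    return total_hours * (1 - uptime_pct / 100)


hours = downtime_hours(96.7, date(2026, 3, 23), date(2026, 4, 21))
print(f"Implied downtime: {hours:.1f} hours over the window")
```

Under these assumptions the window spans 30 days (720 hours), so 96.7% uptime corresponds to roughly one day of cumulative downtime.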
Inference Latency
Meta: Llama 3.1 8B Instruct · 122ms TTFT · 237 TPS
OpenAI: gpt-oss-safeguard-20b · 188ms TTFT · 546 TPS
OpenAI: gpt-oss-20b · 196ms TTFT · 566 TPS
Meta: Llama 4 Scout · 675ms TTFT · 106 TPS
OpenAI: gpt-oss-120b (exacto) · 183ms TTFT · 439 TPS
OpenAI: gpt-oss-120b · 305ms TTFT · 319 TPS
Qwen: Qwen3 32B · 371ms TTFT · 373 TPS
Meta: Llama 3.3 70B Instruct · 334ms TTFT · 126 TPS
MoonshotAI: Kimi K2 0905 · 192ms TTFT · 207 TPS
MoonshotAI: Kimi K2 0905 (exacto) · 114ms TTFT · 192 TPS
Inference Models
| Model | Input ($/M tokens) | Output ($/M tokens) | TTFT | TPS |
|---|---|---|---|---|
| Meta: Llama 3.1 8B Instruct | $0.05 | $0.08 | 122ms | 237 |
| OpenAI: gpt-oss-safeguard-20b | $0.08 | $0.30 | 188ms | 546 |
| OpenAI: gpt-oss-20b | $0.08 | $0.30 | 196ms | 566 |
| Meta: Llama 4 Scout | $0.11 | $0.34 | 675ms | 106 |
| OpenAI: gpt-oss-120b (exacto) | $0.15 | $0.60 | 183ms | 439 |
| OpenAI: gpt-oss-120b | $0.15 | $0.60 | 305ms | 319 |
| Qwen: Qwen3 32B | $0.29 | $0.59 | 371ms | 373 |
| Meta: Llama 3.3 70B Instruct | $0.59 | $0.79 | 334ms | 126 |
| MoonshotAI: Kimi K2 0905 | $1.00 | $3.00 | 192ms | 207 |
| MoonshotAI: Kimi K2 0905 (exacto) | $1.00 | $3.00 | 114ms | 192 |
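The figures in this table can be combined into a rough per-request estimate of cost and response time. The sketch below is illustrative only: it assumes "$/M" means US dollars per million tokens, that TTFT is time to first token, and that TPS is a steady output rate in tokens per second. `ModelStats`, `request_cost_usd`, and `response_time_s` are hypothetical names, not part of any provider API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelStats:
    input_usd_per_m: float   # assumed: USD per million input tokens
    output_usd_per_m: float  # assumed: USD per million output tokens
    ttft_ms: float           # time to first token, in milliseconds
    tps: float               # output tokens per second


def request_cost_usd(m: ModelStats, input_tokens: int, output_tokens: int) -> float:
    """Cost of one request at the listed per-million-token prices."""
    return (input_tokens * m.input_usd_per_m
            + output_tokens * m.output_usd_per_m) / 1_000_000


def response_time_s(m: ModelStats, output_tokens: int) -> float:
    """Approximate wall-clock time: wait for the first token, then stream the rest."""
    return m.ttft_ms / 1000 + output_tokens / m.tps


# Values copied from the table rows above.
llama_70b = ModelStats(0.59, 0.79, 334, 126)   # Meta: Llama 3.3 70B Instruct
gpt_oss_20b = ModelStats(0.08, 0.30, 196, 566)  # OpenAI: gpt-oss-20b

for name, stats in [("Llama 3.3 70B", llama_70b), ("gpt-oss-20b", gpt_oss_20b)]:
    cost = request_cost_usd(stats, input_tokens=2000, output_tokens=500)
    secs = response_time_s(stats, output_tokens=500)
    print(f"{name}: ${cost:.6f} per request, about {secs:.2f} s")
```

The latency estimate ignores network overhead, queueing, and any variation in throughput during generation, so it is best read as a lower bound for comparing rows rather than a prediction.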
Community Reviews
4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆ 2025-06-10
Good performance but support could be faster.