G
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
100%2026-05-222026-06-20
Inference Latency
OpenAI: gpt-oss-20b804ms TTFT · 23 TPS
Google: Gemini 2.0 Flash Lite423ms TTFT · 90 TPS
OpenAI: gpt-oss-120b380ms TTFT · 102 TPS
Google: Gemini 2.5 Flash Lite Preview 09-2025373ms TTFT · 142 TPS
Google: Gemini 2.5 Flash Lite563ms TTFT · 74 TPS
Google: Gemini 2.5 Flash Lite552ms TTFT · 69 TPS
Google: Gemini 2.0 Flash499ms TTFT · 37 TPS
Qwen: Qwen3 Next 80B A3B Instruct853ms TTFT · 62 TPS
Google: Gemma 4 26B A4B 353ms TTFT · 57 TPS
Qwen: Qwen3 Coder 480B A35B (exacto)1707ms TTFT · 45 TPS
Qwen: Qwen3 Coder 480B A35B1474ms TTFT · 25 TPS
Google: Gemini 3.1 Flash Lite984ms TTFT · 106 TPS
Qwen: Qwen3 235B A22B Instruct 2507425ms TTFT · 63 TPS
Meta: Llama 4 Scout439ms TTFT · 82 TPS
Google: Gemini 3.1 Flash Lite Preview655ms TTFT · 97 TPS
Google: Gemini 2.5 Flash767ms TTFT · 84 TPS
MiniMax: MiniMax M2363ms TTFT · 125 TPS
Google: Nano Banana (Gemini 2.5 Flash Image)7172ms TTFT · 169 TPS
Google: Gemini 2.5 Flash1895ms TTFT · 72 TPS
Google: Gemini 2.5 Flash653ms TTFT · 81 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| OpenAI: gpt-oss-20b | $0.07 | $0.25 | 804ms | 23 |
| Google: Gemini 2.0 Flash Lite | $0.08 | $0.30 | 423ms | 90 |
| OpenAI: gpt-oss-120b | $0.09 | $0.36 | 380ms | 102 |
| Google: Gemini 2.5 Flash Lite Preview 09-2025 | $0.10 | $0.40 | 373ms | 142 |
| Google: Gemini 2.5 Flash Lite | $0.10 | $0.40 | 563ms | 74 |
| Google: Gemini 2.5 Flash Lite | $0.10 | $0.40 | 552ms | 69 |
| Google: Gemini 2.0 Flash | $0.10 | $0.40 | 499ms | 37 |
| Qwen: Qwen3 Next 80B A3B Instruct | $0.15 | $1.20 | 853ms | 62 |
| Qwen: Qwen3 Next 80B A3B Thinking | $0.15 | $1.20 | — | — |
| Google: Gemma 4 26B A4B | $0.15 | $0.60 | 353ms | 57 |
| Qwen: Qwen3 Coder 480B A35B (exacto) | $0.22 | $1.80 | 1707ms | 45 |
| Qwen: Qwen3 Coder 480B A35B | $0.22 | $1.80 | 1474ms | 25 |
| Google: Gemini 3.1 Flash Lite | $0.25 | $1.50 | 984ms | 106 |
| Qwen: Qwen3 235B A22B Instruct 2507 | $0.25 | $1.00 | 425ms | 63 |
| Meta: Llama 4 Scout | $0.25 | $0.70 | 439ms | 82 |
| Google: Gemini 3.1 Flash Lite Preview | $0.25 | $1.50 | 655ms | 97 |
| Google: Gemini 2.5 Flash | $0.30 | $2.50 | 767ms | 84 |
| MiniMax: MiniMax M2 | $0.30 | $1.20 | 363ms | 125 |
| Google: Nano Banana (Gemini 2.5 Flash Image) | $0.30 | $2.50 | 7172ms | 169 |
| Google: Gemini 2.5 Flash | $0.30 | $2.50 | 1895ms | 72 |
| Google: Gemini 2.5 Flash | $0.30 | $2.50 | 653ms | 81 |
| Meta: Llama 4 Maverick | $0.35 | $1.15 | 760ms | 62 |
| Google: Gemini 3 Flash Preview | $0.50 | $3.00 | 1098ms | 68 |
| Google: Nano Banana 2 (Gemini 3.1 Flash Image) | $0.50 | $3.00 | 10814ms | 91 |
| Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview) | $0.50 | $3.00 | 10797ms | 80 |
| DeepSeek: DeepSeek V3.2 | $0.56 | $1.68 | 981ms | 17 |
| MoonshotAI: Kimi K2 Thinking | $0.60 | $2.50 | 449ms | 193 |
| Z.ai: GLM 4.7 | $0.60 | $2.20 | 485ms | 159 |
| DeepSeek: DeepSeek V3.1 | $0.60 | $1.70 | 520ms | 106 |
| Meta: Llama 3.3 70B Instruct | $0.72 | $0.72 | 267ms | 52 |
| Meta: Llama 3.3 70B Instruct | $0.72 | $0.72 | 164ms | 35 |
| Anthropic: Claude Haiku 4.5 | $1.00 | $5.00 | 407ms | 90 |
| Anthropic: Claude Haiku 4.5 | $1.00 | $5.00 | 904ms | 92 |
| Google: Gemini 2.5 Pro Preview 06-05 | $1.25 | $10.00 | 2094ms | 106 |
| Google: Gemini 2.5 Pro | $1.25 | $10.00 | 2078ms | 85 |
| Google: Gemini 2.5 Pro | $1.25 | $10.00 | 1083ms | 72 |
| Google: Gemini 2.5 Pro Preview 05-06 | $1.25 | $10.00 | 2094ms | 106 |
| Google: Gemini 2.5 Pro Preview 05-06 | $1.25 | $10.00 | 1083ms | 72 |
| Google: Gemini 2.5 Pro Preview 06-05 | $1.25 | $10.00 | 1083ms | 72 |
| Google: Gemini 2.5 Pro | $1.25 | $10.00 | 2094ms | 106 |
| Google: Gemini 2.5 Pro Preview 06-05 | $1.25 | $10.00 | 2078ms | 85 |
| Google: Gemini 2.5 Pro Preview 05-06 | $1.25 | $10.00 | 2078ms | 85 |
| Google: Gemini 3.5 Flash | $1.50 | $9.00 | 1606ms | 132 |
| Google: Nano Banana Pro (Gemini 3 Pro Image Preview) | $2.00 | $12.00 | 20103ms | 61 |
| Google: Gemini 3.1 Pro Preview | $2.00 | $12.00 | 2860ms | 88 |
| Google: Gemini 3 Pro Preview | $2.00 | $12.00 | 6924ms | 66 |
| Google: Nano Banana Pro (Gemini 3 Pro Image) | $2.00 | $12.00 | 18660ms | 64 |
| Anthropic: Claude Sonnet 4.6 | $3.00 | $15.00 | 1674ms | 26 |
| Anthropic: Claude Sonnet 4.6 | $3.00 | $15.00 | 1317ms | 36 |
| Anthropic: Claude Sonnet 4 | $3.00 | $15.00 | 1012ms | 39 |
| Anthropic: Claude 3.7 Sonnet | $3.00 | $15.00 | — | — |
| Anthropic: Claude Sonnet 4.6 | $3.00 | $15.00 | 1232ms | 40 |
| Anthropic: Claude Sonnet 4 | $3.00 | $15.00 | — | — |
| Anthropic: Claude Sonnet 4.5 | $3.00 | $15.00 | 844ms | 36 |
| Anthropic: Claude 3.7 Sonnet | $3.00 | $15.00 | 859ms | 44 |
| Anthropic: Claude 3.7 Sonnet (thinking) | $3.00 | $15.00 | 1374ms | 49 |
| Anthropic: Claude Sonnet 4 | $3.00 | $15.00 | 803ms | 30 |
| Anthropic: Claude Sonnet 4.5 | $3.00 | $15.00 | 1482ms | 37 |
| Anthropic: Claude Opus 4.7 | $5.00 | $25.00 | 2337ms | 65 |
| Anthropic: Claude Opus 4.5 | $5.00 | $25.00 | 749ms | 45 |
| Anthropic: Claude Opus 4.8 | $5.00 | $25.00 | — | — |
| Anthropic: Claude Opus 4.6 | $5.00 | $25.00 | 2110ms | 43 |
| Anthropic: Claude Opus 4.6 | $5.00 | $25.00 | — | — |
| Anthropic: Claude Opus 4.8 | $5.00 | $25.00 | 1406ms | 46 |
| Anthropic: Claude Opus 4.7 | $5.00 | $25.00 | 1322ms | 51 |
| Anthropic: Claude Fable 5 | $10.00 | $50.00 | — | — |
| Anthropic: Claude Opus 4.1 | $15.00 | $75.00 | 6001ms | 21 |
| Anthropic: Claude Opus 4 | $15.00 | $75.00 | 1968ms | 21 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.