G
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
100%2026-03-232026-04-21
Inference Latency
OpenAI: gpt-oss-20b343ms TTFT · 186 TPS
Google: Gemini 2.0 Flash Lite450ms TTFT · 74 TPS
OpenAI: gpt-oss-120b418ms TTFT · 62 TPS
Google: Gemini 2.0 Flash575ms TTFT · 65 TPS
Google: Gemini 2.5 Flash Lite Preview 09-2025418ms TTFT · 124 TPS
Google: Gemini 2.5 Flash Lite395ms TTFT · 135 TPS
Qwen: Qwen3 Next 80B A3B Instruct460ms TTFT · 152 TPS
Qwen: Qwen3 Next 80B A3B Thinking519ms TTFT · 125 TPS
Qwen: Qwen3 Coder 480B A35B428ms TTFT · 11 TPS
Qwen: Qwen3 Coder 480B A35B (exacto)1707ms TTFT · 45 TPS
Google: Gemini 3.1 Flash Lite Preview1359ms TTFT · 73 TPS
Qwen: Qwen3 235B A22B Instruct 2507267ms TTFT · 63 TPS
Meta: Llama 4 Scout2422ms TTFT · 81 TPS
Google: Gemini 2.5 Flash1905ms TTFT · 41 TPS
Google: Nano Banana (Gemini 2.5 Flash Image)8293ms TTFT · 133 TPS
MiniMax: MiniMax M2363ms TTFT · 71 TPS
Google: Gemini 2.5 Flash840ms TTFT · 62 TPS
Google: Gemini 3 Flash Preview1328ms TTFT · 57 TPS
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)18183ms TTFT · 75 TPS
DeepSeek: DeepSeek V3.2893ms TTFT · 22 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| OpenAI: gpt-oss-20b | $0.07 | $0.25 | 343ms | 186 |
| Google: Gemini 2.0 Flash Lite | $0.08 | $0.30 | 450ms | 74 |
| OpenAI: gpt-oss-120b | $0.09 | $0.36 | 418ms | 62 |
| Google: Gemini 2.0 Flash | $0.10 | $0.40 | 575ms | 65 |
| Google: Gemini 2.5 Flash Lite Preview 09-2025 | $0.10 | $0.40 | 418ms | 124 |
| Google: Gemini 2.5 Flash Lite | $0.10 | $0.40 | 395ms | 135 |
| Qwen: Qwen3 Next 80B A3B Instruct | $0.15 | $1.20 | 460ms | 152 |
| Qwen: Qwen3 Next 80B A3B Thinking | $0.15 | $1.20 | 519ms | 125 |
| Qwen: Qwen3 Coder 480B A35B | $0.22 | $1.80 | 428ms | 11 |
| Qwen: Qwen3 Coder 480B A35B (exacto) | $0.22 | $1.80 | 1707ms | 45 |
| Google: Gemini 3.1 Flash Lite Preview | $0.25 | $1.50 | 1359ms | 73 |
| Qwen: Qwen3 235B A22B Instruct 2507 | $0.25 | $1.00 | 267ms | 63 |
| Anthropic: Claude 3 Haiku | $0.25 | $1.25 | — | — |
| Meta: Llama 4 Scout | $0.25 | $0.70 | 2422ms | 81 |
| Google: Gemini 2.5 Flash | $0.30 | $2.50 | 1905ms | 41 |
| Google: Nano Banana (Gemini 2.5 Flash Image) | $0.30 | $2.50 | 8293ms | 133 |
| MiniMax: MiniMax M2 | $0.30 | $1.20 | 363ms | 71 |
| Google: Gemini 2.5 Flash | $0.30 | $2.50 | 840ms | 62 |
| Google: Gemini 3 Flash Preview | $0.50 | $3.00 | 1328ms | 57 |
| Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview) | $0.50 | $3.00 | 18183ms | 75 |
| DeepSeek: DeepSeek V3.2 | $0.56 | $1.68 | 893ms | 22 |
| MoonshotAI: Kimi K2 Thinking | $0.60 | $2.50 | 4345ms | 30 |
| DeepSeek: DeepSeek V3.1 | $0.60 | $1.70 | 768ms | 81 |
| Z.ai: GLM 4.7 | $0.60 | $2.20 | 1561ms | 86 |
| Meta: Llama 3.3 70B Instruct | $0.72 | $0.72 | 262ms | 36 |
| Anthropic: Claude 3.5 Haiku | $0.80 | $4.00 | — | — |
| Anthropic: Claude Haiku 4.5 | $1.00 | $5.00 | 564ms | 77 |
| Google: Gemini 2.5 Pro | $1.25 | $10.00 | 2898ms | 84 |
| Google: Gemini 2.5 Pro Preview 05-06 | $1.25 | $10.00 | 2898ms | 84 |
| Google: Gemini 2.5 Pro Preview 06-05 | $1.25 | $10.00 | 2638ms | 74 |
| Google: Gemini 2.5 Pro | $1.25 | $10.00 | 2570ms | 73 |
| Google: Gemini 2.5 Pro Preview 06-05 | $1.25 | $10.00 | 2892ms | 84 |
| Google: Gemini 2.5 Pro Preview 05-06 | $1.25 | $10.00 | 2570ms | 73 |
| Google: Gemini 3 Pro Preview | $2.00 | $12.00 | 6924ms | 66 |
| Google: Nano Banana Pro (Gemini 3 Pro Image Preview) | $2.00 | $12.00 | 34803ms | 36 |
| Google: Gemini 3.1 Pro Preview | $2.00 | $12.00 | 7025ms | 63 |
| Anthropic: Claude Sonnet 4.5 | $3.00 | $15.00 | 1662ms | 36 |
| Anthropic: Claude 3.7 Sonnet | $3.00 | $15.00 | — | — |
| Anthropic: Claude Sonnet 4 | $3.00 | $15.00 | — | — |
| Anthropic: Claude 3.7 Sonnet | $3.00 | $15.00 | — | — |
| Anthropic: Claude Sonnet 4.6 | $3.00 | $15.00 | 1478ms | 37 |
| Anthropic: Claude Sonnet 4.6 | $3.00 | $15.00 | 1125ms | 45 |
| Anthropic: Claude Sonnet 4 | $3.00 | $15.00 | 810ms | 36 |
| Anthropic: Claude Sonnet 4.5 | $3.00 | $15.00 | 1439ms | 36 |
| Anthropic: Claude 3.7 Sonnet (thinking) | $3.00 | $15.00 | 1001ms | 50 |
| Anthropic: Claude 3.7 Sonnet | $3.00 | $15.00 | 667ms | 44 |
| Anthropic: Claude Sonnet 4 | $3.00 | $15.00 | — | — |
| Anthropic: Claude Opus 4.7 | $5.00 | $25.00 | 2181ms | 53 |
| Anthropic: Claude Opus 4.5 | $5.00 | $25.00 | 1107ms | 41 |
| Anthropic: Claude Opus 4.6 | $5.00 | $25.00 | 1667ms | 39 |
| Anthropic: Claude Opus 4.1 | $15.00 | $75.00 | — | — |
| Anthropic: Claude Opus 4 | $15.00 | $75.00 | — | — |
| Anthropic: Claude Opus 4.1 | $15.00 | $75.00 | — | — |
| Anthropic: Claude Opus 4.1 | $15.00 | $75.00 | — | — |
| Anthropic: Claude Opus 4 | $15.00 | $75.00 | 6488ms | 17 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.