A
Alibaba
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
96.7%2026-03-232026-04-21
Inference Latency
Qwen: Qwen3.6 Plus Preview (free)903ms TTFT · 34 TPS
Qwen: Qwen3.6 Plus (free)1481ms TTFT · 44 TPS
Qwen: Qwen-Turbo466ms TTFT · 76 TPS
Qwen: Qwen3.5-Flash491ms TTFT · 80 TPS
Qwen: Qwen3 Next 80B A3B Instruct591ms TTFT · 44 TPS
Qwen: Qwen3 Next 80B A3B Thinking356ms TTFT · 165 TPS
Qwen: Qwen3 VL 32B Instruct943ms TTFT · 56 TPS
Qwen: Qwen3 32B722ms TTFT · 26 TPS
Qwen: Qwen3 8B435ms TTFT · 38 TPS
Qwen: Qwen3 VL 8B Instruct720ms TTFT · 110 TPS
Qwen: Qwen3 30B A3B Instruct 2507372ms TTFT · 67 TPS
Qwen: Qwen3 30B A3B581ms TTFT · 83 TPS
Qwen: Qwen3 VL 30B A3B Thinking550ms TTFT · 115 TPS
Qwen: Qwen3 30B A3B Thinking 2507606ms TTFT · 135 TPS
Qwen: Qwen3 VL 30B A3B Instruct661ms TTFT · 49 TPS
Qwen: Qwen VL Plus640ms TTFT · 76 TPS
Qwen: Qwen3 235B A22B Instruct 2507771ms TTFT · 32 TPS
Qwen: Qwen3 235B A22B Thinking 2507646ms TTFT · 46 TPS
Qwen: Qwen3.5-35B-A3B828ms TTFT · 125 TPS
Qwen: Qwen3 Coder Flash1435ms TTFT · 47 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| Qwen: Qwen3.6 Plus Preview (free) | $0.00 | $0.00 | 903ms | 34 |
| Qwen: Qwen3.6 Plus (free) | $0.00 | $0.00 | 1481ms | 44 |
| Qwen: Qwen-Turbo | $0.03 | $0.13 | 466ms | 76 |
| Qwen: Qwen3.5-Flash | $0.07 | $0.26 | 491ms | 80 |
| Qwen: Qwen3 Next 80B A3B Instruct | $0.10 | $0.78 | 591ms | 44 |
| Qwen: Qwen3 Next 80B A3B Thinking | $0.10 | $0.78 | 356ms | 165 |
| Qwen: Qwen3 VL 32B Instruct | $0.10 | $0.42 | 943ms | 56 |
| Qwen: Qwen3 32B | $0.10 | $0.42 | 722ms | 26 |
| Qwen: Qwen3 VL 8B Thinking | $0.12 | $1.37 | — | — |
| Qwen: Qwen3 8B | $0.12 | $0.45 | 435ms | 38 |
| Qwen: Qwen3 VL 8B Instruct | $0.12 | $0.45 | 720ms | 110 |
| Qwen: Qwen3 30B A3B Instruct 2507 | $0.13 | $0.52 | 372ms | 67 |
| Qwen: Qwen3 30B A3B | $0.13 | $0.52 | 581ms | 83 |
| Qwen: Qwen3 VL 30B A3B Thinking | $0.13 | $1.56 | 550ms | 115 |
| Qwen: Qwen3 30B A3B Thinking 2507 | $0.13 | $1.56 | 606ms | 135 |
| Qwen: Qwen3 VL 30B A3B Instruct | $0.13 | $0.52 | 661ms | 49 |
| Qwen: Qwen VL Plus | $0.14 | $0.41 | 640ms | 76 |
| Qwen: Qwen3 235B A22B Instruct 2507 | $0.15 | $0.60 | 771ms | 32 |
| Qwen: Qwen3 235B A22B Thinking 2507 | $0.15 | $1.50 | 646ms | 46 |
| Qwen: Qwen3.5-35B-A3B | $0.16 | $1.30 | 828ms | 125 |
| Qwen: Qwen3 Coder Flash | $0.20 | $0.98 | 1435ms | 47 |
| Qwen: Qwen3.5-27B | $0.20 | $1.56 | 451ms | 12 |
| Qwen: Qwen3 14B | $0.23 | $0.91 | 208ms | 62 |
| Qwen: Qwen3 VL 235B A22B Thinking | $0.26 | $2.60 | 604ms | 45 |
| Qwen: Qwen3 VL 235B A22B Instruct | $0.26 | $1.04 | 729ms | 43 |
| Qwen: Qwen3.5 Plus 2026-02-15 | $0.26 | $1.56 | 1457ms | 48 |
| Qwen: Qwen-Plus | $0.26 | $0.78 | 694ms | 44 |
| Qwen: Qwen Plus 0728 (thinking) | $0.26 | $0.78 | 583ms | 49 |
| Qwen: Qwen Plus 0728 | $0.26 | $0.78 | 599ms | 52 |
| Qwen: Qwen3.5-122B-A10B | $0.26 | $2.08 | 494ms | 136 |
| Qwen: Qwen3 Coder 30B A3B Instruct | $0.29 | $1.46 | 2632ms | 87 |
| Qwen: Qwen3.6 Plus | $0.33 | $1.95 | 1482ms | 46 |
| Qwen: Qwen3.5 397B A17B | $0.39 | $2.34 | 1155ms | 49 |
| Qwen: Qwen3 235B A22B | $0.45 | $1.82 | 606ms | 58 |
| Qwen: Qwen VL Max | $0.52 | $2.08 | — | — |
| DeepSeek: DeepSeek V3.2 | $0.57 | $1.71 | 1005ms | 38 |
| Qwen: Qwen3 Coder Plus | $0.65 | $3.25 | 1959ms | 36 |
| Qwen: Qwen3 Max Thinking | $0.78 | $3.90 | 1020ms | 32 |
| Qwen: Qwen3 Max | $0.78 | $3.90 | 863ms | 23 |
| Qwen: Qwen3 Coder 480B A35B | $0.98 | $4.88 | — | — |
| Qwen: Qwen-Max | $1.04 | $4.16 | 1961ms | 36 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.