Alibaba
Aggregated · Inference
Uptime: N/A
Rating: N/A
30-Day Uptime: 100% (2026-05-22 to 2026-06-20)
Inference Latency
[Chart: time to first token (TTFT) and tokens per second (TPS) for the first 20 models; the same values appear in the Inference Models table.]
Inference Models
| Model | Input ($/M tokens) | Output ($/M tokens) | TTFT | TPS |
|---|---|---|---|---|
| Qwen: Qwen3.6 Plus Preview (free) | $0.00 | $0.00 | 903ms | 34 |
| Qwen: Qwen3.6 Plus (free) | $0.00 | $0.00 | 1481ms | 44 |
| Qwen: Qwen-Turbo | $0.03 | $0.13 | 534ms | 29 |
| Qwen: Qwen3.5-Flash | $0.07 | $0.26 | 776ms | 70 |
| Qwen: Qwen3 Next 80B A3B Thinking | $0.10 | $0.78 | 504ms | 155 |
| Qwen: Qwen3 Next 80B A3B Instruct | $0.10 | $0.78 | 649ms | 71 |
| Qwen: Qwen3 VL 32B Instruct | $0.10 | $0.42 | 444ms | 24 |
| Qwen: Qwen3 32B | $0.10 | $0.42 | 386ms | 90 |
| Qwen: Qwen3 VL 8B Instruct | $0.12 | $0.45 | 527ms | 77 |
| Qwen: Qwen3 8B | $0.12 | $0.45 | 641ms | 38 |
| Qwen: Qwen3 VL 8B Thinking | $0.12 | $1.37 | 391ms | 144 |
| Qwen: Qwen3 30B A3B | $0.13 | $0.52 | 550ms | 92 |
| Qwen: Qwen3 30B A3B Thinking 2507 | $0.13 | $1.56 | 529ms | 142 |
| Qwen: Qwen3 30B A3B Instruct 2507 | $0.13 | $0.52 | 352ms | 5 |
| Qwen: Qwen3 VL 30B A3B Thinking | $0.13 | $1.56 | 1063ms | 112 |
| Qwen: Qwen3 VL 30B A3B Instruct | $0.13 | $0.52 | 653ms | 49 |
| DeepSeek: DeepSeek V4 Flash | $0.13 | $0.27 | 838ms | 38 |
| Qwen: Qwen VL Plus | $0.14 | $0.41 | 221ms | 102 |
| Qwen: Qwen3 235B A22B Thinking 2507 | $0.15 | $1.50 | 524ms | 77 |
| Qwen: Qwen3 235B A22B Instruct 2507 | $0.15 | $0.60 | 619ms | 28 |
| Qwen: Qwen3.5-35B-A3B | $0.16 | $1.30 | 582ms | 50 |
| Qwen: Qwen3.6 Flash | $0.19 | $1.13 | 622ms | 78 |
| Qwen: Qwen3.5-27B | $0.20 | $1.56 | 617ms | 22 |
| Qwen: Qwen3 Coder Flash | $0.20 | $0.98 | 914ms | 23 |
| Qwen: Qwen3 14B | $0.23 | $0.91 | — | — |
| Qwen: Qwen Plus 0728 (thinking) | $0.26 | $0.78 | — | — |
| Qwen: Qwen3 VL 235B A22B Thinking | $0.26 | $2.60 | 848ms | 59 |
| Qwen: Qwen-Plus | $0.26 | $0.78 | 464ms | 41 |
| Qwen: Qwen3.5-122B-A10B | $0.26 | $2.08 | 536ms | 63 |
| Qwen: Qwen3.5 Plus 2026-02-15 | $0.26 | $1.56 | 1620ms | 22 |
| Qwen: Qwen3 VL 235B A22B Instruct | $0.26 | $1.04 | 748ms | 39 |
| Qwen: Qwen Plus 0728 | $0.26 | $0.78 | — | — |
| Qwen: Qwen3 Coder 30B A3B Instruct | $0.29 | $1.46 | 871ms | 97 |
| Qwen: Qwen3.5 Plus 2026-04-20 | $0.30 | $1.80 | 1190ms | 45 |
| Qwen: Qwen3.7 Plus | $0.32 | $1.28 | 1096ms | 12 |
| Qwen: Qwen3.6 Plus | $0.33 | $1.95 | 945ms | 11 |
| DeepSeek: DeepSeek V3.2 | $0.37 | $1.11 | 1141ms | 33 |
| Qwen: Qwen3.5 397B A17B | $0.39 | $2.34 | 1636ms | 41 |
| Qwen: Qwen3.6 27B | $0.45 | $2.70 | 1199ms | 62 |
| Qwen: Qwen3 235B A22B | $0.45 | $1.82 | 553ms | 65 |
| Qwen: Qwen VL Max | $0.52 | $2.08 | 1111ms | 37 |
| Qwen: Qwen3 Coder Plus | $0.65 | $3.25 | 1253ms | 27 |
| Qwen: Qwen3 Max | $0.78 | $3.90 | 1153ms | 36 |
| Qwen: Qwen3 Max Thinking | $0.78 | $3.90 | 1217ms | 57 |
| Qwen: Qwen3 Coder 480B A35B | $0.98 | $4.88 | 1735ms | 25 |
| Qwen: Qwen-Max | $1.04 | $4.16 | 520ms | 51 |
| Qwen: Qwen3.6 Max Preview | $1.04 | $6.24 | 1470ms | 22 |
| Qwen: Qwen3.7 Max | $1.25 | $3.75 | 1370ms | 47 |
| DeepSeek: DeepSeek V4 Pro | $1.61 | $3.22 | 1040ms | 57 |
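
The table's columns can be combined into a rough per-request estimate. The sketch below assumes that the price columns are USD per million tokens and that TTFT and TPS are point-in-time measurements rather than guarantees. The function names `request_cost_usd` and `generation_time_s` are hypothetical helpers for illustration, not part of any provider API.

```python
def request_cost_usd(input_tokens, output_tokens, input_per_m, output_per_m):
    """Estimate request cost in USD from token counts and per-million-token prices."""
    return (input_tokens * input_per_m + output_tokens * output_per_m) / 1_000_000


def generation_time_s(output_tokens, ttft_ms, tps):
    """Rough wall-clock estimate: time to first token plus streaming time."""
    return ttft_ms / 1000 + output_tokens / tps


# Values from the "Qwen: Qwen3 32B" row: $0.10 in, $0.42 out, 386ms TTFT, 90 TPS.
# The token counts (2,000 in, 500 out) are an assumed example workload.
cost = request_cost_usd(2_000, 500, 0.10, 0.42)   # roughly $0.00041
seconds = generation_time_s(500, 386, 90)         # roughly 5.9 seconds

print(f"cost: ${cost:.5f}, time: {seconds:.1f}s")
```

Rows that show no TTFT or TPS value have no latency measurement, so only the cost estimate applies to them.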
Community Reviews
4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆ 2025-06-10
Good performance but support could be faster.