LIVE
Models: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 05:17 PMModels: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 05:17 PM
Marketplace
Providers Models
A

Alibaba

AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating

30-Day Uptime

96.7%
2026-03-232026-04-21

Inference Latency

Qwen: Qwen3.6 Plus Preview (free)903ms TTFT · 34 TPS
Qwen: Qwen3.6 Plus (free)1481ms TTFT · 44 TPS
Qwen: Qwen-Turbo466ms TTFT · 76 TPS
Qwen: Qwen3.5-Flash491ms TTFT · 80 TPS
Qwen: Qwen3 Next 80B A3B Instruct591ms TTFT · 44 TPS
Qwen: Qwen3 Next 80B A3B Thinking356ms TTFT · 165 TPS
Qwen: Qwen3 VL 32B Instruct943ms TTFT · 56 TPS
Qwen: Qwen3 32B722ms TTFT · 26 TPS
Qwen: Qwen3 8B435ms TTFT · 38 TPS
Qwen: Qwen3 VL 8B Instruct720ms TTFT · 110 TPS
Qwen: Qwen3 30B A3B Instruct 2507372ms TTFT · 67 TPS
Qwen: Qwen3 30B A3B581ms TTFT · 83 TPS
Qwen: Qwen3 VL 30B A3B Thinking550ms TTFT · 115 TPS
Qwen: Qwen3 30B A3B Thinking 2507606ms TTFT · 135 TPS
Qwen: Qwen3 VL 30B A3B Instruct661ms TTFT · 49 TPS
Qwen: Qwen VL Plus640ms TTFT · 76 TPS
Qwen: Qwen3 235B A22B Instruct 2507771ms TTFT · 32 TPS
Qwen: Qwen3 235B A22B Thinking 2507646ms TTFT · 46 TPS
Qwen: Qwen3.5-35B-A3B828ms TTFT · 125 TPS
Qwen: Qwen3 Coder Flash1435ms TTFT · 47 TPS

Inference Models

ModelInput $/MOutput $/MTTFTTPS
Qwen: Qwen3.6 Plus Preview (free)$0.00$0.00903ms34
Qwen: Qwen3.6 Plus (free)$0.00$0.001481ms44
Qwen: Qwen-Turbo$0.03$0.13466ms76
Qwen: Qwen3.5-Flash$0.07$0.26491ms80
Qwen: Qwen3 Next 80B A3B Instruct$0.10$0.78591ms44
Qwen: Qwen3 Next 80B A3B Thinking$0.10$0.78356ms165
Qwen: Qwen3 VL 32B Instruct$0.10$0.42943ms56
Qwen: Qwen3 32B$0.10$0.42722ms26
Qwen: Qwen3 VL 8B Thinking$0.12$1.37
Qwen: Qwen3 8B$0.12$0.45435ms38
Qwen: Qwen3 VL 8B Instruct$0.12$0.45720ms110
Qwen: Qwen3 30B A3B Instruct 2507$0.13$0.52372ms67
Qwen: Qwen3 30B A3B$0.13$0.52581ms83
Qwen: Qwen3 VL 30B A3B Thinking$0.13$1.56550ms115
Qwen: Qwen3 30B A3B Thinking 2507$0.13$1.56606ms135
Qwen: Qwen3 VL 30B A3B Instruct$0.13$0.52661ms49
Qwen: Qwen VL Plus$0.14$0.41640ms76
Qwen: Qwen3 235B A22B Instruct 2507$0.15$0.60771ms32
Qwen: Qwen3 235B A22B Thinking 2507$0.15$1.50646ms46
Qwen: Qwen3.5-35B-A3B$0.16$1.30828ms125
Qwen: Qwen3 Coder Flash$0.20$0.981435ms47
Qwen: Qwen3.5-27B$0.20$1.56451ms12
Qwen: Qwen3 14B$0.23$0.91208ms62
Qwen: Qwen3 VL 235B A22B Thinking$0.26$2.60604ms45
Qwen: Qwen3 VL 235B A22B Instruct$0.26$1.04729ms43
Qwen: Qwen3.5 Plus 2026-02-15$0.26$1.561457ms48
Qwen: Qwen-Plus$0.26$0.78694ms44
Qwen: Qwen Plus 0728 (thinking)$0.26$0.78583ms49
Qwen: Qwen Plus 0728$0.26$0.78599ms52
Qwen: Qwen3.5-122B-A10B$0.26$2.08494ms136
Qwen: Qwen3 Coder 30B A3B Instruct$0.29$1.462632ms87
Qwen: Qwen3.6 Plus$0.33$1.951482ms46
Qwen: Qwen3.5 397B A17B$0.39$2.341155ms49
Qwen: Qwen3 235B A22B$0.45$1.82606ms58
Qwen: Qwen VL Max$0.52$2.08
DeepSeek: DeepSeek V3.2$0.57$1.711005ms38
Qwen: Qwen3 Coder Plus$0.65$3.251959ms36
Qwen: Qwen3 Max Thinking$0.78$3.901020ms32
Qwen: Qwen3 Max$0.78$3.90863ms23
Qwen: Qwen3 Coder 480B A35B$0.98$4.88
Qwen: Qwen-Max $1.04$4.161961ms36

Community Reviews

4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★2025-06-10

Good performance but support could be faster.