Models: —+ · Providers: —+ · Cheapest H100: $2.49/hr · Updated: 04:31 PM

Alibaba

AGGREGATED · INFERENCE
Uptime: N/A
Rating: N/A

30-Day Uptime

100%
2026-05-22 to 2026-06-20

Inference Latency

Qwen: Qwen3.6 Plus Preview (free) - 903ms TTFT · 34 TPS
Qwen: Qwen3.6 Plus (free) - 1481ms TTFT · 44 TPS
Qwen: Qwen-Turbo - 534ms TTFT · 29 TPS
Qwen: Qwen3.5-Flash - 776ms TTFT · 70 TPS
Qwen: Qwen3 Next 80B A3B Thinking - 504ms TTFT · 155 TPS
Qwen: Qwen3 Next 80B A3B Instruct - 649ms TTFT · 71 TPS
Qwen: Qwen3 VL 32B Instruct - 444ms TTFT · 24 TPS
Qwen: Qwen3 32B - 386ms TTFT · 90 TPS
Qwen: Qwen3 VL 8B Instruct - 527ms TTFT · 77 TPS
Qwen: Qwen3 8B - 641ms TTFT · 38 TPS
Qwen: Qwen3 VL 8B Thinking - 391ms TTFT · 144 TPS
Qwen: Qwen3 30B A3B - 550ms TTFT · 92 TPS
Qwen: Qwen3 30B A3B Thinking 2507 - 529ms TTFT · 142 TPS
Qwen: Qwen3 30B A3B Instruct 2507 - 352ms TTFT · 5 TPS
Qwen: Qwen3 VL 30B A3B Thinking - 1063ms TTFT · 112 TPS
Qwen: Qwen3 VL 30B A3B Instruct - 653ms TTFT · 49 TPS
DeepSeek: DeepSeek V4 Flash - 838ms TTFT · 38 TPS
Qwen: Qwen VL Plus - 221ms TTFT · 102 TPS
Qwen: Qwen3 235B A22B Thinking 2507 - 524ms TTFT · 77 TPS
Qwen: Qwen3 235B A22B Instruct 2507 - 619ms TTFT · 28 TPS

Inference Models

| Model | Input ($/M tokens) | Output ($/M tokens) | TTFT (ms) | TPS |
|---|---|---|---|---|
| Qwen: Qwen3.6 Plus Preview (free) | $0.00 | $0.00 | 903 | 34 |
| Qwen: Qwen3.6 Plus (free) | $0.00 | $0.00 | 1481 | 44 |
| Qwen: Qwen-Turbo | $0.03 | $0.13 | 534 | 29 |
| Qwen: Qwen3.5-Flash | $0.07 | $0.26 | 776 | 70 |
| Qwen: Qwen3 Next 80B A3B Thinking | $0.10 | $0.78 | 504 | 155 |
| Qwen: Qwen3 Next 80B A3B Instruct | $0.10 | $0.78 | 649 | 71 |
| Qwen: Qwen3 VL 32B Instruct | $0.10 | $0.42 | 444 | 24 |
| Qwen: Qwen3 32B | $0.10 | $0.42 | 386 | 90 |
| Qwen: Qwen3 VL 8B Instruct | $0.12 | $0.45 | 527 | 77 |
| Qwen: Qwen3 8B | $0.12 | $0.45 | 641 | 38 |
| Qwen: Qwen3 VL 8B Thinking | $0.12 | $1.37 | 391 | 144 |
| Qwen: Qwen3 30B A3B | $0.13 | $0.52 | 550 | 92 |
| Qwen: Qwen3 30B A3B Thinking 2507 | $0.13 | $1.56 | 529 | 142 |
| Qwen: Qwen3 30B A3B Instruct 2507 | $0.13 | $0.52 | 352 | 5 |
| Qwen: Qwen3 VL 30B A3B Thinking | $0.13 | $1.56 | 1063 | 112 |
| Qwen: Qwen3 VL 30B A3B Instruct | $0.13 | $0.52 | 653 | 49 |
| DeepSeek: DeepSeek V4 Flash | $0.13 | $0.27 | 838 | 38 |
| Qwen: Qwen VL Plus | $0.14 | $0.41 | 221 | 102 |
| Qwen: Qwen3 235B A22B Thinking 2507 | $0.15 | $1.50 | 524 | 77 |
| Qwen: Qwen3 235B A22B Instruct 2507 | $0.15 | $0.60 | 619 | 28 |
| Qwen: Qwen3.5-35B-A3B | $0.16 | $1.30 | 582 | 50 |
| Qwen: Qwen3.6 Flash | $0.19 | $1.13 | 622 | 78 |
| Qwen: Qwen3.5-27B | $0.20 | $1.56 | 617 | 22 |
| Qwen: Qwen3 Coder Flash | $0.20 | $0.98 | 914 | 23 |
| Qwen: Qwen3 14B | $0.23 | $0.91 | | |
| Qwen: Qwen Plus 0728 (thinking) | $0.26 | $0.78 | | |
| Qwen: Qwen3 VL 235B A22B Thinking | $0.26 | $2.60 | 848 | 59 |
| Qwen: Qwen-Plus | $0.26 | $0.78 | 464 | 41 |
| Qwen: Qwen3.5-122B-A10B | $0.26 | $2.08 | 536 | 63 |
| Qwen: Qwen3.5 Plus 2026-02-15 | $0.26 | $1.56 | 1620 | 22 |
| Qwen: Qwen3 VL 235B A22B Instruct | $0.26 | $1.04 | 748 | 39 |
| Qwen: Qwen Plus 0728 | $0.26 | $0.78 | | |
| Qwen: Qwen3 Coder 30B A3B Instruct | $0.29 | $1.46 | 871 | 97 |
| Qwen: Qwen3.5 Plus 2026-04-20 | $0.30 | $1.80 | 1190 | 45 |
| Qwen: Qwen3.7 Plus | $0.32 | $1.28 | 1096 | 12 |
| Qwen: Qwen3.6 Plus | $0.33 | $1.95 | 945 | 11 |
| DeepSeek: DeepSeek V3.2 | $0.37 | $1.11 | 1141 | 33 |
| Qwen: Qwen3.5 397B A17B | $0.39 | $2.34 | 1636 | 41 |
| Qwen: Qwen3.6 27B | $0.45 | $2.70 | 1199 | 62 |
| Qwen: Qwen3 235B A22B | $0.45 | $1.82 | 553 | 65 |
| Qwen: Qwen VL Max | $0.52 | $2.08 | 1111 | 37 |
| Qwen: Qwen3 Coder Plus | $0.65 | $3.25 | 1253 | 27 |
| Qwen: Qwen3 Max | $0.78 | $3.90 | 1153 | 36 |
| Qwen: Qwen3 Max Thinking | $0.78 | $3.90 | 1217 | 57 |
| Qwen: Qwen3 Coder 480B A35B | $0.98 | $4.88 | 1735 | 25 |
| Qwen: Qwen-Max | $1.04 | $4.16 | 520 | 51 |
| Qwen: Qwen3.6 Max Preview | $1.04 | $6.24 | 1470 | 22 |
| Qwen: Qwen3.7 Max | $1.25 | $3.75 | 1370 | 47 |
| DeepSeek: DeepSeek V4 Pro | $1.61 | $3.22 | 1040 | 57 |
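
As a rough illustration of how the pricing and latency columns might be combined, the sketch below estimates the cost and response time of a single request. It rests on assumptions the listing does not state: that "$/M" means US dollars per million tokens, that TPS is output tokens per second, and that total response time is approximately TTFT plus generation time. The names `ModelQuote`, `request_cost_usd`, and `estimated_latency_s` are hypothetical helpers, not part of any provider API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelQuote:
    """One row of the Inference Models listing (hypothetical container)."""
    name: str
    input_usd_per_m: float   # assumed: USD per 1,000,000 input tokens
    output_usd_per_m: float  # assumed: USD per 1,000,000 output tokens
    ttft_ms: float           # time to first token, in milliseconds
    tps: float               # assumed: output tokens generated per second


def request_cost_usd(q: ModelQuote, input_tokens: int, output_tokens: int) -> float:
    """Cost of one request under the per-million-token pricing assumption."""
    total = input_tokens * q.input_usd_per_m + output_tokens * q.output_usd_per_m
    return total / 1_000_000


def estimated_latency_s(q: ModelQuote, output_tokens: int) -> float:
    """Approximate response time: TTFT plus time to stream the output tokens."""
    return q.ttft_ms / 1000 + output_tokens / q.tps


# Values copied from the "Qwen: Qwen3 32B" row of the listing.
qwen3_32b = ModelQuote("Qwen: Qwen3 32B", 0.10, 0.42, 386, 90)

# Example request: 2,000 input tokens and 500 output tokens.
print(f"cost:    ${request_cost_usd(qwen3_32b, 2000, 500):.5f}")
print(f"latency: {estimated_latency_s(qwen3_32b, 500):.2f} s")
```

The listed TTFT and TPS figures are point-in-time measurements, so an estimate like this is only indicative; rows without TTFT or TPS values cannot be used for the latency estimate.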

Community Reviews

4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★ 2025-06-10

Good performance but support could be faster.