LIVE
Models: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 04:34 PMModels: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 04:34 PM
Marketplace
Providers Models
A

AtlasCloud

AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating

30-Day Uptime

100%
2026-05-222026-06-20

Inference Latency

Qwen: Qwen3 8B776ms TTFT · 38 TPS
Qwen: Qwen3 VL 8B Instruct573ms TTFT · 11 TPS
Qwen: Qwen3 30B A3B Thinking 2507593ms TTFT · 73 TPS
Tongyi DeepResearch 30B A3B275ms TTFT · 98 TPS
Qwen: Qwen3 32B1097ms TTFT · 13 TPS
Qwen: Qwen3 30B A3B Instruct 25072544ms TTFT · 18 TPS
DeepSeek: DeepSeek V4 Flash1908ms TTFT · 3 TPS
Qwen: Qwen3 Next 80B A3B Instruct1353ms TTFT · 20 TPS
Qwen: Qwen3 VL 30B A3B Instruct1346ms TTFT · 9 TPS
Qwen: Qwen3.6 35B A3B1109ms TTFT · 30 TPS
Qwen: Qwen3 Coder Next907ms TTFT · 10 TPS
Qwen: Qwen3 235B A22B Instruct 2507922ms TTFT · 15 TPS
DeepSeek: DeepSeek V3.1 Terminus (exacto)1319ms TTFT · 24 TPS
DeepSeek: DeepSeek V3 03241117ms TTFT · 13 TPS
Qwen: Qwen3.5-35B-A3B773ms TTFT · 4 TPS
MiniMax: MiniMax M21496ms TTFT · 21 TPS
DeepSeek: DeepSeek V3.21316ms TTFT · 16 TPS
Qwen: Qwen3.5-27B1280ms TTFT · 31 TPS
DeepSeek: DeepSeek V3.2 Exp1413ms TTFT · 12 TPS
Qwen: Qwen3 235B A22B Thinking 25071200ms TTFT · 43 TPS

Inference Models

ModelInput $/MOutput $/MTTFTTPS
Qwen: Qwen3 8B$0.05$0.40776ms38
Qwen: Qwen3 VL 8B Instruct$0.08$0.50573ms11
Qwen: Qwen3 30B A3B Thinking 2507$0.08$0.40593ms73
Tongyi DeepResearch 30B A3B$0.09$0.45275ms98
Qwen: Qwen3 32B$0.10$1.201097ms13
Qwen: Qwen3 30B A3B Instruct 2507$0.10$0.302544ms18
DeepSeek: DeepSeek V4 Flash$0.14$0.281908ms3
Qwen: Qwen3 Next 80B A3B Thinking$0.15$1.50
Qwen: Qwen3 Next 80B A3B Instruct$0.15$1.501353ms20
Qwen: Qwen3 VL 30B A3B Instruct$0.15$0.601346ms9
Qwen: Qwen3.6 35B A3B$0.16$0.971109ms30
Qwen: Qwen3 Coder Next$0.18$1.35907ms10
Meituan: LongCat Flash Chat$0.20$0.80
Qwen: Qwen3 235B A22B Instruct 2507$0.20$0.88922ms15
DeepSeek: DeepSeek V3.1 Terminus (exacto)$0.21$0.801319ms24
DeepSeek: DeepSeek V3 0324$0.22$0.881117ms13
Qwen: Qwen3.5-35B-A3B$0.22$1.80773ms4
MiniMax: MiniMax M2$0.26$1.001496ms21
DeepSeek: DeepSeek V3.2$0.26$0.381316ms16
Qwen: Qwen3.5-27B$0.27$2.161280ms31
DeepSeek: DeepSeek V3.2 Exp$0.27$0.411413ms12
Qwen: Qwen3 235B A22B Thinking 2507$0.28$2.301200ms43
DeepSeek: DeepSeek V3.2 Speciale$0.29$0.43
MiniMax: MiniMax M2.1$0.29$0.954590ms10
MiniMax: MiniMax M2.5$0.30$1.205051ms32
Qwen: Qwen3 VL 235B A22B Instruct$0.30$1.503059ms12
DeepSeek: DeepSeek V3.1 Terminus$0.30$0.951276ms3
Kwaipilot: KAT-Coder-Pro V2$0.30$1.203052ms13
Qwen: Qwen3.5-122B-A10B$0.30$2.40941ms7
DeepSeek: DeepSeek V3.1$0.30$0.951210ms9
MiniMax: MiniMax M3$0.42$1.681483ms31
MoonshotAI: Kimi K2.5$0.49$2.501741ms15
Z.ai: GLM 4.7$0.52$1.851441ms37
Qwen: Qwen3.5 397B A17B$0.55$3.501370ms40
DeepSeek: R1 0528$0.55$2.153064ms23
Z.ai: GLM 4.6$0.60$2.203031ms25
MoonshotAI: Kimi K2 0905$0.60$2.50923ms15
MoonshotAI: Kimi K2 Thinking$0.60$2.50750ms57
Qwen: Qwen3 Coder 480B A35B$0.78$3.801706ms11
MoonshotAI: Kimi K2.6$0.95$4.001080ms31
MoonshotAI: Kimi K2.7 Code$0.95$4.001219ms31
Z.ai: GLM 5$0.95$3.152705ms18
Z.ai: GLM 5 Turbo$1.20$4.001985ms12
Z.ai: GLM 5.1$1.26$3.961876ms27
Z.ai: GLM 5.2$1.40$4.401567ms24
DeepSeek: DeepSeek V4 Pro$1.68$3.382409ms53

Community Reviews

4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★2025-06-10

Good performance but support could be faster.

AtlasCloud — Provider Scorecard — NexusGPU | NexusGPU