LIVE
Models: —+ · Providers: —+ · Cheapest H100: $2.49/hr · Updated: 05:20 PM
AtlasCloud

AGGREGATED · INFERENCE
Uptime: N/A
Rating: N/A

30-Day Uptime

100% (2026-03-23 to 2026-04-21)

Inference Latency

Qwen: Qwen2.5 7B Instruct · 1120ms TTFT · 23 TPS
Qwen: Qwen3 8B · 1147ms TTFT · 52 TPS
Qwen: Qwen3 30B A3B Thinking 2507 · 1268ms TTFT · 99 TPS
Qwen: Qwen3 VL 8B Instruct · 711ms TTFT · 50 TPS
Tongyi DeepResearch 30B A3B · 188ms TTFT · 184 TPS
Qwen: Qwen3 30B A3B Instruct 2507 · 1597ms TTFT · 35 TPS
OpenAI: gpt-oss-120b · 593ms TTFT · 58 TPS
Qwen: Qwen3 32B · 1003ms TTFT · 6 TPS
Qwen: Qwen3 Next 80B A3B Thinking · 960ms TTFT · 142 TPS
Qwen: Qwen3 Next 80B A3B Instruct · 1156ms TTFT · 84 TPS
Qwen: Qwen3 Coder Next · 2444ms TTFT · 24 TPS
Qwen: Qwen3 235B A22B Instruct 2507 · 1099ms TTFT · 16 TPS
DeepSeek: DeepSeek V3.1 Terminus (exacto) · 1319ms TTFT · 24 TPS
DeepSeek: DeepSeek V3 0324 · 2106ms TTFT · 23 TPS
Qwen: Qwen3.5-35B-A3B · 1200ms TTFT · 129 TPS
MiniMax: MiniMax M2 · 2244ms TTFT · 28 TPS
DeepSeek: DeepSeek V3.2 · 1690ms TTFT · 23 TPS
DeepSeek: DeepSeek V3.2 Exp · 1344ms TTFT · 29 TPS
Qwen: Qwen3.5-27B · 1013ms TTFT · 6 TPS
Qwen: Qwen3 235B A22B Thinking 2507 · 1214ms TTFT · 45 TPS

Inference Models

| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| Qwen: Qwen2.5 7B Instruct | $0.04 | $0.10 | 1120ms | 23 |
| Qwen: Qwen3 8B | $0.05 | $0.40 | 1147ms | 52 |
| Qwen: Qwen3 30B A3B Thinking 2507 | $0.08 | $0.40 | 1268ms | 99 |
| Qwen: Qwen3 VL 8B Instruct | $0.08 | $0.50 | 711ms | 50 |
| Tongyi DeepResearch 30B A3B | $0.09 | $0.45 | 188ms | 184 |
| Qwen: Qwen3 30B A3B Instruct 2507 | $0.10 | $0.30 | 1597ms | 35 |
| Xiaomi: MiMo-V2-Flash | $0.10 | $0.30 | | |
| OpenAI: gpt-oss-120b | $0.10 | $0.40 | 593ms | 58 |
| Qwen: Qwen3 32B | $0.10 | $1.20 | 1003ms | 6 |
| Qwen: Qwen3 Next 80B A3B Thinking | $0.15 | $1.50 | 960ms | 142 |
| Qwen: Qwen3 VL 30B A3B Instruct | $0.15 | $0.60 | | |
| Qwen: Qwen3 Next 80B A3B Instruct | $0.15 | $1.50 | 1156ms | 84 |
| Qwen: Qwen3 Coder Next | $0.18 | $1.35 | 2444ms | 24 |
| Qwen: Qwen3 235B A22B Instruct 2507 | $0.20 | $0.88 | 1099ms | 16 |
| Meituan: LongCat Flash Chat | $0.20 | $0.80 | | |
| DeepSeek: DeepSeek V3.1 Terminus (exacto) | $0.21 | $0.80 | 1319ms | 24 |
| DeepSeek: DeepSeek V3 0324 | $0.22 | $0.88 | 2106ms | 23 |
| Qwen: Qwen3.5-35B-A3B | $0.22 | $1.80 | 1200ms | 129 |
| MiniMax: MiniMax M2 | $0.26 | $1.00 | 2244ms | 28 |
| DeepSeek: DeepSeek V3.2 | $0.26 | $0.38 | 1690ms | 23 |
| DeepSeek: DeepSeek V3.2 Exp | $0.27 | $0.41 | 1344ms | 29 |
| Qwen: Qwen3.5-27B | $0.27 | $2.16 | 1013ms | 6 |
| Qwen: Qwen3 235B A22B Thinking 2507 | $0.28 | $2.30 | 1214ms | 45 |
| MiniMax: MiniMax M2.1 | $0.29 | $0.95 | 3834ms | 45 |
| MiniMax: MiniMax M2.5 | $0.30 | $1.20 | 4182ms | 35 |
| Kwaipilot: KAT-Coder-Pro V2 | $0.30 | $1.20 | 1580ms | 27 |
| DeepSeek: DeepSeek V3.1 Terminus | $0.30 | $0.95 | 1874ms | 26 |
| Qwen: Qwen3.5-122B-A10B | $0.30 | $2.40 | 1001ms | 9 |
| DeepSeek: DeepSeek V3.1 | $0.30 | $0.95 | 1636ms | 24 |
| Qwen: Qwen3 VL 235B A22B Instruct | $0.30 | $1.50 | 5087ms | 10 |
| DeepSeek: DeepSeek V3.2 Speciale | $0.40 | $1.20 | 2955ms | 24 |
| MoonshotAI: Kimi K2.5 | $0.49 | $2.50 | 2315ms | 32 |
| DeepSeek: DeepSeek V3.2 | $0.50 | $1.50 | 1532ms | 25 |
| Z.ai: GLM 4.7 | $0.52 | $1.85 | 1345ms | 43 |
| Qwen: Qwen3.5 397B A17B | $0.55 | $3.50 | 1436ms | 71 |
| DeepSeek: R1 0528 | $0.55 | $2.15 | 3226ms | 22 |
| MoonshotAI: Kimi K2 Thinking | $0.60 | $2.50 | 683ms | 24 |
| MoonshotAI: Kimi K2 0905 | $0.60 | $2.50 | 1296ms | 16 |
| Z.ai: GLM 4.6 | $0.60 | $2.20 | 2027ms | 41 |
| Qwen: Qwen3 Coder 480B A35B | $0.78 | $3.80 | 783ms | 4 |
| Z.ai: GLM 5 | $0.95 | $3.15 | 1680ms | 43 |
| Z.ai: GLM 5 Turbo | $1.20 | $4.00 | 1202ms | 3 |
| Z.ai: GLM 5.1 | $1.40 | $4.40 | 2679ms | 35 |
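
To make the pricing and latency columns easier to compare, here is a minimal sketch of how one request's cost and rough response time might be estimated from a single row. It assumes that "$/M" means USD per million tokens and that TPS is output tokens per second after the first token; neither is stated on the page. `ModelQuote`, `request_cost_usd`, and `response_time_s` are hypothetical names for illustration, not part of any marketplace API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelQuote:
    """One row of the Inference Models table (hypothetical container)."""
    name: str
    input_usd_per_m: float   # assumed: USD per 1,000,000 input tokens
    output_usd_per_m: float  # assumed: USD per 1,000,000 output tokens
    ttft_ms: float           # time to first token, in milliseconds
    tps: float               # assumed: output tokens per second


def request_cost_usd(q: ModelQuote, input_tokens: int, output_tokens: int) -> float:
    """Estimated cost of one request, billing input and output tokens separately."""
    total = input_tokens * q.input_usd_per_m + output_tokens * q.output_usd_per_m
    return total / 1_000_000


def response_time_s(q: ModelQuote, output_tokens: int) -> float:
    """Rough wall-clock estimate: wait for the first token, then stream at TPS."""
    return q.ttft_ms / 1000 + output_tokens / q.tps


# Figures copied from the gpt-oss-120b row of the table.
gpt_oss = ModelQuote("OpenAI: gpt-oss-120b", 0.10, 0.40, 593, 58)

cost = request_cost_usd(gpt_oss, input_tokens=2_000, output_tokens=500)
seconds = response_time_s(gpt_oss, output_tokens=500)
print(f"{gpt_oss.name}: ${cost:.6f} per request, about {seconds:.1f}s")
```

The listed TTFT and TPS values are point-in-time measurements, so the time estimate is only a rough guide. Rows with no TTFT or TPS value cannot be passed to `response_time_s`.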

Community Reviews

4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★ 2025-06-10

Good performance but support could be faster.