A
AtlasCloud
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
100%2026-05-222026-06-20
Inference Latency
Qwen: Qwen3 8B776ms TTFT · 38 TPS
Qwen: Qwen3 VL 8B Instruct573ms TTFT · 11 TPS
Qwen: Qwen3 30B A3B Thinking 2507593ms TTFT · 73 TPS
Tongyi DeepResearch 30B A3B275ms TTFT · 98 TPS
Qwen: Qwen3 32B1097ms TTFT · 13 TPS
Qwen: Qwen3 30B A3B Instruct 25072544ms TTFT · 18 TPS
DeepSeek: DeepSeek V4 Flash1908ms TTFT · 3 TPS
Qwen: Qwen3 Next 80B A3B Instruct1353ms TTFT · 20 TPS
Qwen: Qwen3 VL 30B A3B Instruct1346ms TTFT · 9 TPS
Qwen: Qwen3.6 35B A3B1109ms TTFT · 30 TPS
Qwen: Qwen3 Coder Next907ms TTFT · 10 TPS
Qwen: Qwen3 235B A22B Instruct 2507922ms TTFT · 15 TPS
DeepSeek: DeepSeek V3.1 Terminus (exacto)1319ms TTFT · 24 TPS
DeepSeek: DeepSeek V3 03241117ms TTFT · 13 TPS
Qwen: Qwen3.5-35B-A3B773ms TTFT · 4 TPS
MiniMax: MiniMax M21496ms TTFT · 21 TPS
DeepSeek: DeepSeek V3.21316ms TTFT · 16 TPS
Qwen: Qwen3.5-27B1280ms TTFT · 31 TPS
DeepSeek: DeepSeek V3.2 Exp1413ms TTFT · 12 TPS
Qwen: Qwen3 235B A22B Thinking 25071200ms TTFT · 43 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| Qwen: Qwen3 8B | $0.05 | $0.40 | 776ms | 38 |
| Qwen: Qwen3 VL 8B Instruct | $0.08 | $0.50 | 573ms | 11 |
| Qwen: Qwen3 30B A3B Thinking 2507 | $0.08 | $0.40 | 593ms | 73 |
| Tongyi DeepResearch 30B A3B | $0.09 | $0.45 | 275ms | 98 |
| Qwen: Qwen3 32B | $0.10 | $1.20 | 1097ms | 13 |
| Qwen: Qwen3 30B A3B Instruct 2507 | $0.10 | $0.30 | 2544ms | 18 |
| DeepSeek: DeepSeek V4 Flash | $0.14 | $0.28 | 1908ms | 3 |
| Qwen: Qwen3 Next 80B A3B Thinking | $0.15 | $1.50 | — | — |
| Qwen: Qwen3 Next 80B A3B Instruct | $0.15 | $1.50 | 1353ms | 20 |
| Qwen: Qwen3 VL 30B A3B Instruct | $0.15 | $0.60 | 1346ms | 9 |
| Qwen: Qwen3.6 35B A3B | $0.16 | $0.97 | 1109ms | 30 |
| Qwen: Qwen3 Coder Next | $0.18 | $1.35 | 907ms | 10 |
| Meituan: LongCat Flash Chat | $0.20 | $0.80 | — | — |
| Qwen: Qwen3 235B A22B Instruct 2507 | $0.20 | $0.88 | 922ms | 15 |
| DeepSeek: DeepSeek V3.1 Terminus (exacto) | $0.21 | $0.80 | 1319ms | 24 |
| DeepSeek: DeepSeek V3 0324 | $0.22 | $0.88 | 1117ms | 13 |
| Qwen: Qwen3.5-35B-A3B | $0.22 | $1.80 | 773ms | 4 |
| MiniMax: MiniMax M2 | $0.26 | $1.00 | 1496ms | 21 |
| DeepSeek: DeepSeek V3.2 | $0.26 | $0.38 | 1316ms | 16 |
| Qwen: Qwen3.5-27B | $0.27 | $2.16 | 1280ms | 31 |
| DeepSeek: DeepSeek V3.2 Exp | $0.27 | $0.41 | 1413ms | 12 |
| Qwen: Qwen3 235B A22B Thinking 2507 | $0.28 | $2.30 | 1200ms | 43 |
| DeepSeek: DeepSeek V3.2 Speciale | $0.29 | $0.43 | — | — |
| MiniMax: MiniMax M2.1 | $0.29 | $0.95 | 4590ms | 10 |
| MiniMax: MiniMax M2.5 | $0.30 | $1.20 | 5051ms | 32 |
| Qwen: Qwen3 VL 235B A22B Instruct | $0.30 | $1.50 | 3059ms | 12 |
| DeepSeek: DeepSeek V3.1 Terminus | $0.30 | $0.95 | 1276ms | 3 |
| Kwaipilot: KAT-Coder-Pro V2 | $0.30 | $1.20 | 3052ms | 13 |
| Qwen: Qwen3.5-122B-A10B | $0.30 | $2.40 | 941ms | 7 |
| DeepSeek: DeepSeek V3.1 | $0.30 | $0.95 | 1210ms | 9 |
| MiniMax: MiniMax M3 | $0.42 | $1.68 | 1483ms | 31 |
| MoonshotAI: Kimi K2.5 | $0.49 | $2.50 | 1741ms | 15 |
| Z.ai: GLM 4.7 | $0.52 | $1.85 | 1441ms | 37 |
| Qwen: Qwen3.5 397B A17B | $0.55 | $3.50 | 1370ms | 40 |
| DeepSeek: R1 0528 | $0.55 | $2.15 | 3064ms | 23 |
| Z.ai: GLM 4.6 | $0.60 | $2.20 | 3031ms | 25 |
| MoonshotAI: Kimi K2 0905 | $0.60 | $2.50 | 923ms | 15 |
| MoonshotAI: Kimi K2 Thinking | $0.60 | $2.50 | 750ms | 57 |
| Qwen: Qwen3 Coder 480B A35B | $0.78 | $3.80 | 1706ms | 11 |
| MoonshotAI: Kimi K2.6 | $0.95 | $4.00 | 1080ms | 31 |
| MoonshotAI: Kimi K2.7 Code | $0.95 | $4.00 | 1219ms | 31 |
| Z.ai: GLM 5 | $0.95 | $3.15 | 2705ms | 18 |
| Z.ai: GLM 5 Turbo | $1.20 | $4.00 | 1985ms | 12 |
| Z.ai: GLM 5.1 | $1.26 | $3.96 | 1876ms | 27 |
| Z.ai: GLM 5.2 | $1.40 | $4.40 | 1567ms | 24 |
| DeepSeek: DeepSeek V4 Pro | $1.68 | $3.38 | 2409ms | 53 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.