A
AtlasCloud
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
100%2026-03-232026-04-21
Inference Latency
Qwen: Qwen2.5 7B Instruct1120ms TTFT · 23 TPS
Qwen: Qwen3 8B1147ms TTFT · 52 TPS
Qwen: Qwen3 30B A3B Thinking 25071268ms TTFT · 99 TPS
Qwen: Qwen3 VL 8B Instruct711ms TTFT · 50 TPS
Tongyi DeepResearch 30B A3B188ms TTFT · 184 TPS
Qwen: Qwen3 30B A3B Instruct 25071597ms TTFT · 35 TPS
OpenAI: gpt-oss-120b593ms TTFT · 58 TPS
Qwen: Qwen3 32B1003ms TTFT · 6 TPS
Qwen: Qwen3 Next 80B A3B Thinking960ms TTFT · 142 TPS
Qwen: Qwen3 Next 80B A3B Instruct1156ms TTFT · 84 TPS
Qwen: Qwen3 Coder Next2444ms TTFT · 24 TPS
Qwen: Qwen3 235B A22B Instruct 25071099ms TTFT · 16 TPS
DeepSeek: DeepSeek V3.1 Terminus (exacto)1319ms TTFT · 24 TPS
DeepSeek: DeepSeek V3 03242106ms TTFT · 23 TPS
Qwen: Qwen3.5-35B-A3B1200ms TTFT · 129 TPS
MiniMax: MiniMax M22244ms TTFT · 28 TPS
DeepSeek: DeepSeek V3.21690ms TTFT · 23 TPS
DeepSeek: DeepSeek V3.2 Exp1344ms TTFT · 29 TPS
Qwen: Qwen3.5-27B1013ms TTFT · 6 TPS
Qwen: Qwen3 235B A22B Thinking 25071214ms TTFT · 45 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| Qwen: Qwen2.5 7B Instruct | $0.04 | $0.10 | 1120ms | 23 |
| Qwen: Qwen3 8B | $0.05 | $0.40 | 1147ms | 52 |
| Qwen: Qwen3 30B A3B Thinking 2507 | $0.08 | $0.40 | 1268ms | 99 |
| Qwen: Qwen3 VL 8B Instruct | $0.08 | $0.50 | 711ms | 50 |
| Tongyi DeepResearch 30B A3B | $0.09 | $0.45 | 188ms | 184 |
| Qwen: Qwen3 30B A3B Instruct 2507 | $0.10 | $0.30 | 1597ms | 35 |
| Xiaomi: MiMo-V2-Flash | $0.10 | $0.30 | — | — |
| OpenAI: gpt-oss-120b | $0.10 | $0.40 | 593ms | 58 |
| Qwen: Qwen3 32B | $0.10 | $1.20 | 1003ms | 6 |
| Qwen: Qwen3 Next 80B A3B Thinking | $0.15 | $1.50 | 960ms | 142 |
| Qwen: Qwen3 VL 30B A3B Instruct | $0.15 | $0.60 | — | — |
| Qwen: Qwen3 Next 80B A3B Instruct | $0.15 | $1.50 | 1156ms | 84 |
| Qwen: Qwen3 Coder Next | $0.18 | $1.35 | 2444ms | 24 |
| Qwen: Qwen3 235B A22B Instruct 2507 | $0.20 | $0.88 | 1099ms | 16 |
| Meituan: LongCat Flash Chat | $0.20 | $0.80 | — | — |
| DeepSeek: DeepSeek V3.1 Terminus (exacto) | $0.21 | $0.80 | 1319ms | 24 |
| DeepSeek: DeepSeek V3 0324 | $0.22 | $0.88 | 2106ms | 23 |
| Qwen: Qwen3.5-35B-A3B | $0.22 | $1.80 | 1200ms | 129 |
| MiniMax: MiniMax M2 | $0.26 | $1.00 | 2244ms | 28 |
| DeepSeek: DeepSeek V3.2 | $0.26 | $0.38 | 1690ms | 23 |
| DeepSeek: DeepSeek V3.2 Exp | $0.27 | $0.41 | 1344ms | 29 |
| Qwen: Qwen3.5-27B | $0.27 | $2.16 | 1013ms | 6 |
| Qwen: Qwen3 235B A22B Thinking 2507 | $0.28 | $2.30 | 1214ms | 45 |
| MiniMax: MiniMax M2.1 | $0.29 | $0.95 | 3834ms | 45 |
| MiniMax: MiniMax M2.5 | $0.30 | $1.20 | 4182ms | 35 |
| Kwaipilot: KAT-Coder-Pro V2 | $0.30 | $1.20 | 1580ms | 27 |
| DeepSeek: DeepSeek V3.1 Terminus | $0.30 | $0.95 | 1874ms | 26 |
| Qwen: Qwen3.5-122B-A10B | $0.30 | $2.40 | 1001ms | 9 |
| DeepSeek: DeepSeek V3.1 | $0.30 | $0.95 | 1636ms | 24 |
| Qwen: Qwen3 VL 235B A22B Instruct | $0.30 | $1.50 | 5087ms | 10 |
| DeepSeek: DeepSeek V3.2 Speciale | $0.40 | $1.20 | 2955ms | 24 |
| MoonshotAI: Kimi K2.5 | $0.49 | $2.50 | 2315ms | 32 |
| DeepSeek: DeepSeek V3.2 | $0.50 | $1.50 | 1532ms | 25 |
| Z.ai: GLM 4.7 | $0.52 | $1.85 | 1345ms | 43 |
| Qwen: Qwen3.5 397B A17B | $0.55 | $3.50 | 1436ms | 71 |
| DeepSeek: R1 0528 | $0.55 | $2.15 | 3226ms | 22 |
| MoonshotAI: Kimi K2 Thinking | $0.60 | $2.50 | 683ms | 24 |
| MoonshotAI: Kimi K2 0905 | $0.60 | $2.50 | 1296ms | 16 |
| Z.ai: GLM 4.6 | $0.60 | $2.20 | 2027ms | 41 |
| Qwen: Qwen3 Coder 480B A35B | $0.78 | $3.80 | 783ms | 4 |
| Z.ai: GLM 5 | $0.95 | $3.15 | 1680ms | 43 |
| Z.ai: GLM 5 Turbo | $1.20 | $4.00 | 1202ms | 3 |
| Z.ai: GLM 5.1 | $1.40 | $4.40 | 2679ms | 35 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.