S
SiliconFlow
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
100%2026-03-232026-04-21
Inference Latency
OpenAI: gpt-oss-20b1530ms TTFT · 18 TPS
OpenAI: gpt-oss-120b2765ms TTFT · 8 TPS
Qwen: Qwen3 Coder 30B A3B Instruct4458ms TTFT · 19 TPS
Qwen: Qwen3 30B A3B Thinking 25072622ms TTFT · 51 TPS
Qwen: Qwen3 235B A22B Instruct 25072372ms TTFT · 13 TPS
Qwen: Qwen3 30B A3B Instruct 25071358ms TTFT · 15 TPS
StepFun: Step 3.5 Flash4316ms TTFT · 26 TPS
Qwen: Qwen3 235B A22B Thinking 25072011ms TTFT · 24 TPS
Nex AGI: DeepSeek V3.1 Nex N11488ms TTFT · 42 TPS
Qwen: Qwen3 32B5023ms TTFT · 18 TPS
Z.ai: GLM 4.5 Air1582ms TTFT · 12 TPS
Qwen: QwQ 32B1141ms TTFT · 34 TPS
DeepSeek: DeepSeek V3 03241303ms TTFT · 9 TPS
Qwen: Qwen3 Coder 480B A35B993ms TTFT · 2 TPS
DeepSeek: DeepSeek V3.23650ms TTFT · 14 TPS
DeepSeek: DeepSeek V3.12125ms TTFT · 10 TPS
DeepSeek: DeepSeek V3.1 Terminus2231ms TTFT · 15 TPS
DeepSeek: DeepSeek V3.2 Exp2529ms TTFT · 17 TPS
Qwen: Qwen3 VL 30B A3B Instruct4391ms TTFT · 8 TPS
MiniMax: MiniMax M2.51555ms TTFT · 71 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| OpenAI: gpt-oss-20b | $0.04 | $0.18 | 1530ms | 18 |
| OpenAI: gpt-oss-120b | $0.05 | $0.45 | 2765ms | 8 |
| Qwen: Qwen3 Coder 30B A3B Instruct | $0.07 | $0.28 | 4458ms | 19 |
| Qwen: Qwen3 30B A3B Thinking 2507 | $0.09 | $0.30 | 2622ms | 51 |
| Qwen: Qwen3 235B A22B Instruct 2507 | $0.09 | $0.60 | 2372ms | 13 |
| Qwen: Qwen3 30B A3B Instruct 2507 | $0.09 | $0.30 | 1358ms | 15 |
| StepFun: Step 3.5 Flash | $0.10 | $0.30 | 4316ms | 26 |
| Qwen: Qwen3 235B A22B Thinking 2507 | $0.13 | $0.60 | 2011ms | 24 |
| Nex AGI: DeepSeek V3.1 Nex N1 | $0.14 | $0.50 | 1488ms | 42 |
| Qwen: Qwen3 32B | $0.14 | $0.57 | 5023ms | 18 |
| Tencent: Hunyuan A13B Instruct | $0.14 | $0.57 | — | — |
| Z.ai: GLM 4.5 Air | $0.14 | $0.86 | 1582ms | 12 |
| Qwen: QwQ 32B | $0.15 | $0.58 | 1141ms | 34 |
| DeepSeek: DeepSeek V3 0324 | $0.25 | $1.00 | 1303ms | 9 |
| Qwen: Qwen3 Coder 480B A35B | $0.25 | $1.00 | 993ms | 2 |
| DeepSeek: DeepSeek V3.2 | $0.26 | $0.42 | 3650ms | 14 |
| DeepSeek: DeepSeek V3.1 | $0.27 | $1.00 | 2125ms | 10 |
| DeepSeek: DeepSeek V3.1 Terminus | $0.27 | $1.00 | 2231ms | 15 |
| DeepSeek: DeepSeek V3.2 Exp | $0.27 | $0.41 | 2529ms | 17 |
| Baidu: ERNIE 4.5 300B A47B | $0.28 | $1.10 | — | — |
| Qwen: Qwen3 VL 30B A3B Instruct | $0.29 | $1.00 | 4391ms | 8 |
| Qwen: Qwen3 VL 30B A3B Thinking | $0.29 | $1.00 | — | — |
| MiniMax: MiniMax M2.5 | $0.30 | $1.20 | 1555ms | 71 |
| Qwen: Qwen3 VL 235B A22B Instruct | $0.30 | $1.50 | — | — |
| Z.ai: GLM 4.6V | $0.30 | $0.90 | 2213ms | 15 |
| Z.ai: GLM 4.6 | $0.39 | $1.90 | 2865ms | 18 |
| MoonshotAI: Kimi K2 0905 | $0.40 | $2.00 | 1700ms | 6 |
| Z.ai: GLM 4.7 | $0.45 | $2.20 | 2948ms | 41 |
| Qwen: Qwen3 VL 235B A22B Thinking | $0.45 | $3.50 | 2531ms | 16 |
| MoonshotAI: Kimi K2.5 | $0.45 | $2.25 | 2451ms | 26 |
| DeepSeek: R1 0528 | $0.50 | $2.18 | 6782ms | 14 |
| Z.ai: GLM 5 | $0.95 | $2.55 | 2075ms | 31 |
| Z.ai: GLM 5.1 | $1.40 | $4.40 | 2522ms | 26 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.