S
SiliconFlow
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
96.7%2026-05-222026-06-20
Inference Latency
Tencent: Hy3 preview (free)3863ms TTFT · 47 TPS
Nex AGI: Nex-N2-Pro (free)32051ms TTFT · 7 TPS
OpenAI: gpt-oss-20b1198ms TTFT · 61 TPS
OpenAI: gpt-oss-120b1361ms TTFT · 18 TPS
Tencent: Hy3 preview4354ms TTFT · 53 TPS
Qwen: Qwen3 Coder 30B A3B Instruct1254ms TTFT · 35 TPS
Qwen: Qwen3 30B A3B Instruct 25071700ms TTFT · 26 TPS
Qwen: Qwen3.5-9B1682ms TTFT · 16 TPS
StepFun: Step 3.5 Flash2134ms TTFT · 45 TPS
Google: Gemma 4 26B A4B 1589ms TTFT · 7 TPS
Google: Gemma 4 31B2423ms TTFT · 18 TPS
DeepSeek: DeepSeek V4 Flash1680ms TTFT · 56 TPS
Nex AGI: DeepSeek V3.1 Nex N12319ms TTFT · 32 TPS
Tencent: Hunyuan A13B Instruct1028ms TTFT · 7 TPS
Z.ai: GLM 4.5 Air1654ms TTFT · 21 TPS
Qwen: Qwen3 32B1703ms TTFT · 29 TPS
Qwen: QwQ 32B1325ms TTFT · 44 TPS
Qwen: Qwen3.6 35B A3B1835ms TTFT · 42 TPS
Qwen: Qwen3.5-35B-A3B1125ms TTFT · 18 TPS
DeepSeek: DeepSeek V3 0324883ms TTFT · 29 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| Tencent: Hy3 preview (free) | $0.00 | $0.00 | 3863ms | 47 |
| Nex AGI: Nex-N2-Pro (free) | $0.00 | $0.00 | 32051ms | 7 |
| OpenAI: gpt-oss-20b | $0.04 | $0.18 | 1198ms | 61 |
| OpenAI: gpt-oss-120b | $0.05 | $0.45 | 1361ms | 18 |
| Tencent: Hy3 preview | $0.07 | $0.26 | 4354ms | 53 |
| Qwen: Qwen3 Coder 30B A3B Instruct | $0.07 | $0.28 | 1254ms | 35 |
| Qwen: Qwen3 30B A3B Instruct 2507 | $0.09 | $0.30 | 1700ms | 26 |
| Qwen: Qwen3.5-9B | $0.10 | $0.15 | 1682ms | 16 |
| StepFun: Step 3.5 Flash | $0.10 | $0.30 | 2134ms | 45 |
| Google: Gemma 4 26B A4B | $0.12 | $0.40 | 1589ms | 7 |
| Google: Gemma 4 31B | $0.13 | $0.40 | 2423ms | 18 |
| DeepSeek: DeepSeek V4 Flash | $0.13 | $0.28 | 1680ms | 56 |
| Nex AGI: DeepSeek V3.1 Nex N1 | $0.14 | $0.50 | 2319ms | 32 |
| Tencent: Hunyuan A13B Instruct | $0.14 | $0.57 | 1028ms | 7 |
| Z.ai: GLM 4.5 Air | $0.14 | $0.86 | 1654ms | 21 |
| Qwen: Qwen3 32B | $0.14 | $0.57 | 1703ms | 29 |
| Qwen: QwQ 32B | $0.15 | $0.58 | 1325ms | 44 |
| Qwen: Qwen3.6 35B A3B | $0.20 | $1.60 | 1835ms | 42 |
| Qwen: Qwen3.5-35B-A3B | $0.24 | $1.80 | 1125ms | 18 |
| DeepSeek: DeepSeek V3 0324 | $0.25 | $1.00 | 883ms | 29 |
| Qwen: Qwen3.5-27B | $0.25 | $2.00 | 6132ms | 15 |
| DeepSeek: DeepSeek V3.2 | $0.26 | $0.42 | 3158ms | 14 |
| Qwen: Qwen3.5-122B-A10B | $0.26 | $2.08 | 1054ms | 17 |
| DeepSeek: DeepSeek V3.1 | $0.27 | $1.00 | 1721ms | 18 |
| DeepSeek: DeepSeek V3.1 Terminus | $0.27 | $1.00 | 1501ms | 15 |
| DeepSeek: DeepSeek V3.2 Exp | $0.27 | $0.41 | 2782ms | 12 |
| Qwen: Qwen3 VL 30B A3B Thinking | $0.29 | $1.00 | 1253ms | 35 |
| Qwen: Qwen3 VL 30B A3B Instruct | $0.29 | $1.00 | 3215ms | 10 |
| Qwen: Qwen3.6 27B | $0.30 | $3.20 | 6273ms | 14 |
| MiniMax: MiniMax M2.5 | $0.30 | $1.20 | 1187ms | 47 |
| MoonshotAI: Kimi K2.5 | $0.45 | $2.25 | 1782ms | 28 |
| DeepSeek: R1 0528 | $0.50 | $2.18 | 1900ms | 16 |
| MoonshotAI: Kimi K2.6 | $0.77 | $4.00 | 996ms | 31 |
| MoonshotAI: Kimi K2.7 Code | $0.94 | $4.00 | 3588ms | 23 |
| Z.ai: GLM 5 | $0.95 | $2.55 | 2042ms | 41 |
| Z.ai: GLM 5.1 | $1.19 | $3.74 | 2467ms | 33 |
| DeepSeek: DeepSeek V4 Pro | $1.60 | $3.14 | 1632ms | 65 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.