LIVE
Models: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 04:29 PMModels: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 04:29 PM
Marketplace
Providers Models
S

SiliconFlow

AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating

30-Day Uptime

96.7%
2026-05-222026-06-20

Inference Latency

Tencent: Hy3 preview (free)3863ms TTFT · 47 TPS
Nex AGI: Nex-N2-Pro (free)32051ms TTFT · 7 TPS
OpenAI: gpt-oss-20b1198ms TTFT · 61 TPS
OpenAI: gpt-oss-120b1361ms TTFT · 18 TPS
Tencent: Hy3 preview4354ms TTFT · 53 TPS
Qwen: Qwen3 Coder 30B A3B Instruct1254ms TTFT · 35 TPS
Qwen: Qwen3 30B A3B Instruct 25071700ms TTFT · 26 TPS
Qwen: Qwen3.5-9B1682ms TTFT · 16 TPS
StepFun: Step 3.5 Flash2134ms TTFT · 45 TPS
Google: Gemma 4 26B A4B 1589ms TTFT · 7 TPS
Google: Gemma 4 31B2423ms TTFT · 18 TPS
DeepSeek: DeepSeek V4 Flash1680ms TTFT · 56 TPS
Nex AGI: DeepSeek V3.1 Nex N12319ms TTFT · 32 TPS
Tencent: Hunyuan A13B Instruct1028ms TTFT · 7 TPS
Z.ai: GLM 4.5 Air1654ms TTFT · 21 TPS
Qwen: Qwen3 32B1703ms TTFT · 29 TPS
Qwen: QwQ 32B1325ms TTFT · 44 TPS
Qwen: Qwen3.6 35B A3B1835ms TTFT · 42 TPS
Qwen: Qwen3.5-35B-A3B1125ms TTFT · 18 TPS
DeepSeek: DeepSeek V3 0324883ms TTFT · 29 TPS

Inference Models

ModelInput $/MOutput $/MTTFTTPS
Tencent: Hy3 preview (free)$0.00$0.003863ms47
Nex AGI: Nex-N2-Pro (free)$0.00$0.0032051ms7
OpenAI: gpt-oss-20b$0.04$0.181198ms61
OpenAI: gpt-oss-120b$0.05$0.451361ms18
Tencent: Hy3 preview$0.07$0.264354ms53
Qwen: Qwen3 Coder 30B A3B Instruct$0.07$0.281254ms35
Qwen: Qwen3 30B A3B Instruct 2507$0.09$0.301700ms26
Qwen: Qwen3.5-9B$0.10$0.151682ms16
StepFun: Step 3.5 Flash$0.10$0.302134ms45
Google: Gemma 4 26B A4B $0.12$0.401589ms7
Google: Gemma 4 31B$0.13$0.402423ms18
DeepSeek: DeepSeek V4 Flash$0.13$0.281680ms56
Nex AGI: DeepSeek V3.1 Nex N1$0.14$0.502319ms32
Tencent: Hunyuan A13B Instruct$0.14$0.571028ms7
Z.ai: GLM 4.5 Air$0.14$0.861654ms21
Qwen: Qwen3 32B$0.14$0.571703ms29
Qwen: QwQ 32B$0.15$0.581325ms44
Qwen: Qwen3.6 35B A3B$0.20$1.601835ms42
Qwen: Qwen3.5-35B-A3B$0.24$1.801125ms18
DeepSeek: DeepSeek V3 0324$0.25$1.00883ms29
Qwen: Qwen3.5-27B$0.25$2.006132ms15
DeepSeek: DeepSeek V3.2$0.26$0.423158ms14
Qwen: Qwen3.5-122B-A10B$0.26$2.081054ms17
DeepSeek: DeepSeek V3.1$0.27$1.001721ms18
DeepSeek: DeepSeek V3.1 Terminus$0.27$1.001501ms15
DeepSeek: DeepSeek V3.2 Exp$0.27$0.412782ms12
Qwen: Qwen3 VL 30B A3B Thinking$0.29$1.001253ms35
Qwen: Qwen3 VL 30B A3B Instruct$0.29$1.003215ms10
Qwen: Qwen3.6 27B$0.30$3.206273ms14
MiniMax: MiniMax M2.5$0.30$1.201187ms47
MoonshotAI: Kimi K2.5$0.45$2.251782ms28
DeepSeek: R1 0528$0.50$2.181900ms16
MoonshotAI: Kimi K2.6$0.77$4.00996ms31
MoonshotAI: Kimi K2.7 Code$0.94$4.003588ms23
Z.ai: GLM 5$0.95$2.552042ms41
Z.ai: GLM 5.1$1.19$3.742467ms33
DeepSeek: DeepSeek V4 Pro$1.60$3.141632ms65

Community Reviews

4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★2025-06-10

Good performance but support could be faster.