LIVE
Models: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 05:14 PMModels: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 05:14 PM
Marketplace
Providers Models
S

SiliconFlow

AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating

30-Day Uptime

100%
2026-03-232026-04-21

Inference Latency

OpenAI: gpt-oss-20b1530ms TTFT · 18 TPS
OpenAI: gpt-oss-120b2765ms TTFT · 8 TPS
Qwen: Qwen3 Coder 30B A3B Instruct4458ms TTFT · 19 TPS
Qwen: Qwen3 30B A3B Thinking 25072622ms TTFT · 51 TPS
Qwen: Qwen3 235B A22B Instruct 25072372ms TTFT · 13 TPS
Qwen: Qwen3 30B A3B Instruct 25071358ms TTFT · 15 TPS
StepFun: Step 3.5 Flash4316ms TTFT · 26 TPS
Qwen: Qwen3 235B A22B Thinking 25072011ms TTFT · 24 TPS
Nex AGI: DeepSeek V3.1 Nex N11488ms TTFT · 42 TPS
Qwen: Qwen3 32B5023ms TTFT · 18 TPS
Z.ai: GLM 4.5 Air1582ms TTFT · 12 TPS
Qwen: QwQ 32B1141ms TTFT · 34 TPS
DeepSeek: DeepSeek V3 03241303ms TTFT · 9 TPS
Qwen: Qwen3 Coder 480B A35B993ms TTFT · 2 TPS
DeepSeek: DeepSeek V3.23650ms TTFT · 14 TPS
DeepSeek: DeepSeek V3.12125ms TTFT · 10 TPS
DeepSeek: DeepSeek V3.1 Terminus2231ms TTFT · 15 TPS
DeepSeek: DeepSeek V3.2 Exp2529ms TTFT · 17 TPS
Qwen: Qwen3 VL 30B A3B Instruct4391ms TTFT · 8 TPS
MiniMax: MiniMax M2.51555ms TTFT · 71 TPS

Inference Models

ModelInput $/MOutput $/MTTFTTPS
OpenAI: gpt-oss-20b$0.04$0.181530ms18
OpenAI: gpt-oss-120b$0.05$0.452765ms8
Qwen: Qwen3 Coder 30B A3B Instruct$0.07$0.284458ms19
Qwen: Qwen3 30B A3B Thinking 2507$0.09$0.302622ms51
Qwen: Qwen3 235B A22B Instruct 2507$0.09$0.602372ms13
Qwen: Qwen3 30B A3B Instruct 2507$0.09$0.301358ms15
StepFun: Step 3.5 Flash$0.10$0.304316ms26
Qwen: Qwen3 235B A22B Thinking 2507$0.13$0.602011ms24
Nex AGI: DeepSeek V3.1 Nex N1$0.14$0.501488ms42
Qwen: Qwen3 32B$0.14$0.575023ms18
Tencent: Hunyuan A13B Instruct$0.14$0.57
Z.ai: GLM 4.5 Air$0.14$0.861582ms12
Qwen: QwQ 32B$0.15$0.581141ms34
DeepSeek: DeepSeek V3 0324$0.25$1.001303ms9
Qwen: Qwen3 Coder 480B A35B$0.25$1.00993ms2
DeepSeek: DeepSeek V3.2$0.26$0.423650ms14
DeepSeek: DeepSeek V3.1$0.27$1.002125ms10
DeepSeek: DeepSeek V3.1 Terminus$0.27$1.002231ms15
DeepSeek: DeepSeek V3.2 Exp$0.27$0.412529ms17
Baidu: ERNIE 4.5 300B A47B $0.28$1.10
Qwen: Qwen3 VL 30B A3B Instruct$0.29$1.004391ms8
Qwen: Qwen3 VL 30B A3B Thinking$0.29$1.00
MiniMax: MiniMax M2.5$0.30$1.201555ms71
Qwen: Qwen3 VL 235B A22B Instruct$0.30$1.50
Z.ai: GLM 4.6V$0.30$0.902213ms15
Z.ai: GLM 4.6$0.39$1.902865ms18
MoonshotAI: Kimi K2 0905$0.40$2.001700ms6
Z.ai: GLM 4.7$0.45$2.202948ms41
Qwen: Qwen3 VL 235B A22B Thinking$0.45$3.502531ms16
MoonshotAI: Kimi K2.5$0.45$2.252451ms26
DeepSeek: R1 0528$0.50$2.186782ms14
Z.ai: GLM 5$0.95$2.552075ms31
Z.ai: GLM 5.1$1.40$4.402522ms26

Community Reviews

4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★2025-06-10

Good performance but support could be faster.