LIVE
Models: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 05:19 PMModels: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 05:19 PM
Marketplace
Providers Models
V

Venice

AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating

30-Day Uptime

96.7%
2026-03-232026-04-21

Inference Latency

Venice: Uncensored (free)1447ms TTFT · 36 TPS
Mistral: Mistral Small 3.2 24B562ms TTFT · 46 TPS
Qwen: Qwen3.5-9B575ms TTFT · 17 TPS
Z.ai: GLM 4.7 Flash605ms TTFT · 38 TPS
Mistral: Mistral Small 4790ms TTFT · 116 TPS
Qwen: Qwen3 VL 235B A22B Instruct1573ms TTFT · 14 TPS
Qwen: Qwen3 VL 30B A3B Instruct1708ms TTFT · 17 TPS
Arcee AI: Trinity Large Thinking532ms TTFT · 116 TPS
Qwen: Qwen3.5-35B-A3B995ms TTFT · 65 TPS
MiniMax: MiniMax M2.51376ms TTFT · 20 TPS
Qwen: Qwen3 Coder 480B A35B496ms TTFT · 4 TPS
Z.ai: GLM 4.71076ms TTFT · 23 TPS
MoonshotAI: Kimi K2.51295ms TTFT · 50 TPS
Z.ai: GLM 51172ms TTFT · 47 TPS
Z.ai: GLM 52797ms TTFT · 21 TPS
Z.ai: GLM 5.11015ms TTFT · 27 TPS

Inference Models

ModelInput $/MOutput $/MTTFTTPS
Nous: Hermes 3 405B Instruct (free)$0.00$0.00
Qwen: Qwen3 4B (free)$0.00$0.00
Meta: Llama 3.3 70B Instruct (free)$0.00$0.00
Venice: Uncensored (free)$0.00$0.001447ms36
Mistral: Mistral Small 3.1 24B (free)$0.00$0.00
Qwen: Qwen3 Coder 480B A35B (free)$0.00$0.00
Qwen: Qwen3 Next 80B A3B Instruct (free)$0.00$0.00
Meta: Llama 3.2 3B Instruct (free)$0.00$0.00
Mistral: Mistral Small 3.2 24B$0.09$0.25562ms46
Qwen: Qwen3.5-9B$0.10$0.15575ms17
Z.ai: GLM 4.7 Flash$0.13$0.50605ms38
Mistral: Mistral Small 4$0.19$0.75790ms116
Qwen: Qwen3 VL 235B A22B Instruct$0.25$1.501573ms14
Qwen: Qwen3 VL 30B A3B Instruct$0.25$0.901708ms17
Arcee AI: Trinity Large Thinking$0.31$1.13532ms116
Qwen: Qwen3.5-35B-A3B$0.31$1.25995ms65
MiniMax: MiniMax M2.5$0.34$1.191376ms20
Qwen: Qwen3 Coder 480B A35B$0.35$1.50496ms4
Z.ai: GLM 4.7$0.55$2.651076ms23
MoonshotAI: Kimi K2.5$0.56$3.501295ms50
Z.ai: GLM 5$1.00$3.201172ms47
Z.ai: GLM 5$1.10$4.152797ms21
Z.ai: GLM 5.1$1.75$5.501015ms27

Community Reviews

4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★2025-06-10

Good performance but support could be faster.