V
Venice
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
96.7%2026-03-232026-04-21
Inference Latency
Venice: Uncensored (free)1447ms TTFT · 36 TPS
Mistral: Mistral Small 3.2 24B562ms TTFT · 46 TPS
Qwen: Qwen3.5-9B575ms TTFT · 17 TPS
Z.ai: GLM 4.7 Flash605ms TTFT · 38 TPS
Mistral: Mistral Small 4790ms TTFT · 116 TPS
Qwen: Qwen3 VL 235B A22B Instruct1573ms TTFT · 14 TPS
Qwen: Qwen3 VL 30B A3B Instruct1708ms TTFT · 17 TPS
Arcee AI: Trinity Large Thinking532ms TTFT · 116 TPS
Qwen: Qwen3.5-35B-A3B995ms TTFT · 65 TPS
MiniMax: MiniMax M2.51376ms TTFT · 20 TPS
Qwen: Qwen3 Coder 480B A35B496ms TTFT · 4 TPS
Z.ai: GLM 4.71076ms TTFT · 23 TPS
MoonshotAI: Kimi K2.51295ms TTFT · 50 TPS
Z.ai: GLM 51172ms TTFT · 47 TPS
Z.ai: GLM 52797ms TTFT · 21 TPS
Z.ai: GLM 5.11015ms TTFT · 27 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| Nous: Hermes 3 405B Instruct (free) | $0.00 | $0.00 | — | — |
| Qwen: Qwen3 4B (free) | $0.00 | $0.00 | — | — |
| Meta: Llama 3.3 70B Instruct (free) | $0.00 | $0.00 | — | — |
| Venice: Uncensored (free) | $0.00 | $0.00 | 1447ms | 36 |
| Mistral: Mistral Small 3.1 24B (free) | $0.00 | $0.00 | — | — |
| Qwen: Qwen3 Coder 480B A35B (free) | $0.00 | $0.00 | — | — |
| Qwen: Qwen3 Next 80B A3B Instruct (free) | $0.00 | $0.00 | — | — |
| Meta: Llama 3.2 3B Instruct (free) | $0.00 | $0.00 | — | — |
| Mistral: Mistral Small 3.2 24B | $0.09 | $0.25 | 562ms | 46 |
| Qwen: Qwen3.5-9B | $0.10 | $0.15 | 575ms | 17 |
| Z.ai: GLM 4.7 Flash | $0.13 | $0.50 | 605ms | 38 |
| Mistral: Mistral Small 4 | $0.19 | $0.75 | 790ms | 116 |
| Qwen: Qwen3 VL 235B A22B Instruct | $0.25 | $1.50 | 1573ms | 14 |
| Qwen: Qwen3 VL 30B A3B Instruct | $0.25 | $0.90 | 1708ms | 17 |
| Arcee AI: Trinity Large Thinking | $0.31 | $1.13 | 532ms | 116 |
| Qwen: Qwen3.5-35B-A3B | $0.31 | $1.25 | 995ms | 65 |
| MiniMax: MiniMax M2.5 | $0.34 | $1.19 | 1376ms | 20 |
| Qwen: Qwen3 Coder 480B A35B | $0.35 | $1.50 | 496ms | 4 |
| Z.ai: GLM 4.7 | $0.55 | $2.65 | 1076ms | 23 |
| MoonshotAI: Kimi K2.5 | $0.56 | $3.50 | 1295ms | 50 |
| Z.ai: GLM 5 | $1.00 | $3.20 | 1172ms | 47 |
| Z.ai: GLM 5 | $1.10 | $4.15 | 2797ms | 21 |
| Z.ai: GLM 5.1 | $1.75 | $5.50 | 1015ms | 27 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.