LIVE
Models: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 05:18 PMModels: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 05:18 PM
Marketplace
Providers Models
P

Parasail

AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating

30-Day Uptime

100%
2026-03-232026-04-21

Inference Latency

OpenAI: gpt-oss-20b396ms TTFT · 102 TPS
Google: Gemma 3 27B603ms TTFT · 25 TPS
Mistral: Mistral Small 3.2 24B502ms TTFT · 28 TPS
OpenAI: gpt-oss-120b602ms TTFT · 19 TPS
AllenAI: Olmo 3 7B Instruct187ms TTFT · 44 TPS
Qwen: Qwen3 Next 80B A3B Instruct914ms TTFT · 59 TPS
ByteDance: UI-TARS 7B 827ms TTFT · 20 TPS
Google: Gemma 4 26B A4B 434ms TTFT · 42 TPS
Qwen: Qwen3 235B A22B Instruct 2507459ms TTFT · 27 TPS
AllenAI: Olmo 3 7B Think183ms TTFT · 78 TPS
Google: Gemma 4 31B749ms TTFT · 7 TPS
AllenAI: Olmo 3.1 32B Think244ms TTFT · 81 TPS
AllenAI: Olmo 3 32B Think263ms TTFT · 79 TPS
MiniMax: MiniMax M2.51262ms TTFT · 30 TPS
Qwen: Qwen3 Coder Next853ms TTFT · 72 TPS
Qwen: Qwen3.5-35B-A3B977ms TTFT · 56 TPS
AllenAI: Molmo2 8B283ms TTFT · 39 TPS
Qwen: Qwen3 VL 235B A22B Instruct1237ms TTFT · 18 TPS
Arcee AI: Trinity Large Thinking1035ms TTFT · 57 TPS
Meta: Llama 3.3 70B Instruct485ms TTFT · 29 TPS

Inference Models

ModelInput $/MOutput $/MTTFTTPS
OpenAI: gpt-oss-20b$0.04$0.20396ms102
Google: Gemma 3 27B$0.08$0.45603ms25
Mistral: Mistral Small 3.2 24B$0.09$0.60502ms28
OpenAI: gpt-oss-120b$0.10$0.75602ms19
AllenAI: Olmo 3 7B Instruct$0.10$0.20187ms44
Qwen: Qwen3 Next 80B A3B Instruct$0.10$1.10914ms59
ByteDance: UI-TARS 7B $0.10$0.20827ms20
Google: Gemma 4 26B A4B $0.10$0.40434ms42
Qwen: Qwen3 235B A22B Instruct 2507$0.10$0.60459ms27
AllenAI: Olmo 3 7B Think$0.12$0.20183ms78
Google: Gemma 4 31B$0.14$0.40749ms7
AllenAI: Olmo 3.1 32B Think$0.15$0.50244ms81
AllenAI: Olmo 3 32B Think$0.15$0.50263ms79
MiniMax: MiniMax M2.5$0.15$1.201262ms30
Qwen: Qwen3 Coder Next$0.15$0.80853ms72
Qwen: Qwen3.5-35B-A3B$0.20$1.00977ms56
AllenAI: Molmo2 8B$0.20$0.20283ms39
Qwen: Qwen3 VL 235B A22B Instruct$0.21$1.901237ms18
Arcee AI: Trinity Large Thinking$0.22$0.851035ms57
Meta: Llama 3.3 70B Instruct$0.22$0.50485ms29
Qwen: Qwen3 VL 8B Instruct$0.25$0.75956ms15
DeepSeek: DeepSeek V3.2$0.28$0.45844ms17
TheDrummer: Cydonia 24B V4.1$0.30$0.50616ms44
Meta: Llama 4 Maverick$0.35$1.00405ms51
Z.ai: GLM 4.7$0.45$2.10990ms27
TheDrummer: Skyfall 36B V2$0.55$0.80619ms41
Qwen: Qwen3.5 397B A17B$0.60$3.60829ms66
MoonshotAI: Kimi K2.5$0.60$2.80698ms36
MoonshotAI: Kimi K2.6$0.60$2.801802ms15
Qwen: Qwen2.5 VL 72B Instruct$0.80$1.001020ms22
Z.ai: GLM 5$1.00$3.20839ms22
Z.ai: GLM 5.1$1.40$4.40

Community Reviews

4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★2025-06-10

Good performance but support could be faster.