LIVE
Models: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 04:34 PMModels: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 04:34 PM
Marketplace
Providers Models
P

Parasail

AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating

30-Day Uptime

100%
2026-05-222026-06-20

Inference Latency

OpenAI: gpt-oss-20b330ms TTFT · 204 TPS
Google: Gemma 3 27B628ms TTFT · 37 TPS
Mistral: Mistral Small 3.2 24B591ms TTFT · 35 TPS
Qwen: Qwen3 Next 80B A3B Instruct3058ms TTFT · 40 TPS
ByteDance: UI-TARS 7B 769ms TTFT · 21 TPS
OpenAI: gpt-oss-120b325ms TTFT · 132 TPS
AllenAI: Olmo 3 7B Instruct187ms TTFT · 44 TPS
Qwen: Qwen3 Coder Next1115ms TTFT · 25 TPS
AllenAI: Olmo 3 7B Think183ms TTFT · 78 TPS
Google: Gemma 4 26B A4B 1019ms TTFT · 19 TPS
Qwen: Qwen3 235B A22B Instruct 2507494ms TTFT · 31 TPS
DeepSeek: DeepSeek V4 Flash536ms TTFT · 25 TPS
AllenAI: Olmo 3.1 32B Think244ms TTFT · 81 TPS
AllenAI: Olmo 3 32B Think263ms TTFT · 79 TPS
Qwen: Qwen3.6 35B A3B538ms TTFT · 89 TPS
Google: Gemma 4 31B1017ms TTFT · 22 TPS
Qwen: Qwen3.5-35B-A3B840ms TTFT · 97 TPS
AllenAI: Molmo2 8B283ms TTFT · 39 TPS
Qwen: Qwen3 VL 235B A22B Instruct4218ms TTFT · 16 TPS
Meta: Llama 3.3 70B Instruct600ms TTFT · 40 TPS

Inference Models

ModelInput $/MOutput $/MTTFTTPS
OpenAI: gpt-oss-20b$0.04$0.20330ms204
Google: Gemma 3 27B$0.08$0.45628ms37
Mistral: Mistral Small 3.2 24B$0.09$0.30591ms35
Qwen: Qwen3 Next 80B A3B Instruct$0.10$1.103058ms40
ByteDance: UI-TARS 7B $0.10$0.20769ms21
OpenAI: gpt-oss-120b$0.10$0.75325ms132
AllenAI: Olmo 3 7B Instruct$0.10$0.20187ms44
Qwen: Qwen3 Coder Next$0.12$0.801115ms25
AllenAI: Olmo 3 7B Think$0.12$0.20183ms78
Google: Gemma 4 26B A4B $0.13$0.401019ms19
Qwen: Qwen3 235B A22B Instruct 2507$0.14$0.80494ms31
DeepSeek: DeepSeek V4 Flash$0.14$0.28536ms25
AllenAI: Olmo 3.1 32B Think$0.15$0.50244ms81
AllenAI: Olmo 3 32B Think$0.15$0.50263ms79
Qwen: Qwen3.6 35B A3B$0.15$1.00538ms89
Google: Gemma 4 31B$0.15$0.401017ms22
Qwen: Qwen3.5-35B-A3B$0.15$1.00840ms97
AllenAI: Molmo2 8B$0.20$0.20283ms39
Qwen: Qwen3 VL 235B A22B Instruct$0.21$1.904218ms16
Meta: Llama 3.3 70B Instruct$0.22$0.50600ms40
Qwen: Qwen3 VL 8B Instruct$0.25$0.75793ms44
TheDrummer: Cydonia 24B V4.1$0.30$0.50966ms30
MiniMax: MiniMax M3$0.30$1.201370ms22
MiniMax: MiniMax M2.5$0.30$1.20457ms118
Meta: Llama 4 Maverick$0.35$1.00431ms53
Qwen: Qwen3.5 397B A17B$0.50$3.60433ms86
TheDrummer: Skyfall 36B V2$0.55$0.80
MoonshotAI: Kimi K2.7 Code$0.75$3.501683ms39
MoonshotAI: Kimi K2.6$0.75$3.50716ms68
Qwen: Qwen2.5 VL 72B Instruct$0.80$1.001440ms16
Z.ai: GLM 5$1.00$3.20699ms34
Z.ai: GLM 5.2$1.40$4.407277ms9
Z.ai: GLM 5.1$1.40$4.401017ms53
DeepSeek: DeepSeek V4 Pro$1.74$3.48479ms12

Community Reviews

4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★2025-06-10

Good performance but support could be faster.