P
Parasail
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
100%2026-05-222026-06-20
Inference Latency
OpenAI: gpt-oss-20b330ms TTFT · 204 TPS
Google: Gemma 3 27B628ms TTFT · 37 TPS
Mistral: Mistral Small 3.2 24B591ms TTFT · 35 TPS
Qwen: Qwen3 Next 80B A3B Instruct3058ms TTFT · 40 TPS
ByteDance: UI-TARS 7B 769ms TTFT · 21 TPS
OpenAI: gpt-oss-120b325ms TTFT · 132 TPS
AllenAI: Olmo 3 7B Instruct187ms TTFT · 44 TPS
Qwen: Qwen3 Coder Next1115ms TTFT · 25 TPS
AllenAI: Olmo 3 7B Think183ms TTFT · 78 TPS
Google: Gemma 4 26B A4B 1019ms TTFT · 19 TPS
Qwen: Qwen3 235B A22B Instruct 2507494ms TTFT · 31 TPS
DeepSeek: DeepSeek V4 Flash536ms TTFT · 25 TPS
AllenAI: Olmo 3.1 32B Think244ms TTFT · 81 TPS
AllenAI: Olmo 3 32B Think263ms TTFT · 79 TPS
Qwen: Qwen3.6 35B A3B538ms TTFT · 89 TPS
Google: Gemma 4 31B1017ms TTFT · 22 TPS
Qwen: Qwen3.5-35B-A3B840ms TTFT · 97 TPS
AllenAI: Molmo2 8B283ms TTFT · 39 TPS
Qwen: Qwen3 VL 235B A22B Instruct4218ms TTFT · 16 TPS
Meta: Llama 3.3 70B Instruct600ms TTFT · 40 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| OpenAI: gpt-oss-20b | $0.04 | $0.20 | 330ms | 204 |
| Google: Gemma 3 27B | $0.08 | $0.45 | 628ms | 37 |
| Mistral: Mistral Small 3.2 24B | $0.09 | $0.30 | 591ms | 35 |
| Qwen: Qwen3 Next 80B A3B Instruct | $0.10 | $1.10 | 3058ms | 40 |
| ByteDance: UI-TARS 7B | $0.10 | $0.20 | 769ms | 21 |
| OpenAI: gpt-oss-120b | $0.10 | $0.75 | 325ms | 132 |
| AllenAI: Olmo 3 7B Instruct | $0.10 | $0.20 | 187ms | 44 |
| Qwen: Qwen3 Coder Next | $0.12 | $0.80 | 1115ms | 25 |
| AllenAI: Olmo 3 7B Think | $0.12 | $0.20 | 183ms | 78 |
| Google: Gemma 4 26B A4B | $0.13 | $0.40 | 1019ms | 19 |
| Qwen: Qwen3 235B A22B Instruct 2507 | $0.14 | $0.80 | 494ms | 31 |
| DeepSeek: DeepSeek V4 Flash | $0.14 | $0.28 | 536ms | 25 |
| AllenAI: Olmo 3.1 32B Think | $0.15 | $0.50 | 244ms | 81 |
| AllenAI: Olmo 3 32B Think | $0.15 | $0.50 | 263ms | 79 |
| Qwen: Qwen3.6 35B A3B | $0.15 | $1.00 | 538ms | 89 |
| Google: Gemma 4 31B | $0.15 | $0.40 | 1017ms | 22 |
| Qwen: Qwen3.5-35B-A3B | $0.15 | $1.00 | 840ms | 97 |
| AllenAI: Molmo2 8B | $0.20 | $0.20 | 283ms | 39 |
| Qwen: Qwen3 VL 235B A22B Instruct | $0.21 | $1.90 | 4218ms | 16 |
| Meta: Llama 3.3 70B Instruct | $0.22 | $0.50 | 600ms | 40 |
| Qwen: Qwen3 VL 8B Instruct | $0.25 | $0.75 | 793ms | 44 |
| TheDrummer: Cydonia 24B V4.1 | $0.30 | $0.50 | 966ms | 30 |
| MiniMax: MiniMax M3 | $0.30 | $1.20 | 1370ms | 22 |
| MiniMax: MiniMax M2.5 | $0.30 | $1.20 | 457ms | 118 |
| Meta: Llama 4 Maverick | $0.35 | $1.00 | 431ms | 53 |
| Qwen: Qwen3.5 397B A17B | $0.50 | $3.60 | 433ms | 86 |
| TheDrummer: Skyfall 36B V2 | $0.55 | $0.80 | — | — |
| MoonshotAI: Kimi K2.7 Code | $0.75 | $3.50 | 1683ms | 39 |
| MoonshotAI: Kimi K2.6 | $0.75 | $3.50 | 716ms | 68 |
| Qwen: Qwen2.5 VL 72B Instruct | $0.80 | $1.00 | 1440ms | 16 |
| Z.ai: GLM 5 | $1.00 | $3.20 | 699ms | 34 |
| Z.ai: GLM 5.2 | $1.40 | $4.40 | 7277ms | 9 |
| Z.ai: GLM 5.1 | $1.40 | $4.40 | 1017ms | 53 |
| DeepSeek: DeepSeek V4 Pro | $1.74 | $3.48 | 479ms | 12 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.