
Google

Aggregated · Inference
Uptime: N/A
Rating: N/A

30-Day Uptime

100% (2026-03-23 to 2026-04-21)

Inference Latency

OpenAI: gpt-oss-20b | 343ms TTFT · 186 TPS
Google: Gemini 2.0 Flash Lite | 450ms TTFT · 74 TPS
OpenAI: gpt-oss-120b | 418ms TTFT · 62 TPS
Google: Gemini 2.0 Flash | 575ms TTFT · 65 TPS
Google: Gemini 2.5 Flash Lite Preview 09-2025 | 418ms TTFT · 124 TPS
Google: Gemini 2.5 Flash Lite | 395ms TTFT · 135 TPS
Qwen: Qwen3 Next 80B A3B Instruct | 460ms TTFT · 152 TPS
Qwen: Qwen3 Next 80B A3B Thinking | 519ms TTFT · 125 TPS
Qwen: Qwen3 Coder 480B A35B | 428ms TTFT · 11 TPS
Qwen: Qwen3 Coder 480B A35B (exacto) | 1707ms TTFT · 45 TPS
Google: Gemini 3.1 Flash Lite Preview | 1359ms TTFT · 73 TPS
Qwen: Qwen3 235B A22B Instruct 2507 | 267ms TTFT · 63 TPS
Meta: Llama 4 Scout | 2422ms TTFT · 81 TPS
Google: Gemini 2.5 Flash | 1905ms TTFT · 41 TPS
Google: Nano Banana (Gemini 2.5 Flash Image) | 8293ms TTFT · 133 TPS
MiniMax: MiniMax M2 | 363ms TTFT · 71 TPS
Google: Gemini 2.5 Flash | 840ms TTFT · 62 TPS
Google: Gemini 3 Flash Preview | 1328ms TTFT · 57 TPS
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview) | 18183ms TTFT · 75 TPS
DeepSeek: DeepSeek V3.2 | 893ms TTFT · 22 TPS

Inference Models

Model | Input $/M tokens | Output $/M tokens | TTFT | TPS
OpenAI: gpt-oss-20b | $0.07 | $0.25 | 343ms | 186
Google: Gemini 2.0 Flash Lite | $0.08 | $0.30 | 450ms | 74
OpenAI: gpt-oss-120b | $0.09 | $0.36 | 418ms | 62
Google: Gemini 2.0 Flash | $0.10 | $0.40 | 575ms | 65
Google: Gemini 2.5 Flash Lite Preview 09-2025 | $0.10 | $0.40 | 418ms | 124
Google: Gemini 2.5 Flash Lite | $0.10 | $0.40 | 395ms | 135
Qwen: Qwen3 Next 80B A3B Instruct | $0.15 | $1.20 | 460ms | 152
Qwen: Qwen3 Next 80B A3B Thinking | $0.15 | $1.20 | 519ms | 125
Qwen: Qwen3 Coder 480B A35B | $0.22 | $1.80 | 428ms | 11
Qwen: Qwen3 Coder 480B A35B (exacto) | $0.22 | $1.80 | 1707ms | 45
Google: Gemini 3.1 Flash Lite Preview | $0.25 | $1.50 | 1359ms | 73
Qwen: Qwen3 235B A22B Instruct 2507 | $0.25 | $1.00 | 267ms | 63
Anthropic: Claude 3 Haiku | $0.25 | $1.25 | |
Meta: Llama 4 Scout | $0.25 | $0.70 | 2422ms | 81
Google: Gemini 2.5 Flash | $0.30 | $2.50 | 1905ms | 41
Google: Nano Banana (Gemini 2.5 Flash Image) | $0.30 | $2.50 | 8293ms | 133
MiniMax: MiniMax M2 | $0.30 | $1.20 | 363ms | 71
Google: Gemini 2.5 Flash | $0.30 | $2.50 | 840ms | 62
Google: Gemini 3 Flash Preview | $0.50 | $3.00 | 1328ms | 57
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview) | $0.50 | $3.00 | 18183ms | 75
DeepSeek: DeepSeek V3.2 | $0.56 | $1.68 | 893ms | 22
MoonshotAI: Kimi K2 Thinking | $0.60 | $2.50 | 4345ms | 30
DeepSeek: DeepSeek V3.1 | $0.60 | $1.70 | 768ms | 81
Z.ai: GLM 4.7 | $0.60 | $2.20 | 1561ms | 86
Meta: Llama 3.3 70B Instruct | $0.72 | $0.72 | 262ms | 36
Anthropic: Claude 3.5 Haiku | $0.80 | $4.00 | |
Anthropic: Claude Haiku 4.5 | $1.00 | $5.00 | 564ms | 77
Google: Gemini 2.5 Pro | $1.25 | $10.00 | 2898ms | 84
Google: Gemini 2.5 Pro Preview 05-06 | $1.25 | $10.00 | 2898ms | 84
Google: Gemini 2.5 Pro Preview 06-05 | $1.25 | $10.00 | 2638ms | 74
Google: Gemini 2.5 Pro | $1.25 | $10.00 | 2570ms | 73
Google: Gemini 2.5 Pro Preview 06-05 | $1.25 | $10.00 | 2892ms | 84
Google: Gemini 2.5 Pro Preview 05-06 | $1.25 | $10.00 | 2570ms | 73
Google: Gemini 3 Pro Preview | $2.00 | $12.00 | 6924ms | 66
Google: Nano Banana Pro (Gemini 3 Pro Image Preview) | $2.00 | $12.00 | 34803ms | 36
Google: Gemini 3.1 Pro Preview | $2.00 | $12.00 | 7025ms | 63
Anthropic: Claude Sonnet 4.5 | $3.00 | $15.00 | 1662ms | 36
Anthropic: Claude 3.7 Sonnet | $3.00 | $15.00 | |
Anthropic: Claude Sonnet 4 | $3.00 | $15.00 | |
Anthropic: Claude 3.7 Sonnet | $3.00 | $15.00 | |
Anthropic: Claude Sonnet 4.6 | $3.00 | $15.00 | 1478ms | 37
Anthropic: Claude Sonnet 4.6 | $3.00 | $15.00 | 1125ms | 45
Anthropic: Claude Sonnet 4 | $3.00 | $15.00 | 810ms | 36
Anthropic: Claude Sonnet 4.5 | $3.00 | $15.00 | 1439ms | 36
Anthropic: Claude 3.7 Sonnet (thinking) | $3.00 | $15.00 | 1001ms | 50
Anthropic: Claude 3.7 Sonnet | $3.00 | $15.00 | 667ms | 44
Anthropic: Claude Sonnet 4 | $3.00 | $15.00 | |
Anthropic: Claude Opus 4.7 | $5.00 | $25.00 | 2181ms | 53
Anthropic: Claude Opus 4.5 | $5.00 | $25.00 | 1107ms | 41
Anthropic: Claude Opus 4.6 | $5.00 | $25.00 | 1667ms | 39
Anthropic: Claude Opus 4.1 | $15.00 | $75.00 | |
Anthropic: Claude Opus 4 | $15.00 | $75.00 | |
Anthropic: Claude Opus 4.1 | $15.00 | $75.00 | |
Anthropic: Claude Opus 4.1 | $15.00 | $75.00 | |
Anthropic: Claude Opus 4 | $15.00 | $75.00 | 6488ms | 17
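
The columns above can be combined into a rough per-request estimate. The sketch below is illustrative only and rests on assumptions the page does not state: that "$/M" means US dollars per million tokens, that TPS is output tokens per second, and that total latency is roughly TTFT plus generation time. The function name `estimate_request` and the token counts are hypothetical.

```python
def estimate_request(input_price_per_m, output_price_per_m,
                     ttft_ms, tps, input_tokens, output_tokens):
    """Return (cost in USD, approximate latency in seconds) for one request.

    Assumes prices are USD per million tokens and TPS is output
    tokens per second; both are readings of the table, not guarantees.
    """
    cost = (input_tokens / 1_000_000) * input_price_per_m \
         + (output_tokens / 1_000_000) * output_price_per_m
    # Simplified model: time to first token, then steady generation.
    latency_s = ttft_ms / 1000 + output_tokens / tps
    return cost, latency_s


# Example using the listed gpt-oss-20b row: $0.07 in, $0.25 out, 343ms, 186 TPS.
# The request size (10,000 input tokens, 1,000 output tokens) is made up.
cost, latency_s = estimate_request(0.07, 0.25, 343, 186, 10_000, 1_000)
print(f"cost = ${cost:.5f}, latency = {latency_s:.2f}s")
```

Real latency also depends on prompt length, load, and network conditions, so treat the result as a rough comparison aid across rows, not a prediction.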

Community Reviews

4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★ 2025-06-10

Good performance but support could be faster.