LIVE
Models: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 04:34 PMModels: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 04:34 PM
Marketplace
Providers Models
G

Google

AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating

30-Day Uptime

100%
2026-05-222026-06-20

Inference Latency

OpenAI: gpt-oss-20b804ms TTFT · 23 TPS
Google: Gemini 2.0 Flash Lite423ms TTFT · 90 TPS
OpenAI: gpt-oss-120b380ms TTFT · 102 TPS
Google: Gemini 2.5 Flash Lite Preview 09-2025373ms TTFT · 142 TPS
Google: Gemini 2.5 Flash Lite563ms TTFT · 74 TPS
Google: Gemini 2.5 Flash Lite552ms TTFT · 69 TPS
Google: Gemini 2.0 Flash499ms TTFT · 37 TPS
Qwen: Qwen3 Next 80B A3B Instruct853ms TTFT · 62 TPS
Google: Gemma 4 26B A4B 353ms TTFT · 57 TPS
Qwen: Qwen3 Coder 480B A35B (exacto)1707ms TTFT · 45 TPS
Qwen: Qwen3 Coder 480B A35B1474ms TTFT · 25 TPS
Google: Gemini 3.1 Flash Lite984ms TTFT · 106 TPS
Qwen: Qwen3 235B A22B Instruct 2507425ms TTFT · 63 TPS
Meta: Llama 4 Scout439ms TTFT · 82 TPS
Google: Gemini 3.1 Flash Lite Preview655ms TTFT · 97 TPS
Google: Gemini 2.5 Flash767ms TTFT · 84 TPS
MiniMax: MiniMax M2363ms TTFT · 125 TPS
Google: Nano Banana (Gemini 2.5 Flash Image)7172ms TTFT · 169 TPS
Google: Gemini 2.5 Flash1895ms TTFT · 72 TPS
Google: Gemini 2.5 Flash653ms TTFT · 81 TPS

Inference Models

ModelInput $/MOutput $/MTTFTTPS
OpenAI: gpt-oss-20b$0.07$0.25804ms23
Google: Gemini 2.0 Flash Lite$0.08$0.30423ms90
OpenAI: gpt-oss-120b$0.09$0.36380ms102
Google: Gemini 2.5 Flash Lite Preview 09-2025$0.10$0.40373ms142
Google: Gemini 2.5 Flash Lite$0.10$0.40563ms74
Google: Gemini 2.5 Flash Lite$0.10$0.40552ms69
Google: Gemini 2.0 Flash$0.10$0.40499ms37
Qwen: Qwen3 Next 80B A3B Instruct$0.15$1.20853ms62
Qwen: Qwen3 Next 80B A3B Thinking$0.15$1.20
Google: Gemma 4 26B A4B $0.15$0.60353ms57
Qwen: Qwen3 Coder 480B A35B (exacto)$0.22$1.801707ms45
Qwen: Qwen3 Coder 480B A35B$0.22$1.801474ms25
Google: Gemini 3.1 Flash Lite$0.25$1.50984ms106
Qwen: Qwen3 235B A22B Instruct 2507$0.25$1.00425ms63
Meta: Llama 4 Scout$0.25$0.70439ms82
Google: Gemini 3.1 Flash Lite Preview$0.25$1.50655ms97
Google: Gemini 2.5 Flash$0.30$2.50767ms84
MiniMax: MiniMax M2$0.30$1.20363ms125
Google: Nano Banana (Gemini 2.5 Flash Image)$0.30$2.507172ms169
Google: Gemini 2.5 Flash$0.30$2.501895ms72
Google: Gemini 2.5 Flash$0.30$2.50653ms81
Meta: Llama 4 Maverick$0.35$1.15760ms62
Google: Gemini 3 Flash Preview$0.50$3.001098ms68
Google: Nano Banana 2 (Gemini 3.1 Flash Image)$0.50$3.0010814ms91
Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)$0.50$3.0010797ms80
DeepSeek: DeepSeek V3.2$0.56$1.68981ms17
MoonshotAI: Kimi K2 Thinking$0.60$2.50449ms193
Z.ai: GLM 4.7$0.60$2.20485ms159
DeepSeek: DeepSeek V3.1$0.60$1.70520ms106
Meta: Llama 3.3 70B Instruct$0.72$0.72267ms52
Meta: Llama 3.3 70B Instruct$0.72$0.72164ms35
Anthropic: Claude Haiku 4.5$1.00$5.00407ms90
Anthropic: Claude Haiku 4.5$1.00$5.00904ms92
Google: Gemini 2.5 Pro Preview 06-05$1.25$10.002094ms106
Google: Gemini 2.5 Pro$1.25$10.002078ms85
Google: Gemini 2.5 Pro$1.25$10.001083ms72
Google: Gemini 2.5 Pro Preview 05-06$1.25$10.002094ms106
Google: Gemini 2.5 Pro Preview 05-06$1.25$10.001083ms72
Google: Gemini 2.5 Pro Preview 06-05$1.25$10.001083ms72
Google: Gemini 2.5 Pro$1.25$10.002094ms106
Google: Gemini 2.5 Pro Preview 06-05$1.25$10.002078ms85
Google: Gemini 2.5 Pro Preview 05-06$1.25$10.002078ms85
Google: Gemini 3.5 Flash$1.50$9.001606ms132
Google: Nano Banana Pro (Gemini 3 Pro Image Preview)$2.00$12.0020103ms61
Google: Gemini 3.1 Pro Preview$2.00$12.002860ms88
Google: Gemini 3 Pro Preview$2.00$12.006924ms66
Google: Nano Banana Pro (Gemini 3 Pro Image)$2.00$12.0018660ms64
Anthropic: Claude Sonnet 4.6$3.00$15.001674ms26
Anthropic: Claude Sonnet 4.6$3.00$15.001317ms36
Anthropic: Claude Sonnet 4$3.00$15.001012ms39
Anthropic: Claude 3.7 Sonnet$3.00$15.00
Anthropic: Claude Sonnet 4.6$3.00$15.001232ms40
Anthropic: Claude Sonnet 4$3.00$15.00
Anthropic: Claude Sonnet 4.5$3.00$15.00844ms36
Anthropic: Claude 3.7 Sonnet$3.00$15.00859ms44
Anthropic: Claude 3.7 Sonnet (thinking)$3.00$15.001374ms49
Anthropic: Claude Sonnet 4$3.00$15.00803ms30
Anthropic: Claude Sonnet 4.5$3.00$15.001482ms37
Anthropic: Claude Opus 4.7$5.00$25.002337ms65
Anthropic: Claude Opus 4.5$5.00$25.00749ms45
Anthropic: Claude Opus 4.8$5.00$25.00
Anthropic: Claude Opus 4.6$5.00$25.002110ms43
Anthropic: Claude Opus 4.6$5.00$25.00
Anthropic: Claude Opus 4.8$5.00$25.001406ms46
Anthropic: Claude Opus 4.7$5.00$25.001322ms51
Anthropic: Claude Fable 5$10.00$50.00
Anthropic: Claude Opus 4.1$15.00$75.006001ms21
Anthropic: Claude Opus 4$15.00$75.001968ms21

Community Reviews

4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★2025-06-10

Good performance but support could be faster.