LIVE · Cheapest H100: $2.49/hr · Updated: 05:18 PM

Together

Aggregated · Inference
Uptime: N/A
Rating: N/A

30-Day Uptime

100%
2026-03-23 to 2026-04-21

Inference Latency

LiquidAI: LFM2-24B-A2B | 141ms TTFT · 72 TPS
OpenAI: gpt-oss-20b | 228ms TTFT · 146 TPS
Google: Gemma 3n 4B | 1587ms TTFT · 6 TPS
Meta: Llama 3 8B Instruct | 869ms TTFT · 28 TPS
Qwen: Qwen3.5-9B | 383ms TTFT · 17 TPS
EssentialAI: Rnj 1 Instruct | 494ms TTFT · 89 TPS
OpenAI: gpt-oss-120b | 432ms TTFT · 51 TPS
Google: Gemma 4 31B | 373ms TTFT · 9 TPS
Qwen: Qwen3 235B A22B Instruct 2507 | 2040ms TTFT · 12 TPS
Mistral: Mistral 7B Instruct v0.2 | 315ms TTFT · 78 TPS
Meta: Llama Guard 4 12B | 140ms TTFT · 17 TPS
MiniMax: MiniMax M2.7 | 2085ms TTFT · 25 TPS
MiniMax: MiniMax M2.5 | 1038ms TTFT · 43 TPS
Qwen: Qwen2.5 7B Instruct | 269ms TTFT · 53 TPS
Qwen: Qwen3 Coder Next | 884ms TTFT · 58 TPS
MoonshotAI: Kimi K2.5 | 476ms TTFT · 51 TPS
Qwen: Qwen3.5 397B A17B | 1120ms TTFT · 56 TPS
DeepSeek: DeepSeek V3.1 | 739ms TTFT · 10 TPS
Meta: Llama 3.3 70B Instruct | 732ms TTFT · 21 TPS
Z.ai: GLM 5 | 1113ms TTFT · 50 TPS

Inference Models

Model | Input $/M | Output $/M | TTFT | TPS
LiquidAI: LFM2-24B-A2B | $0.03 | $0.12 | 141ms | 72
OpenAI: gpt-oss-20b | $0.05 | $0.20 | 228ms | 146
Google: Gemma 3n 4B | $0.06 | $0.12 | 1587ms | 6
Meta: Llama 3 8B Instruct | $0.10 | $0.10 | 869ms | 28
Qwen: Qwen3.5-9B | $0.10 | $0.15 | 383ms | 17
EssentialAI: Rnj 1 Instruct | $0.15 | $0.15 | 494ms | 89
OpenAI: gpt-oss-120b | $0.15 | $0.60 | 432ms | 51
Arcee AI: Spotlight | $0.18 | $0.18 | |
Google: Gemma 4 31B | $0.20 | $0.50 | 373ms | 9
Qwen: Qwen3 235B A22B Instruct 2507 | $0.20 | $0.60 | 2040ms | 12
Mistral: Mistral 7B Instruct v0.2 | $0.20 | $0.20 | 315ms | 78
Meta: Llama Guard 4 12B | $0.20 | $0.20 | 140ms | 17
Meta: LlamaGuard 2 8B | $0.20 | $0.20 | |
Mistral: Mistral 7B Instruct v0.3 | $0.20 | $0.20 | |
Mistral: Mistral 7B Instruct | $0.20 | $0.20 | |
MiniMax: MiniMax M2.7 | $0.30 | $1.20 | 2085ms | 25
MiniMax: MiniMax M2.5 | $0.30 | $1.20 | 1038ms | 43
Qwen: Qwen2.5 7B Instruct | $0.30 | $0.30 | 269ms | 53
Qwen: Qwen3 Coder Next | $0.50 | $1.20 | 884ms | 58
MoonshotAI: Kimi K2.5 | $0.50 | $2.80 | 476ms | 51
Arcee AI: Coder Large | $0.50 | $0.80 | |
Qwen: Qwen3.5 397B A17B | $0.60 | $3.60 | 1120ms | 56
DeepSeek: DeepSeek V3.1 | $0.60 | $1.70 | 739ms | 10
Arcee AI: Virtuoso Large | $0.75 | $1.20 | |
Meta: Llama 3.3 70B Instruct | $0.88 | $0.88 | 732ms | 21
Arcee AI: Maestro Reasoning | $0.90 | $3.30 | |
Z.ai: GLM 5 | $1.00 | $3.20 | 1113ms | 50
Deep Cogito: Cogito v2.1 671B | $1.25 | $1.25 | 451ms | 32
Z.ai: GLM 5.1 | $1.40 | $4.40 | 1158ms | 40
Qwen: Qwen3 Coder 480B A35B | $2.00 | $2.00 | |
DeepSeek: R1 0528 | $3.00 | $7.00 | 925ms | 88
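
As a rough guide to reading the pricing and latency columns, the sketch below estimates the cost and response time of a single request. It rests on assumptions the page does not state: that "$/M" means US dollars per one million tokens, that TTFT is the time to first token, that TPS is the output token rate, and that total response time is roughly TTFT plus output tokens divided by TPS. The names ModelQuote, request_cost_usd and estimated_latency_s are hypothetical helpers for illustration, not part of any provider API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelQuote:
    """One row of the listing (field meanings are assumptions, see above)."""
    name: str
    input_usd_per_m: float   # assumed: USD per 1,000,000 input tokens
    output_usd_per_m: float  # assumed: USD per 1,000,000 output tokens
    ttft_ms: float           # assumed: time to first token, in milliseconds
    tps: float               # assumed: output tokens generated per second


def request_cost_usd(q: ModelQuote, input_tokens: int, output_tokens: int) -> float:
    """Estimated cost of one request in USD."""
    total = input_tokens * q.input_usd_per_m + output_tokens * q.output_usd_per_m
    return total / 1_000_000


def estimated_latency_s(q: ModelQuote, output_tokens: int) -> float:
    """Rough end-to-end time: wait for the first token, then stream the rest."""
    return q.ttft_ms / 1000 + output_tokens / q.tps


# Values copied from the "OpenAI: gpt-oss-20b" row of the listing.
gpt_oss_20b = ModelQuote("OpenAI: gpt-oss-20b", 0.05, 0.20, 228, 146)

if __name__ == "__main__":
    cost = request_cost_usd(gpt_oss_20b, input_tokens=2000, output_tokens=500)
    latency = estimated_latency_s(gpt_oss_20b, output_tokens=500)
    print(f"{gpt_oss_20b.name}: ~${cost:.6f} per request, ~{latency:.2f}s to finish")
```

The listed figures are point-in-time measurements, so the result is a ballpark for comparing rows rather than a guarantee.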

Community Reviews

4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★ 2025-06-10

Good performance but support could be faster.