T
Together
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
100%2026-03-232026-04-21
Inference Latency
LiquidAI: LFM2-24B-A2B141ms TTFT · 72 TPS
OpenAI: gpt-oss-20b228ms TTFT · 146 TPS
Google: Gemma 3n 4B1587ms TTFT · 6 TPS
Meta: Llama 3 8B Instruct869ms TTFT · 28 TPS
Qwen: Qwen3.5-9B383ms TTFT · 17 TPS
EssentialAI: Rnj 1 Instruct494ms TTFT · 89 TPS
OpenAI: gpt-oss-120b432ms TTFT · 51 TPS
Google: Gemma 4 31B373ms TTFT · 9 TPS
Qwen: Qwen3 235B A22B Instruct 25072040ms TTFT · 12 TPS
Mistral: Mistral 7B Instruct v0.2315ms TTFT · 78 TPS
Meta: Llama Guard 4 12B140ms TTFT · 17 TPS
MiniMax: MiniMax M2.72085ms TTFT · 25 TPS
MiniMax: MiniMax M2.51038ms TTFT · 43 TPS
Qwen: Qwen2.5 7B Instruct269ms TTFT · 53 TPS
Qwen: Qwen3 Coder Next884ms TTFT · 58 TPS
MoonshotAI: Kimi K2.5476ms TTFT · 51 TPS
Qwen: Qwen3.5 397B A17B1120ms TTFT · 56 TPS
DeepSeek: DeepSeek V3.1739ms TTFT · 10 TPS
Meta: Llama 3.3 70B Instruct732ms TTFT · 21 TPS
Z.ai: GLM 51113ms TTFT · 50 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| LiquidAI: LFM2-24B-A2B | $0.03 | $0.12 | 141ms | 72 |
| OpenAI: gpt-oss-20b | $0.05 | $0.20 | 228ms | 146 |
| Google: Gemma 3n 4B | $0.06 | $0.12 | 1587ms | 6 |
| Meta: Llama 3 8B Instruct | $0.10 | $0.10 | 869ms | 28 |
| Qwen: Qwen3.5-9B | $0.10 | $0.15 | 383ms | 17 |
| EssentialAI: Rnj 1 Instruct | $0.15 | $0.15 | 494ms | 89 |
| OpenAI: gpt-oss-120b | $0.15 | $0.60 | 432ms | 51 |
| Arcee AI: Spotlight | $0.18 | $0.18 | — | — |
| Google: Gemma 4 31B | $0.20 | $0.50 | 373ms | 9 |
| Qwen: Qwen3 235B A22B Instruct 2507 | $0.20 | $0.60 | 2040ms | 12 |
| Mistral: Mistral 7B Instruct v0.2 | $0.20 | $0.20 | 315ms | 78 |
| Meta: Llama Guard 4 12B | $0.20 | $0.20 | 140ms | 17 |
| Meta: LlamaGuard 2 8B | $0.20 | $0.20 | — | — |
| Mistral: Mistral 7B Instruct v0.3 | $0.20 | $0.20 | — | — |
| Mistral: Mistral 7B Instruct | $0.20 | $0.20 | — | — |
| MiniMax: MiniMax M2.7 | $0.30 | $1.20 | 2085ms | 25 |
| MiniMax: MiniMax M2.5 | $0.30 | $1.20 | 1038ms | 43 |
| Qwen: Qwen2.5 7B Instruct | $0.30 | $0.30 | 269ms | 53 |
| Qwen: Qwen3 Coder Next | $0.50 | $1.20 | 884ms | 58 |
| MoonshotAI: Kimi K2.5 | $0.50 | $2.80 | 476ms | 51 |
| Arcee AI: Coder Large | $0.50 | $0.80 | — | — |
| Qwen: Qwen3.5 397B A17B | $0.60 | $3.60 | 1120ms | 56 |
| DeepSeek: DeepSeek V3.1 | $0.60 | $1.70 | 739ms | 10 |
| Arcee AI: Virtuoso Large | $0.75 | $1.20 | — | — |
| Meta: Llama 3.3 70B Instruct | $0.88 | $0.88 | 732ms | 21 |
| Arcee AI: Maestro Reasoning | $0.90 | $3.30 | — | — |
| Z.ai: GLM 5 | $1.00 | $3.20 | 1113ms | 50 |
| Deep Cogito: Cogito v2.1 671B | $1.25 | $1.25 | 451ms | 32 |
| Z.ai: GLM 5.1 | $1.40 | $4.40 | 1158ms | 40 |
| Qwen: Qwen3 Coder 480B A35B | $2.00 | $2.00 | — | — |
| DeepSeek: R1 0528 | $3.00 | $7.00 | 925ms | 88 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.