I
Inceptron
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
96.7%2026-03-232026-04-21
Inference Latency
Meta: Llama 3.3 70B Instruct1293ms TTFT · 9 TPS
MiniMax: MiniMax M2.5514ms TTFT · 45 TPS
MoonshotAI: Kimi K2.5469ms TTFT · 38 TPS
Z.ai: GLM 5.1701ms TTFT · 28 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| Meta: Llama 3.3 70B Instruct | $0.12 | $0.38 | 1293ms | 9 |
| MiniMax: MiniMax M2.5 | $0.24 | $0.90 | 514ms | 45 |
| MoonshotAI: Kimi K2.5 | $0.44 | $2.20 | 469ms | 38 |
| Z.ai: GLM 5.1 | $1.40 | $4.40 | 701ms | 28 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.