A
Azure
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
93.3%2026-03-232026-04-21
Inference Latency
OpenAI: GPT-5 Nano21570ms TTFT · 105 TPS
OpenAI: GPT-4.1 Nano1632ms TTFT · 30 TPS
OpenAI: GPT-4o-mini726ms TTFT · 36 TPS
OpenAI: GPT-5.4 Nano1811ms TTFT · 29 TPS
OpenAI: GPT-5.1-Codex-Mini3615ms TTFT · 36 TPS
OpenAI: GPT-5 Mini20825ms TTFT · 34 TPS
OpenAI: GPT-4.1 Mini1149ms TTFT · 50 TPS
OpenAI: GPT-5.4 Mini1383ms TTFT · 52 TPS
OpenAI: GPT-3.5 Turbo (older v0613)1018ms TTFT · 49 TPS
OpenAI: GPT-5.1 Chat3140ms TTFT · 32 TPS
OpenAI: GPT-513950ms TTFT · 56 TPS
OpenAI: GPT-5.1-Codex2300ms TTFT · 15 TPS
OpenAI: GPT-5.11574ms TTFT · 33 TPS
DeepSeek: R11938ms TTFT · 49 TPS
OpenAI: GPT-5.3-Codex9103ms TTFT · 50 TPS
OpenAI: GPT-5.2 Chat3166ms TTFT · 24 TPS
OpenAI: GPT-5.3 Chat2265ms TTFT · 26 TPS
OpenAI: GPT-5.2-Codex10413ms TTFT · 31 TPS
OpenAI: GPT-5.21366ms TTFT · 14 TPS
OpenAI: GPT-4.11274ms TTFT · 49 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| OpenAI: GPT-5 Nano | $0.05 | $0.40 | 21570ms | 105 |
| OpenAI: GPT-4.1 Nano | $0.10 | $0.40 | 1632ms | 30 |
| OpenAI: GPT-4o-mini | $0.15 | $0.60 | 726ms | 36 |
| OpenAI: GPT-5.4 Nano | $0.20 | $1.25 | 1811ms | 29 |
| OpenAI: GPT-5.1-Codex-Mini | $0.25 | $2.00 | 3615ms | 36 |
| OpenAI: GPT-5 Mini | $0.25 | $2.00 | 20825ms | 34 |
| OpenAI: GPT-4.1 Mini | $0.40 | $1.60 | 1149ms | 50 |
| OpenAI: GPT-5.4 Mini | $0.75 | $4.50 | 1383ms | 52 |
| OpenAI: GPT-3.5 Turbo (older v0613) | $1.00 | $2.00 | 1018ms | 49 |
| OpenAI: GPT-5.1 Chat | $1.25 | $10.00 | 3140ms | 32 |
| OpenAI: GPT-5 | $1.25 | $10.00 | 13950ms | 56 |
| OpenAI: GPT-5.1-Codex-Max | $1.25 | $10.00 | — | — |
| OpenAI: GPT-5.1-Codex | $1.25 | $10.00 | 2300ms | 15 |
| OpenAI: GPT-5.1 | $1.25 | $10.00 | 1574ms | 33 |
| DeepSeek: R1 | $1.49 | $5.94 | 1938ms | 49 |
| OpenAI: GPT-5.3-Codex | $1.75 | $14.00 | 9103ms | 50 |
| OpenAI: GPT-5.2 Chat | $1.75 | $14.00 | 3166ms | 24 |
| OpenAI: GPT-5.3 Chat | $1.75 | $14.00 | 2265ms | 26 |
| OpenAI: GPT-5.2-Codex | $1.75 | $14.00 | 10413ms | 31 |
| OpenAI: GPT-5.2 | $1.75 | $14.00 | 1366ms | 14 |
| OpenAI: GPT-4.1 | $2.00 | $8.00 | 1274ms | 49 |
| OpenAI: GPT-4o (2024-08-06) | $2.50 | $10.00 | 1301ms | 75 |
| OpenAI: GPT-5.4 | $2.50 | $15.00 | 1778ms | 36 |
| OpenAI: GPT-4o | $2.50 | $10.00 | 885ms | 33 |
| Anthropic: Claude Sonnet 4.6 | $3.00 | $15.00 | 2215ms | 32 |
| OpenAI: GPT-3.5 Turbo 16k | $3.00 | $4.00 | — | — |
| OpenAI: GPT-4o (2024-05-13) | $5.00 | $15.00 | — | — |
| Anthropic: Claude Opus 4.6 | $5.00 | $25.00 | 2645ms | 34 |
| OpenAI: GPT-4 | $30.00 | $60.00 | 672ms | 63 |
| OpenAI: GPT-5.4 Pro | $30.00 | $180.00 | 48148ms | 5 |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.