A
Azure
AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating
30-Day Uptime
100%2026-05-222026-06-20
Inference Latency
OpenAI: GPT-5 Nano1145ms TTFT · 71 TPS
OpenAI: GPT-4.1 Nano792ms TTFT · 164 TPS
OpenAI: GPT-4o-mini748ms TTFT · 25 TPS
OpenAI: GPT-5.4 Nano1157ms TTFT · 52 TPS
OpenAI: GPT-5 Mini2726ms TTFT · 67 TPS
OpenAI: GPT-5.1-Codex-Mini4291ms TTFT · 136 TPS
OpenAI: GPT-4.1 Mini748ms TTFT · 69 TPS
OpenAI: GPT-5.4 Mini740ms TTFT · 44 TPS
OpenAI: GPT-3.5 Turbo (older v0613)478ms TTFT · 29 TPS
OpenAI: GPT-5.11172ms TTFT · 60 TPS
OpenAI: GPT-51687ms TTFT · 37 TPS
DeepSeek: R11687ms TTFT · 71 TPS
OpenAI: GPT-5.3-Codex2703ms TTFT · 74 TPS
OpenAI: GPT-5.21980ms TTFT · 31 TPS
OpenAI: GPT-5.2 Chat1975ms TTFT · 56 TPS
OpenAI: GPT-4.11133ms TTFT · 53 TPS
OpenAI: GPT-4o (2024-08-06)618ms TTFT · 3 TPS
OpenAI: GPT-5.44242ms TTFT · 53 TPS
OpenAI: GPT-4o833ms TTFT · 21 TPS
OpenAI: GPT-3.5 Turbo 16k567ms TTFT · 16 TPS
Inference Models
| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| OpenAI: GPT-5 Nano | $0.05 | $0.40 | 1145ms | 71 |
| OpenAI: GPT-5 Nano | $0.05 | $0.40 | — | — |
| OpenAI: GPT-4.1 Nano | $0.10 | $0.40 | — | — |
| OpenAI: GPT-4.1 Nano | $0.10 | $0.40 | 792ms | 164 |
| OpenAI: GPT-4o-mini | $0.15 | $0.60 | — | — |
| OpenAI: GPT-4o-mini | $0.15 | $0.60 | 748ms | 25 |
| OpenAI: GPT-5.4 Nano | $0.20 | $1.25 | 1157ms | 52 |
| OpenAI: GPT-5 Mini | $0.25 | $2.00 | 2726ms | 67 |
| OpenAI: GPT-5 Mini | $0.25 | $2.00 | — | — |
| OpenAI: GPT-5.1-Codex-Mini | $0.25 | $2.00 | 4291ms | 136 |
| OpenAI: GPT-4.1 Mini | $0.40 | $1.60 | 748ms | 69 |
| OpenAI: GPT-4.1 Mini | $0.40 | $1.60 | — | — |
| OpenAI: GPT-5.4 Mini | $0.75 | $4.50 | 740ms | 44 |
| OpenAI: GPT-3.5 Turbo (older v0613) | $1.00 | $2.00 | 478ms | 29 |
| OpenAI: GPT-5.1-Codex-Max | $1.25 | $10.00 | — | — |
| OpenAI: GPT-5.1 | $1.25 | $10.00 | 1172ms | 60 |
| OpenAI: GPT-5.1 | $1.25 | $10.00 | — | — |
| OpenAI: GPT-5 | $1.25 | $10.00 | — | — |
| OpenAI: GPT-5 | $1.25 | $10.00 | 1687ms | 37 |
| OpenAI: GPT-5.1-Codex | $1.25 | $10.00 | — | — |
| OpenAI: GPT-5.1 Chat | $1.25 | $10.00 | — | — |
| DeepSeek: R1 | $1.49 | $5.94 | 1687ms | 71 |
| OpenAI: GPT-5.2-Codex | $1.75 | $14.00 | — | — |
| OpenAI: GPT-5.3-Codex | $1.75 | $14.00 | 2703ms | 74 |
| OpenAI: GPT-5.3 Chat | $1.75 | $14.00 | — | — |
| OpenAI: GPT-5.2 | $1.75 | $14.00 | 1980ms | 31 |
| OpenAI: GPT-5.2 Chat | $1.75 | $14.00 | 1975ms | 56 |
| OpenAI: GPT-4.1 | $2.00 | $8.00 | — | — |
| OpenAI: GPT-4.1 | $2.00 | $8.00 | 1133ms | 53 |
| OpenAI: GPT-4o (2024-08-06) | $2.50 | $10.00 | 618ms | 3 |
| OpenAI: GPT-5.4 | $2.50 | $15.00 | 4242ms | 53 |
| OpenAI: GPT-4o | $2.50 | $10.00 | 833ms | 21 |
| OpenAI: GPT-3.5 Turbo 16k | $3.00 | $4.00 | 567ms | 16 |
| Anthropic: Claude Sonnet 4.6 | $3.00 | $15.00 | — | — |
| Anthropic: Claude Opus 4.6 | $5.00 | $25.00 | 1728ms | 6 |
| OpenAI: GPT-4o (2024-05-13) | $5.00 | $15.00 | 1247ms | 3 |
| OpenAI: GPT-5.5 | $5.00 | $30.00 | 3006ms | 16 |
| OpenAI: GPT-5.5 | $5.50 | $33.00 | — | — |
| Anthropic: Claude Fable 5 | $10.00 | $50.00 | — | — |
| OpenAI: GPT-4 | $30.00 | $60.00 | 361ms | 6 |
| OpenAI: GPT-5.4 Pro | $30.00 | $180.00 | — | — |
Community Reviews
4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆2025-06-10
Good performance but support could be faster.