NextBit
AGGREGATED INFERENCE
Uptime: N/A
Rating: N/A
30-Day Uptime
100% (2026-05-22 to 2026-06-20)
Inference Latency
MythoMax 13B · 709ms TTFT · 35 TPS
Microsoft: Phi 4 · 787ms TTFT · 46 TPS
NeverSleep: Lumimaid v0.2 8B · 366ms TTFT · 95 TPS
OpenAI: gpt-oss-20b · 718ms TTFT · 115 TPS
Qwen: Qwen3 14B · 2459ms TTFT · 27 TPS
Google: Gemma 4 26B A4B · 1036ms TTFT · 26 TPS
Qwen: Qwen3 30B A3B · 1225ms TTFT · 19 TPS
Mistral: Ministral 3 3B 2512 · 495ms TTFT · 33 TPS
TheDrummer: Rocinante 12B · 534ms TTFT · 72 TPS
Qwen: Qwen3.5-35B-A3B · 568ms TTFT · 37 TPS
DeepSeek: R1 Distill Qwen 32B · 753ms TTFT · 23 TPS
Mistral: Ministral 3 8B 2512 · 533ms TTFT · 5 TPS
Mistral: Ministral 3 14B 2512 · 534ms TTFT · 26 TPS
TheDrummer: UnslopNemo 12B · 542ms TTFT · 58 TPS
ReMM SLERP 13B · 778ms TTFT · 20 TPS
Google: Gemma 2 27B · 554ms TTFT · 30 TPS
Sao10K: Llama 3.3 Euryale 70B · 1329ms TTFT · 7 TPS
Noromaid 20B · 773ms TTFT · 31 TPS
DeepSeek: DeepSeek V4 Pro · 894ms TTFT · 55 TPS
Inference Models
| Model | Input ($/M tokens) | Output ($/M tokens) | TTFT | TPS |
|---|---|---|---|---|
| MythoMax 13B | $0.06 | $0.06 | 709ms | 35 |
| Microsoft: Phi 4 | $0.07 | $0.14 | 787ms | 46 |
| NeverSleep: Lumimaid v0.2 8B | $0.09 | $0.60 | 366ms | 95 |
| OpenAI: gpt-oss-20b | $0.10 | $0.45 | 718ms | 115 |
| Qwen: Qwen3 14B | $0.10 | $0.24 | 2459ms | 27 |
| Google: Gemma 4 26B A4B | $0.13 | $0.40 | 1036ms | 26 |
| Qwen: Qwen3 30B A3B | $0.14 | $0.55 | 1225ms | 19 |
| Mistral: Ministral 3 3B 2512 | $0.15 | $0.15 | 495ms | 33 |
| TheDrummer: Rocinante 12B | $0.17 | $0.43 | 534ms | 72 |
| Qwen: Qwen3.5-35B-A3B | $0.23 | $1.60 | 568ms | 37 |
| DeepSeek: R1 Distill Qwen 32B | $0.29 | $0.29 | 753ms | 23 |
| Mistral: Ministral 3 8B 2512 | $0.30 | $0.30 | 533ms | 5 |
| Mistral: Ministral 3 14B 2512 | $0.35 | $0.35 | 534ms | 26 |
| TheDrummer: UnslopNemo 12B | $0.40 | $0.40 | 542ms | 58 |
| ReMM SLERP 13B | $0.45 | $0.65 | 778ms | 20 |
| Google: Gemma 2 27B | $0.65 | $0.65 | 554ms | 30 |
| Sao10K: Llama 3.3 Euryale 70B | $0.65 | $0.75 | 1329ms | 7 |
| Noromaid 20B | $1.00 | $1.75 | 773ms | 31 |
| DeepSeek: DeepSeek V4 Pro | $1.55 | $3.20 | 894ms | 55 |
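
The table's columns can be combined into a rough per-request estimate. The sketch below is illustrative only and rests on assumptions the page does not state: that "$/M" means US dollars per million tokens, that TPS is output tokens per second after the first token, and that total response time is approximately TTFT plus output tokens divided by TPS. The `ModelRow` class and both helper functions are hypothetical names introduced for this example, not part of any provider API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelRow:
    """One row of the Inference Models table (hypothetical helper type)."""
    name: str
    input_usd_per_m: float   # assumed: USD per million input tokens
    output_usd_per_m: float  # assumed: USD per million output tokens
    ttft_ms: float           # time to first token, in milliseconds
    tps: float               # assumed: output tokens per second


def estimate_cost_usd(row: ModelRow, input_tokens: int, output_tokens: int) -> float:
    """Cost of one request, billing input and output tokens at their listed rates."""
    total = input_tokens * row.input_usd_per_m + output_tokens * row.output_usd_per_m
    return total / 1_000_000


def estimate_latency_s(row: ModelRow, output_tokens: int) -> float:
    """Approximate seconds until the full reply arrives: TTFT plus generation time."""
    return row.ttft_ms / 1000 + output_tokens / row.tps


# Two rows copied from the table above.
GPT_OSS_20B = ModelRow("OpenAI: gpt-oss-20b", 0.10, 0.45, 718, 115)
MINISTRAL_8B = ModelRow("Mistral: Ministral 3 8B 2512", 0.30, 0.30, 533, 5)

if __name__ == "__main__":
    for row in (GPT_OSS_20B, MINISTRAL_8B):
        cost = estimate_cost_usd(row, input_tokens=1000, output_tokens=500)
        latency = estimate_latency_s(row, output_tokens=500)
        print(f"{row.name}: ~${cost:.6f}, ~{latency:.1f}s for a 500-token reply")
```

Under these assumptions, throughput dominates for long replies: at 5 TPS a 500-token reply needs about 100 seconds of generation even with a 533ms TTFT, so TTFT alone is a poor basis for comparing rows.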
Community Reviews
4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15
Reliable service, great API documentation.
mlresearcher
★★★★☆ 2025-06-10
Good performance but support could be faster.