LIVE
Models: —+ · Providers: —+ · Cheapest H100: $2.49/hr · Updated: 04:27 PM

NextBit

AGGREGATED · INFERENCE
Uptime: N/A
Rating: N/A

30-Day Uptime

100%
2026-05-22 to 2026-06-20

Inference Latency

MythoMax 13B · 709ms TTFT · 35 TPS
Microsoft: Phi 4 · 787ms TTFT · 46 TPS
NeverSleep: Lumimaid v0.2 8B · 366ms TTFT · 95 TPS
OpenAI: gpt-oss-20b · 718ms TTFT · 115 TPS
Qwen: Qwen3 14B · 2459ms TTFT · 27 TPS
Google: Gemma 4 26B A4B · 1036ms TTFT · 26 TPS
Qwen: Qwen3 30B A3B · 1225ms TTFT · 19 TPS
Mistral: Ministral 3 3B 2512 · 495ms TTFT · 33 TPS
TheDrummer: Rocinante 12B · 534ms TTFT · 72 TPS
Qwen: Qwen3.5-35B-A3B · 568ms TTFT · 37 TPS
DeepSeek: R1 Distill Qwen 32B · 753ms TTFT · 23 TPS
Mistral: Ministral 3 8B 2512 · 533ms TTFT · 5 TPS
Mistral: Ministral 3 14B 2512 · 534ms TTFT · 26 TPS
TheDrummer: UnslopNemo 12B · 542ms TTFT · 58 TPS
ReMM SLERP 13B · 778ms TTFT · 20 TPS
Google: Gemma 2 27B · 554ms TTFT · 30 TPS
Sao10K: Llama 3.3 Euryale 70B · 1329ms TTFT · 7 TPS
Noromaid 20B · 773ms TTFT · 31 TPS
DeepSeek: DeepSeek V4 Pro · 894ms TTFT · 55 TPS

Inference Models

| Model | Input $/M | Output $/M | TTFT | TPS |
|---|---|---|---|---|
| MythoMax 13B | $0.06 | $0.06 | 709ms | 35 |
| Microsoft: Phi 4 | $0.07 | $0.14 | 787ms | 46 |
| NeverSleep: Lumimaid v0.2 8B | $0.09 | $0.60 | 366ms | 95 |
| OpenAI: gpt-oss-20b | $0.10 | $0.45 | 718ms | 115 |
| Qwen: Qwen3 14B | $0.10 | $0.24 | 2459ms | 27 |
| Google: Gemma 4 26B A4B | $0.13 | $0.40 | 1036ms | 26 |
| Qwen: Qwen3 30B A3B | $0.14 | $0.55 | 1225ms | 19 |
| Mistral: Ministral 3 3B 2512 | $0.15 | $0.15 | 495ms | 33 |
| TheDrummer: Rocinante 12B | $0.17 | $0.43 | 534ms | 72 |
| Qwen: Qwen3.5-35B-A3B | $0.23 | $1.60 | 568ms | 37 |
| DeepSeek: R1 Distill Qwen 32B | $0.29 | $0.29 | 753ms | 23 |
| Mistral: Ministral 3 8B 2512 | $0.30 | $0.30 | 533ms | 5 |
| Mistral: Ministral 3 14B 2512 | $0.35 | $0.35 | 534ms | 26 |
| TheDrummer: UnslopNemo 12B | $0.40 | $0.40 | 542ms | 58 |
| ReMM SLERP 13B | $0.45 | $0.65 | 778ms | 20 |
| Google: Gemma 2 27B | $0.65 | $0.65 | 554ms | 30 |
| Sao10K: Llama 3.3 Euryale 70B | $0.65 | $0.75 | 1329ms | 7 |
| Noromaid 20B | $1.00 | $1.75 | 773ms | 31 |
| DeepSeek: DeepSeek V4 Pro | $1.55 | $3.20 | 894ms | 55 |
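
To show how the pricing and latency columns combine, here is a minimal sketch that estimates the cost and response time of a single request. It assumes "$/M" means US dollars per million tokens and that total response time is roughly TTFT plus output tokens divided by TPS; the page states neither explicitly. The function name `estimate_request` and the token counts are hypothetical, chosen only for illustration.

```python
def estimate_request(input_price, output_price, ttft_ms, tps,
                     input_tokens, output_tokens):
    """Estimate cost (USD) and response time (seconds) for one request.

    Assumes prices are USD per million tokens and that generation
    proceeds at a steady `tps` tokens per second after the first token.
    """
    cost = (input_tokens * input_price + output_tokens * output_price) / 1_000_000
    seconds = ttft_ms / 1000 + output_tokens / tps
    return cost, seconds


# Listed figures for OpenAI: gpt-oss-20b: $0.10 in, $0.45 out, 718ms TTFT, 115 TPS.
# The token counts (2,000 in, 500 out) are hypothetical.
cost, seconds = estimate_request(0.10, 0.45, 718, 115, 2_000, 500)
print(f"cost: ${cost:.6f}, time: {seconds:.2f}s")
```

Under these assumptions the example request costs about $0.000425 and takes about 5.07 seconds. Real latency varies with load, so treat the time figure as a rough guide rather than a guarantee.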

Community Reviews

4.5 ★★★★★ (2 reviews)
clouduser42
★★★★★ 2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★ 2025-06-10

Good performance but support could be faster.