LIVE
Models: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 05:21 PMModels: —+Providers: —+Cheapest H100: $2.49/hrUpdated: 05:21 PM
Marketplace
Providers Models
N

NextBit

AGGREGATEDINFERENCE
N/A
Uptime
N/A
Rating

30-Day Uptime

96.7%
2026-03-232026-04-21

Inference Latency

Qwen: Qwen3 14B2360ms TTFT · 5 TPS
MythoMax 13B1344ms TTFT · 17 TPS
NeverSleep: Lumimaid v0.2 8B366ms TTFT · 95 TPS
OpenAI: gpt-oss-20b997ms TTFT · 72 TPS
Google: Gemma 4 26B A4B 1436ms TTFT · 26 TPS
Qwen: Qwen3 30B A3B1186ms TTFT · 18 TPS
Mistral: Ministral 3 3B 25121096ms TTFT · 18 TPS
TheDrummer: Rocinante 12B935ms TTFT · 56 TPS
DeepSeek: R1 Distill Qwen 32B1118ms TTFT · 23 TPS
Mistral: Ministral 3 8B 25121806ms TTFT · 24 TPS
Qwen: Qwen3.5-35B-A3B1750ms TTFT · 86 TPS
Mistral: Ministral 3 14B 25122572ms TTFT · 18 TPS
TheDrummer: UnslopNemo 12B893ms TTFT · 58 TPS
ReMM SLERP 13B1117ms TTFT · 15 TPS
Google: Gemma 2 27B765ms TTFT · 18 TPS
Sao10K: Llama 3.3 Euryale 70B3591ms TTFT · 10 TPS
Noromaid 20B773ms TTFT · 31 TPS

Inference Models

ModelInput $/MOutput $/MTTFTTPS
Qwen: Qwen3 14B$0.06$0.242360ms5
MythoMax 13B$0.06$0.061344ms17
Microsoft: Phi 4$0.07$0.14
NeverSleep: Lumimaid v0.2 8B$0.09$0.60366ms95
OpenAI: gpt-oss-20b$0.10$0.45997ms72
Google: Gemma 4 26B A4B $0.10$0.401436ms26
Qwen: Qwen3 30B A3B$0.14$0.551186ms18
Mistral: Ministral 3 3B 2512$0.15$0.151096ms18
TheDrummer: Rocinante 12B$0.17$0.43935ms56
DeepSeek: R1 Distill Qwen 32B$0.29$0.291118ms23
Mistral: Ministral 3 8B 2512$0.30$0.301806ms24
Qwen: Qwen3.5-35B-A3B$0.30$1.801750ms86
Mistral: Ministral 3 14B 2512$0.35$0.352572ms18
TheDrummer: UnslopNemo 12B$0.40$0.40893ms58
ReMM SLERP 13B$0.45$0.651117ms15
Google: Gemma 2 27B$0.65$0.65765ms18
Sao10K: Llama 3.3 Euryale 70B$0.65$0.753591ms10
Noromaid 20B$1.00$1.75773ms31

Community Reviews

4.5★★★★★(2 reviews)
clouduser42
★★★★★2025-06-15

Reliable service, great API documentation.

mlresearcher
★★★★2025-06-10

Good performance but support could be faster.