Pricing

Serverless Pricing

Prices are quoted per 1 million tokens, with separate rates for input and output tokens.

| Model | Input price (per 1M tokens) | Output price (per 1M tokens) |
| --- | --- | --- |
| meta-llama/Llama-4-Scout-17B-16E-Instruct | $0.15 | $0.60 |
| meta-llama/Llama-4-Maverick-17B-128E-Instruct | $0.25 | $0.80 |
| deepseek-ai/DeepSeek-R1 | $0.50 | $2.18 |
| deepseek-ai/DeepSeek-V3-0324 | $0.90 | $0.90 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | $0.20 | $0.20 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | $0.10 | $0.20 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | $0.20 | $0.20 |
| deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | $0.50 | $0.90 |
| deepseek-ai/DeepSeek-R1-Distill-Llama-8B | $0.14 | $0.39 |
| deepseek-ai/DeepSeek-R1-Distill-Llama-70B | $0.25 | $0.75 |
| meta-llama/Llama-3.1-8B | $0.03 | $0.09 |
| meta-llama/Llama-3.3-70B-Instruct | $0.25 | $0.75 |
| Qwen/QwQ-32B | $0.50 | $1.50 |
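
As a rough illustration of how the per-million-token rates translate into a bill, the sketch below estimates the cost of a workload from its input and output token counts. It assumes billing is strictly proportional to token count, with no minimum charge or rounding, which this page does not confirm. The `PRICES` dictionary and the `serverless_cost` helper are hypothetical names used only for this example, and only a few rows of the table are copied in.

```python
from decimal import Decimal

# Prices on this page are quoted per 1 million tokens.
TOKENS_PER_UNIT = Decimal(1_000_000)

# (input_price, output_price) in dollars per 1M tokens, copied from the table above.
# Hypothetical lookup structure; extend it with other rows as needed.
PRICES = {
    "deepseek-ai/DeepSeek-R1": (Decimal("0.50"), Decimal("2.18")),
    "meta-llama/Llama-3.1-8B": (Decimal("0.03"), Decimal("0.09")),
    "Qwen/QwQ-32B": (Decimal("0.50"), Decimal("1.50")),
}


def serverless_cost(model, input_tokens, output_tokens):
    """Estimate the cost in dollars, assuming purely proportional billing."""
    input_price, output_price = PRICES[model]
    total = Decimal(input_tokens) * input_price + Decimal(output_tokens) * output_price
    return total / TOKENS_PER_UNIT


# 2M input tokens and 500k output tokens on DeepSeek-R1.
cost = serverless_cost("deepseek-ai/DeepSeek-R1", 2_000_000, 500_000)
print(f"Estimated cost: ${cost}")
```

For this workload the estimate is 2 × $0.50 for input plus 0.5 × $2.18 for output, or $2.09 in total. `Decimal` is used instead of floats so that small per-token amounts do not pick up binary rounding error.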

Dedicated Pricing

| GPU Type | Price |
| --- | --- |
| H100 | $4.98/Hour |
| H200 | $5.98/Hour |
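
A similar back-of-the-envelope sketch can estimate dedicated costs from the hourly rates above. It assumes the rate is per GPU, that cost scales linearly with hours and GPU count, and that there is no minimum commitment or billing increment; none of these are stated on this page. `HOURLY_RATES`, `dedicated_cost`, and the `gpu_count` parameter are hypothetical names for illustration.

```python
from decimal import Decimal

# Hourly rates in dollars, copied from the table above.
HOURLY_RATES = {
    "H100": Decimal("4.98"),
    "H200": Decimal("5.98"),
}


def dedicated_cost(gpu_type, hours, gpu_count=1):
    """Estimate the cost in dollars, assuming linear per-GPU hourly billing."""
    # str() keeps fractional hours such as 1.5 exact when converting to Decimal.
    return HOURLY_RATES[gpu_type] * Decimal(str(hours)) * gpu_count


# One H100 running for a full day.
print(f"H100, 24 hours: ${dedicated_cost('H100', 24)}")
```

Under these assumptions, one H100 for 24 hours comes to 24 × $4.98 = $119.52.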