Skip to main content

Pricing

Serverless Pricing

The prices are based on 1 million tokens.

model_nameinput_priceoutput_price
deepseek-ai/DeepSeek-R1$2.00$6.00
deepseek-ai/DeepSeek-V3$1.25$1.25
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B$0.02$0.06
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B$0.03$0.09
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B$0.06$0.18
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B$0.13$0.40
deepseek-ai/DeepSeek-R1-Distill-Llama-8B$0.03$0.09
deepseek-ai/DeepSeek-R1-Distill-Llama-70B$0.25$0.75
meta-llama/Llama-3.1-8B$0.03$0.09
meta-llama/Llama-3.3-70B-Instruct$0.25$0.75
Qwen/QwQ-32B$0.50$1.50

Dedicated Pricing

GPU TypePrice
H100$4.98/Hour
H200$5.98/Hour