Pricing
Serverless Pricing
The prices are based on 1 million tokens.
model_name | input_price | output_price |
---|---|---|
deepseek-ai/DeepSeek-R1 | $2.00 | $6.00 |
deepseek-ai/DeepSeek-V3 | $1.25 | $1.25 |
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B | $0.02 | $0.06 |
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B | $0.03 | $0.09 |
deepseek-ai/DeepSeek-R1-Distill-Qwen-14B | $0.06 | $0.18 |
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B | $0.13 | $0.40 |
deepseek-ai/DeepSeek-R1-Distill-Llama-8B | $0.03 | $0.09 |
deepseek-ai/DeepSeek-R1-Distill-Llama-70B | $0.25 | $0.75 |
meta-llama/Llama-3.1-8B | $0.03 | $0.09 |
meta-llama/Llama-3.3-70B-Instruct | $0.25 | $0.75 |
Qwen/QwQ-32B | $0.50 | $1.50 |
Dedicated Pricing
GPU Type | Price |
---|---|
H100 | $4.98/Hour |
H200 | $5.98/Hour |