Skip to main content

Serverless Pricing

Model NameInput Price (per 1M tokens)Output Price (per 1M tokens)
ZAI: GLM-4.6$0.60$2.00
DeepSeek: DeepSeek-V3.2-Exp$0.27$0.41
DeepSeek: DeepSeek-V3.1-Terminus$0.27$1.00
Qwen: Qwen3 Next 80B A3B Thinking$0.15$1.50
Qwen: Qwen3 Next 80B A3B Instruct$0.15$1.50
Moonshotai: Kimi-K2-Instruct-0905$0.60$2.50
DeepSeek: DeepSeek V3.1$0.27$1.00
OpenAI: GPT OSS 120b$0.07$0.28
Qwen: Qwen3 Coder 480B A35B Instruct FP8$0.29$1.20
ZAI: GLM-4.5-FP8$0.60$2.20
ZAI: GLM-4.5-Air-FP8$0.20$1.10
Moonshotai: Kimi-K2-Instruct$1.00$3.00
DeepSeek: DeepSeek V3 0324$0.28$0.88
DeepSeek: DeepSeek R1 0528$0.70$2.30
Qwen: Qwen3 235B A22B Instruct 2507 FP8$0.17$1.09
Qwen: Qwen3 32B FP8$0.10$0.60
Qwen: Qwen3 235B A22B FP8$0.17$1.09
DeepSeek: DeepSeek Prover V2 671B$0.50$2.18
DeepSeek: DeepSeek R1$0.50$2.18
DeepSeek: DeepSeek R1 Distill Llama 70B$0.25$0.75
DeepSeek: DeepSeek R1 Distill Llama 8B$0.14$0.39
DeepSeek: DeepSeek R1 Distill Qwen 14B$0.20$0.20
DeepSeek: DeepSeek R1 Distill Qwen 1.5B$0.00$0.00
DeepSeek: DeepSeek R1 Distill Qwen 32B$0.50$0.90
DeepSeek: DeepSeek R1 Distill Qwen 7B$0.10$0.20
Meta: Llama 3.3 70B Instruct$0.25$0.75
Meta: Llama-4 Maverick 17B 128E Instruct FP8$0.25$0.80
Meta: Llama-4 Scout 17B 16E Instruct$0.08$0.50
Qwen: Qwen3 235B A22B Thinking 2507 FP8$0.60$3.00
Qwen: Qwen3-30B-A3B$0.08$0.25

Dedicated Pricing

GPU TypePrice
H100$2.98/Hour
H200$3.98/Hour