Skip to main content
Pricing on GMI Cloud changes as new models, hardware, and regions come online. Rather than mirror prices here (where they would go stale), the up-to-date rates live in the console.

Where to check

  • Inference (Serverless and Dedicated): visit the GMI Cloud Console and open Inference > Model Hub. Each model card shows current input/output rates and any region modifiers.
  • GPU Compute (clusters, bare metal, containers): open Compute > Home in the console. Each SKU card lists the live hourly rate, region, and any active discount.
  • Storage and networking: shown on the relevant resource page in the console at allocation time.

Estimating costs

When you provision a cluster, allocate an Elastic IP, or run a workload, the console shows a Summary panel with the estimated monthly cost, list price, and any applicable discount before you confirm. For volume pricing, custom commitments, or enterprise agreements, contact sales.