Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.gmicloud.ai/llms.txt

Use this file to discover all available pages before exploring further.

1

Create an account and get your API key

Sign in to the GMI Cloud Console. Go to Settings → API Keys and create a new key. Copy it — you’ll use it in the next step.
2

Make your first inference call

GMI’s inference API is OpenAI-compatible. Swap in your API key and endpoint:
from openai import OpenAI

client = OpenAI(
    base_url="https://api.gmicloud.ai/v1",
    api_key="YOUR_GMI_API_KEY",
)

response = client.chat.completions.create(
    model="meta-llama/Llama-3.3-70B-Instruct",
    messages=[{"role": "user", "content": "Hello, what can you do?"}],
)

print(response.choices[0].message.content)
3

Explore what's next

Pick where to go based on your use case:

Go deeper

Browse models

Text, image, video, and audio models available on GMI.

Dedicated endpoints

Reserve capacity for production workloads.

GPU Compute

Managed clusters and bare-metal for training and fine-tuning.

GMI Studio

Build multi-step AI pipelines visually.

API Reference

Full REST API docs for all GMI services.

Agent Frameworks

Plug GMI into Hermes, Dify, and OpenClaw.