GMI Cloud is one platform for running AI workloads end to end. Use serverless inference, dedicated GPU clusters, a visual workflow builder, or publish your own agents on the marketplace.Documentation Index
Fetch the complete documentation index at: https://docs.gmicloud.ai/llms.txt
Use this file to discover all available pages before exploring further.
Products
Inference
Serverless and Dedicated endpoints for chat, vision, image, video, and audio models. OpenAI-compatible APIs.
GPU Compute
Managed Kubernetes clusters, container instances, and bare-metal servers on H200 and B200 GPUs.
GMI Studio
Visual workflow builder. Connect models with nodes to make multi-step pipelines for media and text.
GMI AgentBox
Marketplace of ready-to-use AI agents. Browse, use, or publish your own.
Build with GMI
Guides
Task-focused walkthroughs across products: agents, coding tools, model quickstarts, and migration.
API Reference
REST APIs for IAM, Compute, IDC, and Inference services. Full request and response schemas.
Coding Tools
Use GMI models inside Claude Code, Codex, and Cursor.
Agent Frameworks
Plug GMI into Hermes, Dify, and OpenClaw.
Model catalog
Text
LLMs for chat, code, and reasoning.
Image
Generation, editing, and batch image workflows.
Video
Text-to-video, image-to-video, and editing.
Audio
TTS, voice cloning, and music generation.
Quick links
Console
Manage everything in one place.
Pricing
Current rates for inference and compute.
Migration
Move S3 workloads to GMI Cloud.
Contact Sales
Enterprise pricing or early access.