Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.gmicloud.ai/llms.txt

Use this file to discover all available pages before exploring further.

GMI Cloud is one platform for running AI workloads end to end. Use serverless inference, dedicated GPU clusters, a visual workflow builder, or publish your own agents on the marketplace.

Products

Inference

Serverless and Dedicated endpoints for chat, vision, image, video, and audio models. OpenAI-compatible APIs.

GPU Compute

Managed Kubernetes clusters, container instances, and bare-metal servers on H200 and B200 GPUs.

GMI Studio

Visual workflow builder. Connect models with nodes to make multi-step pipelines for media and text.

GMI AgentBox

Marketplace of ready-to-use AI agents. Browse, use, or publish your own.

Build with GMI

Guides

Task-focused walkthroughs across products: agents, coding tools, model quickstarts, and migration.

API Reference

REST APIs for IAM, Compute, IDC, and Inference services. Full request and response schemas.

Coding Tools

Use GMI models inside Claude Code, Codex, and Cursor.

Agent Frameworks

Plug GMI into Hermes, Dify, and OpenClaw.

Model catalog

Text

LLMs for chat, code, and reasoning.

Image

Generation, editing, and batch image workflows.

Video

Text-to-video, image-to-video, and editing.

Audio

TTS, voice cloning, and music generation.

Console

Manage everything in one place.

Pricing

Current rates for inference and compute.

Migration

Move S3 workloads to GMI Cloud.

Contact Sales

Enterprise pricing or early access.