Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.gmicloud.ai/llms.txt

Use this file to discover all available pages before exploring further.

Models are grouped by what they produce (text, audio, video, image, or 3D assets) so you can jump straight to the modality you care about. Each category page explains typical technical topics and lists every model in that group with links to detailed pages.

Browse by modality

ModalityWhat it coversStart here
LanguageChat, code, reasoning, OCR / vision-languageLLM models
AudioTTS, voice cloning, musicAudio models
VideoText-to-video, image-to-video, editing, avatarsVideo models
ImageGeneration, editing, batch inferenceImage models

Model API (serving, pricing, limits)

Individual model pages describe that model’s inputs and examples. For how to call the platform end-to-end (marketplace, serverless vs dedicated deployments, LLM and video API references, SDKs, rate limits, billing, tasks, and artifacts), use Model API in the sidebar.

Choosing a model

  • Match modality — Pick LLM vs audio vs video vs image vs 3D first; hybrid needs may use multiple APIs.
  • Read constraints on the model page — Context length, resolution, duration, and rate limits vary.
  • Start with defaults — Official examples on each page reflect supported parameters today.