4.1 Overview of the Editor Interface

The canvas is a visual editor where you build workflows by placing and connecting nodes.

4.2 Node Library

GMI Official Nodes
  • Image, video, LLM, and audio generation nodes through API
  • GMI Image Upload and Batch Images nodes

4.2.1 Video Models

Wan
  • wan2.6-t2v (Text → Video): Generates videos directly from text prompts.
  • wan2.6-i2v (Image + Text → Video): Animates a still image into a video.
  • wan2.6-r2v (Video + Text → Video): Edits or transforms an existing video.
  • wan2.5-t2v-preview (Text → Video): Fast preview version for quick testing.
  • wan2.5-i2v-preview (Image + Text → Video): Preview image-to-video generation.
Veo
  • veo-3.1-fast-generate-preview (Text → Video): Fast, low-latency video generation.
  • veo-3.1-generate-preview (Text → Video): Higher-quality cinematic video generation.
  • Veo3-Fast (Text → Video): Balanced speed and visual quality.
  • Veo3 (Text → Video): Premium cinematic video generation.
Sora
  • sora-2 (Text → Video): Realistic video generation from text.
  • sora-2-pro (Text → Video): Higher-fidelity videos that stay consistent for longer.
Kling
  • kling-v2-6 (Text → Video): Smooth, stylized text-to-video generation.
  • kling-v2-5-turbo (Text → Video): Faster text-to-video generation.
  • Kling-Text2Video-V1.6-Standard (Text → Video): Standard text-to-video generation.
  • Kling-Text2Video-V2.1-Master (Text → Video): Advanced, high-quality video generation.
  • Kling-Text2Video-V2-Master (Text → Video): Premium text-to-video model.
  • Kling-Image2Video-V1.6-Standard (Image → Video): Basic image animation.
  • Kling-Image2Video-V1.6-Pro (Image → Video): Higher-quality image animation.
  • Kling-Image2Video-V2.1-Standard (Image → Video): Improved motion consistency.
  • Kling-Image2Video-V2.1-Pro (Image → Video): Professional-grade image-to-video.
  • Kling-Image2Video-V2.1-Master (Image → Video): Premium image-to-video quality.
  • Kling-Image2Video-V2-Master (Image → Video): Cinematic image animation.
  • kling-o1-image-to-video (Image → Video): Omni image-to-video generation.
  • kling-o1-reference-to-video (Reference + Text → Video): Generates video guided by a reference.
  • kling-o1-edit-video (Video + Text → Video): Edits video using text instructions.
  • kling-o1-flfv (Image / Video → Video): Flexible multi-input video generation.
Minimax Hailuo
  • Minimax-Hailuo-02 (Text → Video): General-purpose text-to-video generation.
  • Minimax-Hailuo-2.3 (Text → Video): Improved motion and visual consistency.
  • Minimax-Hailuo-2.3-Fast (Text → Video): Faster, cost-efficient generation.
PixVerse
  • pixverse-v5-t2v (Text → Video): Creative text-to-video generation.
  • pixverse-v5-i2v (Image → Video): Image animation with visual effects.
  • pixverse-v5-transition (Images / Video → Video): Generates smooth scene transitions.
Seedance
  • seedance-1-0-pro-fast-251015 (Text → Video): Ultra-fast video generation.
  • seedance-1-0-pro-250528 (Text → Video): Higher-quality Seedance generation.
Luma
  • Luma-Ray2 (Text → Video): Cinematic video with realistic camera motion.
Bria
  • bria-video-eraser (Video → Video): Removes unwanted objects from videos.

4.2.2 Image Models

Bria
  • bria-fibo (Text → Image): Generates brand-safe, commercially usable images from text prompts.
  • bria-genfill (Image + Mask → Image): Fills or replaces selected regions with context-aware content.
  • bria-eraser (Image → Image): Removes unwanted objects or elements from an image cleanly.
Google (Gemini)
  • gemini-3-pro-image-preview (Text → Image): High-quality image generation with strong prompt understanding and visual accuracy.
Seedream / SeedEdit
  • seedream-4-0-250828 (Text → Image): Produces high-fidelity images with rich details and consistent styles.
  • seedream-3-0-t2i-250415 (Text → Image): Balanced text-to-image generation for general use.
  • seededit-3-0-i2i-250628 (Image → Image): Advanced image editing while preserving structure and key details.
Tongyi (Alibaba)
  • Z-Image-Turbo (Text → Image): Ultra-fast and cost-efficient text-to-image generation for rapid iteration.
Reve
  • reve-create-20250915 (Text → Image): Creative text-to-image generation with strong artistic styles.
  • reve-edit-20250915 (Image → Image): Prompt-based image editing while preserving layout and subject.
  • reve-edit-fast-20251030 (Image → Image): Faster image editing for quick adjustments.
  • reve-remix-20250915 (Image → Image): Remixes an image into new styles or variations.
  • reve-remix-fast-20251030 (Image → Image): Fast image remixing with lower latency.

4.2.3 Audio Models

Note: When uploading audio to GMI, only WAV and MP3 file formats are supported.
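The format restriction above can be checked before an upload is attempted. The following is a minimal sketch, assuming the format can be judged from the file extension alone; the function name `is_supported_audio` is hypothetical and is not part of any GMI API.

```python
from pathlib import Path

# Per the note above, only WAV and MP3 uploads are supported.
# Assumption: the file extension reliably reflects the actual format.
SUPPORTED_AUDIO_EXTENSIONS = {".wav", ".mp3"}


def is_supported_audio(path: str) -> bool:
    """Return True if the file name ends in a supported audio extension."""
    return Path(path).suffix.lower() in SUPPORTED_AUDIO_EXTENSIONS


if __name__ == "__main__":
    for candidate in ["voice_sample.wav", "narration.MP3", "clip.flac"]:
        status = "ok" if is_supported_audio(candidate) else "convert to WAV or MP3 first"
        print(f"{candidate}: {status}")
```

An extension check only catches obvious mismatches; a file renamed to `.mp3` without being converted would still pass it.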
Minimax Audio
  • minimax-audio-voice-clone-speech-2.6-turbo (Text + Voice Sample → Audio): Fast voice cloning and speech generation with natural tone.
  • minimax-audio-voice-clone-speech-2.6-hd (Text + Voice Sample → Audio): High-fidelity voice cloning with improved clarity and realism.
  • minimax-tts-speech-2.6-turbo (Text → Audio): Low-latency text-to-speech for real-time or high-throughput use.
  • minimax-tts-speech-2.6-hd (Text → Audio): High-quality text-to-speech with richer voice detail.
  • minimax-tts-speech-2.5-turbo-preview (Text → Audio): Preview version optimized for fast speech generation.
  • minimax-tts-speech-2.5-hd-preview (Text → Audio): Preview high-definition text-to-speech generation.
  • minimax-tts-speech-02-turbo (Text → Audio): Efficient text-to-speech with balanced speed and quality.
  • minimax-tts-speech-02-hd (Text → Audio): Enhanced speech quality with clearer pronunciation.
  • minimax-tts-speech-01-turbo (Text → Audio): Earlier turbo TTS model optimized for speed.
  • minimax-tts-speech-01-hd (Text → Audio): Earlier HD TTS model focused on audio clarity.
ElevenLabs
  • elevenlabs-tts-v3 (Text → Audio): High-quality text-to-speech with expressive and natural voices.
  • elevenlabs-tts-multilingual-v2 (Text → Audio): Multilingual text-to-speech supporting multiple languages and accents.
Step Audio
  • Step-Audio-EditX (Audio + Voice Reference → Audio): Voice cloning and audio editing for modifying or recreating spoken content.
ComfyUI Nodes
  • Native ComfyUI nodes sourced directly from the official ComfyUI repository
  • Standard preprocessing, conditioning, and utility nodes commonly used in Comfy workflows
When to Use GMI Official API Nodes vs ComfyUI Nodes
  • GMI Official API Nodes: Fully managed inference, simplified inputs/outputs, optimized performance
  • ComfyUI Nodes: Fine-grained control, custom preprocessing, advanced workflow logic

4.3 Canvas Basics

  • Add Node: Right-click the canvas → Add Node, or find a node in the Node Library
  • Delete Node: Select the node and click the trash symbol
  • Drag & Arrange: Move nodes freely on the canvas
  • Connect Nodes: Drag from an output of one node to an input of another; the data types must match for the connection to be valid
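The type-matching rule for connections can be pictured as a simple equality check between the two endpoints. The sketch below is illustrative only: the `Socket` class, the `can_connect` function, and the type labels are hypothetical and do not reflect how the editor is implemented internally.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Socket:
    """One endpoint on a node (hypothetical model, not the editor's own)."""
    node: str        # name of the node that owns this socket
    name: str        # socket label, e.g. "image"
    data_type: str   # illustrative type label, e.g. "IMAGE" or "VIDEO"


def can_connect(output: Socket, input_: Socket) -> bool:
    """A link is valid only between different nodes with matching data types."""
    return output.node != input_.node and output.data_type == input_.data_type


if __name__ == "__main__":
    upload_out = Socket("GMI Image Upload", "image", "IMAGE")
    i2v_image_in = Socket("wan2.6-i2v", "image", "IMAGE")
    i2v_prompt_in = Socket("wan2.6-i2v", "prompt", "STRING")

    print(can_connect(upload_out, i2v_image_in))   # image output into image input
    print(can_connect(upload_out, i2v_prompt_in))  # image output into text input
```

Under these assumptions the first call accepts the link and the second rejects it, which mirrors what happens on the canvas when a dragged connection does not attach to an input of a different type.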

4.4 Toolbar Functions

  • Run: Execute the workflow
  • Save: Save workflow changes