Generates video driven by audio input, optionally guided by an image or a text prompt.
Inputs
Required
- audio / audio_url (AUDIO or STRING)
Audio input file (2–20 seconds).
Optional
- prompt (STRING)
Text guidance for scene generation.
- image / image_url (IMAGE or STRING)
Optional first-frame visual reference.
- resolution (STRING, default: 1920x1080)
Output resolution.
- guidance_scale (FLOAT, default: 5)
Strength of prompt adherence.
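
As a rough illustration of how the inputs above fit together, the following sketch assembles them into a dictionary and applies the documented defaults. The helper name `build_inputs`, the `audio_seconds` argument, and the idea of checking values on the client side are assumptions made for this example, not part of the documented API. It also assumes the 2–20 second range is inclusive and that resolution is always written as `WIDTHxHEIGHT`.

```python
from typing import Any, Dict, Optional

# Documented audio length range; treating both ends as inclusive is an assumption.
MIN_AUDIO_SECONDS = 2.0
MAX_AUDIO_SECONDS = 20.0


def build_inputs(
    audio_url: str,
    audio_seconds: float,
    prompt: Optional[str] = None,
    image_url: Optional[str] = None,
    resolution: str = "1920x1080",
    guidance_scale: float = 5.0,
) -> Dict[str, Any]:
    """Hypothetical helper: collect the documented inputs into one dict."""
    if not audio_url:
        raise ValueError("audio_url is required")
    if not MIN_AUDIO_SECONDS <= audio_seconds <= MAX_AUDIO_SECONDS:
        raise ValueError("audio must be between 2 and 20 seconds long")

    # Assumed format check: two positive integers separated by a lowercase "x".
    width, sep, height = resolution.partition("x")
    if not (sep and width.isdigit() and height.isdigit()):
        raise ValueError("resolution must look like '1920x1080'")

    inputs: Dict[str, Any] = {
        "audio_url": audio_url,
        "resolution": resolution,
        "guidance_scale": float(guidance_scale),
    }
    # Optional inputs are only included when the caller supplies them.
    if prompt:
        inputs["prompt"] = prompt
    if image_url:
        inputs["image_url"] = image_url
    return inputs
```

Calling `build_inputs("https://example.com/voice.wav", audio_seconds=8.0, prompt="a singer on stage")` would return a dictionary with the audio URL, the prompt, and the default resolution and guidance scale. How that dictionary is submitted depends on the client or workflow in use and is not covered here.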
Outputs
- video (VIDEO)
Generated video.
- video_url (STRING)
Public video URL.
- file_path (STRING)
Local saved file path.
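
The sketch below shows one way downstream code might choose between the two string outputs listed above. It assumes the outputs arrive as a dictionary keyed by the documented names, which this page does not confirm, and `pick_video_location` is a hypothetical helper. Preferring the public URL over the local path is a design choice made for the example.

```python
from typing import Any, Mapping, Optional


def pick_video_location(outputs: Mapping[str, Any]) -> Optional[str]:
    """Hypothetical helper: return video_url if present, else file_path."""
    # Check the keys in order of preference and skip missing or empty values.
    for key in ("video_url", "file_path"):
        value = outputs.get(key)
        if isinstance(value, str) and value:
            return value
    return None
```

Returning `None` when neither field is set lets the caller decide whether a missing location is an error.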