Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.gmicloud.ai/llms.txt

Use this file to discover all available pages before exploring further.

Model ID
minimax-audio-voice-clone-speech-2.6-turbo

Minimax Audio Voice Clone API Usage Guide

Overview

Minimax Audio Voice Clone allows you to clone any voice from an audio sample and use it to generate custom speech. Simply provide URLs to your audio files, and the system will automatically handle downloading, processing, and voice cloning. The cloned voice can then speak any text you provide.

Key Features:

  • Synchronous Operation: Get results immediately in 5-15 seconds
  • URL-Based Input: Provide audio URLs, backend handles all processing
  • Style Control: Optional prompt audio to define speaking style, tone, and emotion
  • Audio Enhancement: Built-in noise reduction and volume normalization
  • High-Quality Audio: Supports MP3, M4A, and WAV formats

Authentication

All API requests require authentication using an API key. Include your API key in the Authorization header:
Authorization: Bearer YOUR_API_KEY

Submit Voice Clone Request

Endpoint

POST /api/v1/apikey/requests

Request Format

curl -X POST "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/requests" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "minimax-audio-voice-clone-speech-2.6-turbo",
    "payload": {
      "text": "Hello! This is my cloned voice speaking.",
      "source_audio": "https://your_reference_voice.mp3",
      "voice_id": "my_custom_voice_001",
      "prompt_audio": "https://your_prompt_aduio.mp3",
      "prompt_text": "The transcript corresponding to the sample audio. It must match the audio content, and end with punctuation.",
      "need_noise_reduction": true,
      "need_volumn_normalization": true
    }
  }'

Request Parameters

ParameterTypeRequiredDescriptionDefaultConstraints
modelstringYesModel identifier-"minimax-audio-voice-clone-speech-2.6-turbo"
payload.textstringYesText content to be synthesized using the cloned voice-Required, non-empty string
payload.source_audiostringYesURL of the source audio file for voice cloning. Backend downloads automatically.-Valid HTTP/HTTPS URL. Supported formats: mp3, m4a, wav
payload.voice_idstringNoThe voice_id of the cloned voice. Length range:[8:256], must start with an English letter, must not duplicatedAuto-generated (request ID)Alphanumeric string, underscores allowed
payload.prompt_audiostringNoURL of the prompt audio file. Defines speaking style/emotion. Must be used with prompt_text(less than 8s).-Valid HTTP/HTTPS URL. Supported formats: mp3, m4a, wav, flac
payload.prompt_textstringNoDescription of the prompt audio (e.g., “This voice sounds natural and pleasant”)-Required if prompt_audio is provided
payload.need_noise_reductionbooleanNoApply noise reduction to the generated audiofalsetrue or false
payload.need_volumn_normalizationbooleanNoApply volume normalization to the generated audiofalsetrue or false

Response

Voice Clone is synchronous and returns the result immediately (typically within 5-15 seconds).
{
    "request_id": "5c30b275-d669-4a25-8151-de6d60214853",
    "model": "minimax-audio-voice-clone-speech-2.6-turbo",
    "status": "success",
    "created_at": 1762215580,
    "updated_at": 1762215606,
    "queued_at": 1762215580
}

Check Request Status

Endpoint

GET /api/v1/apikey/requests/{request_id}

Example

curl -X GET "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/requests/your_request_id" \
  -H "Authorization: Bearer YOUR_API_KEY"

Response

{
    "request_id": "5c3abcde-d669-4a25-8151-de6d602abcde",
    "org_id": "637abcde-1870-4ff4-80b0-d485980abcde",
    "user_id": "5edabcde-4eed-4dcd-b732-996a956abcde",
    "model": "minimax-audio-voice-clone-speech-2.6-turbo",
    "status": "success",
    "is_public": false,
    "payload": {
        "need_noise_reduction": true,
        "need_volumn_normalization": true,
        "source_audio": "https://your_source_audio.wav",
        "text": "Attention! This is a test.",
        "voice_id": "thisistestNov"
    },
    "outcome": {
        "media_urls": [
            {
                "id": "0",
                "url": "https://storage.googleapis.com/your_cloned_result.mp3"
            }
        ],
        "voice_id": ""
    },
    "created_at": 1762215580,
    "updated_at": 1762215606,
    "queued_at": 1762215580
}

Request Status Values

Voice Clone is synchronous, so the response will immediately return one of these statuses:
StatusDescription
successVoice cloning completed successfully
failedVoice cloning failed (see error message)

List Your Requests

Endpoint

GET /api/v1/ie/requestqueue/apikey/requests?model_id=minimax-audio-voice-clone-speech-2.6-turbo

Example

curl -X GET "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/requests?model_id=minimax-audio-voice-clone-speech-2.6-turbo" \
  -H "Authorization: Bearer YOUR_API_KEY"

Get Model Information

Endpoint

GET /api/v1/ie/requestqueue/apikey/models/minimax-audio-voice-clone-speech-2.6-turbo

Example

curl -X GET "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/models/minimax-audio-voice-clone-speech-2.6-turbo" \
  -H "Authorization: Bearer YOUR_API_KEY"

List Available Models

Endpoint

GET /api/v1/apikey/models

Example

curl -X GET "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/models" \
  -H "Authorization: Bearer YOUR_API_KEY"

Response

{
  "model_ids": [
    "minimax-audio-voice-clone-speech-2.6-hd",
    "minimax-audio-voice-clone-speech-2.6-turbo"
  ]
}

Example Use Cases

Basic Voice Clone (Minimal Required Parameters)

curl -X POST "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/requests" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "minimax-audio-voice-clone-speech-2.6-turbo",
    "payload": {
      "text": "Welcome to the world of AI-powered voice cloning!",
      "source_audio": "https://storage.googleapis.com/my-bucket/reference-voice.mp3"
    }
  }'

Advanced Voice Clone (All Parameters)

curl -X POST "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/requests" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "minimax-audio-voice-clone-speech-2.6-turbo",
    "payload": {
      "text": "This message is delivered with a warm, friendly tone that makes listeners feel welcomed and valued.",
      "source_audio": "https://storage.googleapis.com/my-bucket/reference-voice.mp3",
      "voice_id": "friendly_host_voice1",
      "prompt_audio": "https://storage.googleapis.com/my-bucket/friendly-style.mp3",
      "prompt_text": "The transcript corresponding to the sample audio. It must match the audio content, and end with punctuation.",
      "need_noise_reduction": true,
      "need_volumn_normalization": true
    }
  }'