Documentation Index
Fetch the complete documentation index at: https://docs.gmicloud.ai/llms.txt
Use this file to discover all available pages before exploring further.
Model ID
minimax-audio-voice-clone-speech-2.6-hd
Minimax Audio Voice Clone API Usage Guide
Overview
Minimax Audio Voice Clone allows you to clone any voice from an audio sample and use it to generate custom speech. Simply provide URLs to your audio files, and the system will automatically handle downloading, processing, and voice cloning. The cloned voice can then speak any text you provide.
Key Features:
- Synchronous Operation: Get results immediately in 5-15 seconds
- URL-Based Input: Provide audio URLs, backend handles all processing
- Style Control: Optional prompt audio to define speaking style, tone, and emotion
- Audio Enhancement: Built-in noise reduction and volume normalization
- High-Quality Audio: Supports MP3, M4A, and WAV formats
Authentication
All API requests require authentication using an API key. Include your API key in the Authorization header:
Authorization: Bearer YOUR_API_KEY
Submit Voice Clone Request
Endpoint
POST /api/v1/apikey/requests
curl -X POST "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/requests" \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "minimax-audio-voice-clone-speech-2.6-hd",
"payload": {
"text": "Hello! This is my cloned voice speaking.",
"source_audio": "https://your_reference_voice.mp3",
"voice_id": "my_custom_voice_001",
"prompt_audio": "https://your_prompt_aduio.mp3",
"prompt_text": "The transcript corresponding to the sample audio. It must match the audio content, and end with punctuation.",
"need_noise_reduction": true,
"need_volumn_normalization": true
}
}'
Request Parameters
| Parameter | Type | Required | Description | Default | Constraints |
|---|
model | string | Yes | Model identifier | - | "minimax-audio-voice-clone-speech-2.6-hd" |
payload.text | string | Yes | Text content to be synthesized using the cloned voice | - | Required, non-empty string |
payload.source_audio | string | Yes | URL of the source audio file for voice cloning. Backend downloads automatically. | - | Valid HTTP/HTTPS URL. Supported formats: mp3, m4a, wav |
payload.voice_id | string | No | The voice_id of the cloned voice. Length range:[8:256], must start with an English letter, must not duplicated | Auto-generated (request ID) | Alphanumeric string, underscores allowed |
payload.prompt_audio | string | No | URL of the prompt audio file. Defines speaking style/emotion. Must be used with prompt_text(less than 8s). | - | Valid HTTP/HTTPS URL. Supported formats: mp3, m4a, wav, flac |
payload.prompt_text | string | No | Description of the prompt audio (e.g., “This voice sounds natural and pleasant”) | - | Required if prompt_audio is provided |
payload.need_noise_reduction | boolean | No | Apply noise reduction to the generated audio | false | true or false |
payload.need_volumn_normalization | boolean | No | Apply volume normalization to the generated audio | false | true or false |
Response
Voice Clone is synchronous and returns the result immediately (typically within 5-15 seconds).
{
"request_id": "5c30b275-d669-4a25-8151-de6d60214853",
"model": "minimax-audio-voice-clone-speech-2.6-hd",
"status": "success",
"created_at": 1762215580,
"updated_at": 1762215606,
"queued_at": 1762215580
}
Check Request Status
Endpoint
GET /api/v1/apikey/requests/{request_id}
Example
curl -X GET "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/requests/your_request_id" \
-H "Authorization: Bearer YOUR_API_KEY"
Response
{
"request_id": "5c3abcde-d669-4a25-8151-de6d602abcde",
"org_id": "637abcde-1870-4ff4-80b0-d485980abcde",
"user_id": "5edabcde-4eed-4dcd-b732-996a956abcde",
"model": "minimax-audio-voice-clone-speech-2.6-hd",
"status": "success",
"is_public": false,
"payload": {
"need_noise_reduction": true,
"need_volumn_normalization": true,
"source_audio": "https://your_source_audio.wav",
"text": "Attention! This is a test.",
"voice_id": "thisistestNov"
},
"outcome": {
"media_urls": [
{
"id": "0",
"url": "https://storage.googleapis.com/your_cloned_result.mp3"
}
],
"voice_id": ""
},
"created_at": 1762215580,
"updated_at": 1762215606,
"queued_at": 1762215580
}
Request Status Values
Voice Clone is synchronous, so the response will immediately return one of these statuses:
| Status | Description |
|---|
success | Voice cloning completed successfully |
failed | Voice cloning failed (see error message) |
List Your Requests
Endpoint
GET /api/v1/ie/requestqueue/apikey/requests?model_id=minimax-audio-voice-clone-speech-2.6-hd
Example
curl -X GET "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/requests?model_id=minimax-audio-voice-clone-speech-2.6-hd" \
-H "Authorization: Bearer YOUR_API_KEY"
Endpoint
GET /api/v1/ie/requestqueue/apikey/models/minimax-audio-voice-clone-speech-2.6-hd
Example
curl -X GET "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/models/minimax-audio-voice-clone-speech-2.6-hd" \
-H "Authorization: Bearer YOUR_API_KEY"
List Available Models
Endpoint
GET /api/v1/apikey/models
Example
curl -X GET "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/models" \
-H "Authorization: Bearer YOUR_API_KEY"
Response
{
"model_ids": [
"minimax-audio-voice-clone-speech-2.6-hd",
"minimax-audio-voice-clone-speech-2.6-turbo"
]
}
Example Use Cases
Basic Voice Clone (Minimal Required Parameters)
curl -X POST "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/requests" \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "minimax-audio-voice-clone-speech-2.6-hd",
"payload": {
"text": "Welcome to the world of AI-powered voice cloning!",
"source_audio": "https://storage.googleapis.com/my-bucket/reference-voice.mp3"
}
}'
Advanced Voice Clone (All Parameters)
curl -X POST "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/requests" \
-H "Authorization: Bearer YOUR_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "minimax-audio-voice-clone-speech-2.6-hd",
"payload": {
"text": "This message is delivered with a warm, friendly tone that makes listeners feel welcomed and valued.",
"source_audio": "https://storage.googleapis.com/my-bucket/reference-voice.mp3",
"voice_id": "friendly_host_voice1",
"prompt_audio": "https://storage.googleapis.com/my-bucket/friendly-style.mp3",
"prompt_text": "The transcript corresponding to the sample audio. It must match the audio content, and end with punctuation.",
"need_noise_reduction": true,
"need_volumn_normalization": true
}
}'