Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.gmicloud.ai/llms.txt

Use this file to discover all available pages before exploring further.

Model ID
elevenlabs-tts-v3

ElevenLabs TTS v3 API Usage Guide

Overview

Key Features:

  • 10 Preset Voices: Curated selection of professional voices for various use cases
  • Advanced Voice Control: Fine-tune stability, similarity, style, and speed for personalized output
  • Multi-Language Support: Native support for 29+ languages including English, Spanish, French, German, Polish, and more
  • High-Quality Audio: Multiple output formats (MP3, PCM) with adjustable sample rates and bitrates
  • Speaker Boost: Enhanced voice clarity and presence
  • Deterministic Generation: Set seed for reproducible results

Authentication

All API requests require authentication using an API key. Include your API key in the Authorization header:
Authorization: Bearer YOUR_API_KEY

Submit Video Generation Request

Base URL

https://console.gmicloud.ai

Endpoint

POST /api/v1/ie/requestqueue/apikey/requests

Request Format

curl -X POST "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/requests" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "elevenlabs-tts-v3",
    "payload": {
      "text": "Welcome to the world of natural-sounding speech synthesis.",
      "voice_id": "21m00Tcm4TlvDq8ikWAM",
      "stability": 0.5,
      "similarity_boost": 0.75,
      "style": 0.0,
      "speed": 1.0,
      "use_speaker_boost": true,
      "output_format": "mp3_44100_128",
      "seed": 0,
      "apply_text_normalization": "auto"
    }
  }'

Request Parameters

ParameterTypeRequiredDescriptionDefaultConstraints
textstringYesText content to be converted to speech.-Required
voice_idstringNoVoice ID for speech synthesis. Each voice has unique characteristics and accents.”21m00Tcm4TlvDq8ikWAM”See Voice ID List below
stabilityenumNoVoice stability mode for eleven_v3 model.0.5Options: 0.0 (Creative), 0.5 (Natural), 1.0 (Robust)
similarity_boostfloatNoSimilarity boost enhances the resemblance to the original voice. Higher values make the voice closer to the original.0.75Range: 0.0–1.0, Step: 0.05
stylefloatNoStyle exaggeration. Higher values amplify the voice’s characteristic style.0.0Range: 0.0–1.0, Step: 0.05
speedfloatNoSpeech speed multiplier. 1.0 is normal speed.1.0Range: 0.5–2.0, Step: 0.1
use_speaker_boostbooleanNoEnable speaker boost for enhanced voice clarity and presence.trueOptional
output_formatstringNoOutput audio format. MP3 formats use lossy compression, PCM formats are uncompressed. Higher bitrates and sample rates require higher subscription tiers.”mp3_44100_128”See Output Format Options below
seedintegerNoRandom seed for deterministic generation. Same seed with same parameters produces same result.0Range: 0–4294967295
apply_text_normalizationstringNoText normalization mode. Auto lets the system decide, On always normalizes (e.g., spelling out numbers), Off skips normalization.”auto”Options: “auto”, “on”, “off”

Available Voice IDs

Voice IDDisplay NameUse Case
21m00Tcm4TlvDq8ikWAM RachelConversational, Narration
29vD33N1CtxCmqQRPOHJDrewNews, Professional
2EiwWnXFnvU5JabPnv8nClydeCharacter, Animation
5Q0t7uMcjvnagumLfvZiPaulNews, Broadcasting
9BWtsMINqrJLrRacOk9xAriaEducational, Tutorial
AZnzlk1XvdvUeBnXmlldDomiNarration, Audiobooks
CYw3kZ02Hs0563khs1FjDaveBritish Accent, Professional
CwhRBWXzGAHq8TQ4Fs17RogerConversational, Casual
D38z5RcWu1voky8WS1jaFinIrish Character, Storytelling
EXAVITQu4vr4xnSDxMaLSarahEntertainment, Upbeat

Output Format Options

Format ValueDisplay NameDescription
mp3_22050_32MP3 22.05kHz 32kbpsLow quality, smallest file size
mp3_44100_32MP3 44.1kHz 32kbpsStandard quality, low bitrate
mp3_44100_64MP3 44.1kHz 64kbpsStandard quality, medium bitrate
mp3_44100_96MP3 44.1kHz 96kbpsGood quality, medium bitrate
mp3_44100_128MP3 44.1kHz 128kbpsHigh quality (default)
mp3_44100_192MP3 44.1kHz 192kbpsVery high quality
pcm_16000PCM 16kHzUncompressed, 16kHz
pcm_22050PCM 22.05kHzUncompressed, 22.05kHzl
pcm_24000PCM 24kHzUncompressed, 24kHz
pcm_44100PCM 44.1kHzUncompressed, CD quality
pcm_48000PCM 48kHzUncompressed, studio quality

Response

{
  "request_id": "7eaa77fc-bc67-4021-9f1b-96b3fd832314",
  "model": "elevenlabs-tts-v3",
  "status": "queued",
  "created_at": 1761763441,
  "updated_at": 1761763441,
  "queued_at": 1761763441
}

Check Request Status

Endpoint

GET /api/v1/ie/requestqueue/apikey/requests/{request_id}

Example

curl -X GET "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/requests/7eaa77fc-bc67-4021-9f1b-96b3fd832314" \
  -H "Authorization: Bearer YOUR_API_KEY"

Response

{
  "request_id": "7eaa77fc-bc67-4021-9f1b-96b3fd832314",
  "model": "elevenlabs-tts-v3",
  "status": "success",
  "payload": {
    "apply_text_normalization": "auto",
    "output_format": "mp3_44100_128",
    "seed": 0,
    "similarity_boost": 0.75,
    "speed": 1,
    "stability": 0.5,
    "style": 0,
    "text": "Welcome to the world of natural-sounding speech synthesis.",
    "use_speaker_boost": true,
    "voice_id": "21m00Tcm4TlvDq8ikWAM"
  },
  "outcome": {
    "audio_url": "https://storage.googleapis.com/.../your_tts_result.mp3",
    "media": [
        {
            "type": "audio",
            "url": "https://storage.googleapis.com/.../your_tts_result.mp3"
        }
    ],
    "status": "audio_generated_successfully"
  },
  "created_at": 1761763441,
  "updated_at": 1761763451,
  "queued_at": 1761763441
}

Request Status Values

StatusDescription
queuedRequest is waiting to be processed
processingVideo generation is in progress
successVideo generation completed successfully
failedVideo generation failed
cancelledRequest was cancelled

List Your Requests

Endpoint

GET api/v1/ie/requestqueue/apikey/requests?model_id=elevenlabs-tts-v3

Example

curl -X GET "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/requests?model_id=elevenlabs-tts-v3" \
  -H "Authorization: Bearer YOUR_API_KEY"

Get Model Information

Endpoint

GET /api/v1/ie/requestqueue/apikey/models/elevenlabs-tts-v3

Example

curl -X GET "https://api.example.com/api/v1/apikey/models/elevenlabs-tts-v3" \
  -H "Authorization: Bearer YOUR_API_KEY"

List Available Models

Endpoint

GET /api/v1/apikey/models

Example

curl -X GET "https://api.example.com/api/v1/apikey/models" \
  -H "Authorization: Bearer YOUR_API_KEY"

Response

{
    "model_ids": [
        "elevenlabs-tts-v3",
        "other-model-1",
        "other-model-2"
    ]
}

Audio Specifications

  • Sample Rates: 16 kHz, 22.05 kHz, 24 kHz, 44.1 kHz, 48 kHz
  • Formats: MP3 (lossy compression), PCM (uncompressed)
  • Bitrates (MP3): 32 kbps, 64 kbps, 96 kbps, 128 kbps, 192 kbps
  • Output: Downloadable via signed URL

Example Use Cases

Basic TTS (Minimal Required Parameters)

curl -X POST "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/requests" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "elevenlabs-tts-v3",
    "payload": {
      "text": "Welcome to the future of speech synthesis with ElevenLabs.",
      "voice_id": "21m00Tcm4TlvDq8ikWAM"
    }
  }'

Advanced TTS (All Parameters)

curl -X POST "https://console.gmicloud.ai/api/v1/ie/requestqueue/apikey/requests" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "elevenlabs-tts-v3",
    "payload": {
      "text": "This is a comprehensive example showcasing all available parameters for professional-grade speech synthesis with full voice control.",
      "voice_id": "29vD33N1CtxCmqQRPOHJ",
      "stability": 1.0,
      "similarity_boost": 0.8,
      "style": 0.2,
      "speed": 1.1,
      "use_speaker_boost": true,
      "output_format": "mp3_44100_192",
      "seed": 12345,
      "apply_text_normalization": "on"
    }
  }'