ElevenLabs TTS v3 API Usage Guide
Overview
Key Features:
- 10 Preset Voices: Curated selection of professional voices for various use cases
- Advanced Voice Control: Fine-tune stability, similarity, style, and speed for personalized output
- Multi-Language Support: Native support for 29+ languages including English, Spanish, French, German, Polish, and more
- High-Quality Audio: Multiple output formats (MP3, PCM) with adjustable sample rates and bitrates
- Speaker Boost: Enhanced voice clarity and presence
- Deterministic Generation: Set a seed for reproducible results
Authentication
All API requests require authentication using an API key. Include your API key in the Authorization header.
Submit Text-to-Speech Request
Base URL
Endpoint
Request Format
Request Parameters
| Parameter | Type | Required | Description | Default | Constraints |
|---|---|---|---|---|---|
| text | string | Yes | Text content to be converted to speech. | - | Required |
| voice_id | string | No | Voice ID for speech synthesis. Each voice has unique characteristics and accents. | "21m00Tcm4TlvDq8ikWAM" | See Voice ID List below |
| stability | enum | No | Voice stability mode for the eleven_v3 model. | 0.5 | Options: 0.0 (Creative), 0.5 (Natural), 1.0 (Robust) |
| similarity_boost | float | No | Similarity boost enhances the resemblance to the original voice. Higher values make the voice closer to the original. | 0.75 | Range: 0.0–1.0, Step: 0.05 |
| style | float | No | Style exaggeration. Higher values amplify the voice's characteristic style. | 0.0 | Range: 0.0–1.0, Step: 0.05 |
| speed | float | No | Speech speed multiplier. 1.0 is normal speed. | 1.0 | Range: 0.5–2.0, Step: 0.1 |
| use_speaker_boost | boolean | No | Enable speaker boost for enhanced voice clarity and presence. | true | Optional |
| output_format | string | No | Output audio format. MP3 formats use lossy compression; PCM formats are uncompressed. Higher bitrates and sample rates require higher subscription tiers. | "mp3_44100_128" | See Output Format Options below |
| seed | integer | No | Random seed for deterministic generation. The same seed with the same parameters produces the same result. | 0 | Range: 0–4294967295 |
| apply_text_normalization | string | No | Text normalization mode. "auto" lets the system decide, "on" always normalizes (e.g., spelling out numbers), and "off" skips normalization. | "auto" | Options: "auto", "on", "off" |
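The constraints in the table above can be checked on the client before a request is sent. The following is a minimal sketch, not the official client: the helper name `build_tts_payload` is hypothetical, and it assumes the parameters are sent as a flat JSON object, which this guide does not confirm. The step sizes (0.05, 0.1) are not enforced here.

```python
from typing import Any, Dict

STABILITY_OPTIONS = (0.0, 0.5, 1.0)  # Creative, Natural, Robust
NORMALIZATION_MODES = ("auto", "on", "off")
MAX_SEED = 4294967295


def _check_range(name: str, value: float, low: float, high: float) -> None:
    """Raise ValueError if value falls outside the inclusive range."""
    if not low <= value <= high:
        raise ValueError(f"{name} must be between {low} and {high}, got {value}")


def build_tts_payload(
    text: str,
    voice_id: str = "21m00Tcm4TlvDq8ikWAM",
    stability: float = 0.5,
    similarity_boost: float = 0.75,
    style: float = 0.0,
    speed: float = 1.0,
    use_speaker_boost: bool = True,
    output_format: str = "mp3_44100_128",
    seed: int = 0,
    apply_text_normalization: str = "auto",
) -> Dict[str, Any]:
    """Hypothetical helper: validate the documented parameters and return
    them as a dict. The flat body shape is an assumption."""
    if not text:
        raise ValueError("text is required")
    if stability not in STABILITY_OPTIONS:
        raise ValueError(f"stability must be one of {STABILITY_OPTIONS}")
    _check_range("similarity_boost", similarity_boost, 0.0, 1.0)
    _check_range("style", style, 0.0, 1.0)
    _check_range("speed", speed, 0.5, 2.0)
    if not 0 <= seed <= MAX_SEED:
        raise ValueError(f"seed must be between 0 and {MAX_SEED}")
    if apply_text_normalization not in NORMALIZATION_MODES:
        raise ValueError(f"apply_text_normalization must be one of {NORMALIZATION_MODES}")
    return {
        "text": text,
        "voice_id": voice_id,
        "stability": stability,
        "similarity_boost": similarity_boost,
        "style": style,
        "speed": speed,
        "use_speaker_boost": use_speaker_boost,
        "output_format": output_format,
        "seed": seed,
        "apply_text_normalization": apply_text_normalization,
    }
```

Calling `build_tts_payload("Hello")` returns a dict populated with the documented defaults, while an unsupported value such as `stability=0.3` raises `ValueError` before any network call is made.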
Available Voice IDs
| Voice ID | Display Name | Use Case |
|---|---|---|
| 21m00Tcm4TlvDq8ikWAM | Rachel | Conversational, Narration |
| 29vD33N1CtxCmqQRPOHJ | Drew | News, Professional |
| 2EiwWnXFnvU5JabPnv8n | Clyde | Character, Animation |
| 5Q0t7uMcjvnagumLfvZi | Paul | News, Broadcasting |
| 9BWtsMINqrJLrRacOk9x | Aria | Educational, Tutorial |
| AZnzlk1XvdvUeBnXmlld | Domi | Narration, Audiobooks |
| CYw3kZ02Hs0563khs1Fj | Dave | British Accent, Professional |
| CwhRBWXzGAHq8TQ4Fs17 | Roger | Conversational, Casual |
| D38z5RcWu1voky8WS1ja | Fin | Irish Character, Storytelling |
| EXAVITQu4vr4xnSDxMaL | Sarah | Entertainment, Upbeat |
Output Format Options
| Format Value | Display Name | Description |
|---|---|---|
| mp3_22050_32 | MP3 22.05kHz 32kbps | Low quality, smallest file size |
| mp3_44100_32 | MP3 44.1kHz 32kbps | Standard quality, low bitrate |
| mp3_44100_64 | MP3 44.1kHz 64kbps | Standard quality, medium bitrate |
| mp3_44100_96 | MP3 44.1kHz 96kbps | Good quality, medium bitrate |
| mp3_44100_128 | MP3 44.1kHz 128kbps | High quality (default) |
| mp3_44100_192 | MP3 44.1kHz 192kbps | Very high quality |
| pcm_16000 | PCM 16kHz | Uncompressed, 16kHz |
| pcm_22050 | PCM 22.05kHz | Uncompressed, 22.05kHz |
| pcm_24000 | PCM 24kHz | Uncompressed, 24kHz |
| pcm_44100 | PCM 44.1kHz | Uncompressed, CD quality |
| pcm_48000 | PCM 48kHz | Uncompressed, studio quality |
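The format values above follow a `codec_samplerate[_bitrate]` naming pattern, so a client can derive the codec, sample rate, and bitrate from the string itself. A small sketch of that idea follows; `AudioFormat` and `parse_output_format` are hypothetical names introduced for illustration, and the set of accepted values is copied from the table.

```python
from typing import NamedTuple, Optional

# Format values listed in the Output Format Options table.
SUPPORTED_FORMATS = {
    "mp3_22050_32", "mp3_44100_32", "mp3_44100_64", "mp3_44100_96",
    "mp3_44100_128", "mp3_44100_192",
    "pcm_16000", "pcm_22050", "pcm_24000", "pcm_44100", "pcm_48000",
}


class AudioFormat(NamedTuple):
    codec: str
    sample_rate_hz: int
    bitrate_kbps: Optional[int]  # None for uncompressed PCM


def parse_output_format(value: str) -> AudioFormat:
    """Split a documented format value into its components."""
    if value not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported output format: {value!r}")
    parts = value.split("_")
    if parts[0] == "mp3":
        # mp3_<sample rate>_<bitrate>
        return AudioFormat("mp3", int(parts[1]), int(parts[2]))
    # pcm_<sample rate>
    return AudioFormat("pcm", int(parts[1]), None)
```

For example, `parse_output_format("mp3_44100_128")` yields a codec of `"mp3"`, a sample rate of 44100 Hz, and a bitrate of 128 kbps, while PCM values carry no bitrate.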
Response
Check Request Status
Endpoint
Example
Response
Request Status Values
| Status | Description |
|---|---|
| queued | Request is waiting to be processed |
| processing | Speech generation is in progress |
| success | Speech generation completed successfully |
| failed | Speech generation failed |
| cancelled | Request was cancelled |
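Because a request moves through `queued` and `processing` before reaching a final state, clients typically poll until one of the terminal values appears. The sketch below shows one way to structure that loop. The status endpoint itself is not reproduced in this guide, so the lookup is passed in as a caller-supplied `fetch_status` callable, and `wait_for_completion` is a hypothetical name.

```python
import time
from typing import Callable

PENDING_STATUSES = {"queued", "processing"}
TERMINAL_STATUSES = {"success", "failed", "cancelled"}


def wait_for_completion(
    fetch_status: Callable[[], str],
    interval_s: float = 2.0,
    timeout_s: float = 120.0,
) -> str:
    """Poll fetch_status() until it returns a terminal status.

    fetch_status is an assumption: it stands in for whatever call
    retrieves the current status string for one request.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        status = fetch_status()
        if status in TERMINAL_STATUSES:
            return status
        if status not in PENDING_STATUSES:
            raise ValueError(f"unexpected status: {status!r}")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"request still {status!r} after {timeout_s} seconds")
        time.sleep(interval_s)
```

The function returns the terminal status rather than raising on `failed` or `cancelled`, which leaves the decision about retries to the caller.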
List Your Requests
Endpoint
Example
Get Model Information
Endpoint
Example
List Available Models
Endpoint
Example
Response
Audio Specifications
- Sample Rates: 16 kHz, 22.05 kHz, 24 kHz, 44.1 kHz, 48 kHz
- Formats: MP3 (lossy compression), PCM (uncompressed)
- Bitrates (MP3): 32 kbps, 64 kbps, 96 kbps, 128 kbps, 192 kbps
- Output: Downloadable via signed URL
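The bitrates and sample rates listed above translate directly into approximate file sizes, which can help when choosing a format. This is a rough sketch under stated assumptions: MP3 output is treated as constant bitrate, and PCM is assumed to be 16-bit mono, neither of which this guide specifies. Both function names are hypothetical.

```python
def estimate_mp3_bytes(bitrate_kbps: int, duration_s: float) -> int:
    """Approximate MP3 size: bits per second divided by 8, times duration.
    Assumes constant bitrate and ignores container overhead."""
    return int(bitrate_kbps * 1000 / 8 * duration_s)


def estimate_pcm_bytes(
    sample_rate_hz: int,
    duration_s: float,
    bytes_per_sample: int = 2,  # assumption: 16-bit samples
    channels: int = 1,          # assumption: mono output
) -> int:
    """Approximate raw PCM size: samples per second times bytes per sample."""
    return int(sample_rate_hz * bytes_per_sample * channels * duration_s)
```

Under these assumptions, ten seconds of audio comes to about 160,000 bytes at `mp3_44100_128` and about 882,000 bytes at `pcm_44100`.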