Learn about the models that power the ElevenLabs API.
Model ID | Description | Languages |
---|---|---|
eleven_v3 | Human-like and expressive speech generation | 70+ languages |
eleven_ttv_v3 | Human-like and expressive voice design model (Text to Voice) | 70+ languages |
eleven_multilingual_v2 | Our most lifelike model with rich emotional expression | en , ja , zh , de , hi , fr , ko , pt , it , es , id , nl , tr , fil , pl , sv , bg , ro , ar , cs , el , fi , hr , ms , sk , da , ta , uk , ru |
eleven_flash_v2_5 | Ultra-fast model optimized for real-time use (~75ms†) | All eleven_multilingual_v2 languages plus: hu , no , vi |
eleven_flash_v2 | Ultra-fast model optimized for real-time use (~75ms†) | en |
eleven_turbo_v2_5 | High quality, low-latency model with a good balance of quality and speed (~250ms-300ms) | en , ja , zh , de , hi , fr , ko , pt , it , es , id , nl , tr , fil , pl , sv , bg , ro , ar , cs , el , fi , hr , ms , sk , da , ta , uk , ru , hu , no , vi |
eleven_turbo_v2 | High quality, low-latency model with a good balance of quality and speed (~250ms-300ms) | en |
eleven_multilingual_sts_v2 | State-of-the-art multilingual voice changer model (Speech to Speech) | en , ja , zh , de , hi , fr , ko , pt , it , es , id , nl , tr , fil , pl , sv , bg , ro , ar , cs , el , fi , hr , ms , sk , da , ta , uk , ru |
eleven_multilingual_ttv_v2 | State-of-the-art multilingual voice designer model (Text to Voice) | en , ja , zh , de , hi , fr , ko , pt , it , es , id , nl , tr , fil , pl , sv , bg , ro , ar , cs , el , fi , hr , ms , sk , da , ta , uk , ru |
eleven_english_sts_v2 | English-only voice changer model (Speech to Speech) | en |
scribe_v1 | State-of-the-art speech recognition model | 99 languages |
scribe_v1_experimental | State-of-the-art speech recognition model with experimental features: improved multilingual performance, reduced hallucinations during silence, fewer audio tags, and better handling of early transcript termination | 99 languages |
Older Models
Model ID | Description | Languages |
---|---|---|
eleven_monolingual_v1 | First generation TTS model (outclassed by v2 models) | en |
eleven_multilingual_v1 | First multilingual model (outclassed by v2 models) | en , fr , de , hi , it , pl , pt , es |
eleven_v3
model ID. The Text to Dialogue API defaults to using the v3 model. Alternatively you can select a preview version which is formatted as eleven_v3_preview_YYYY_MM_DD
. When a preview version has been evaluated and is ready for production, it will be promoted to the eleven_v3
model ID. Use the evergreen eleven_v3
model ID for the most stable experience and the preview version for the latest features.
Text normalization with numbers
apply_text_normalization
parameter to “on” in your request.The Multilingual v2 model does a better job of normalizing numbers, so we recommend using it for phone numbers and other cases where number normalization is important.For low-latency or Conversational AI applications, best practice is to have your LLM normalize the text before passing it to the TTS model, or use the apply_text_normalization
parameter (Enterprise plans only for v2.5 models).Requirements
eleven_multilingual_v2
Best for high-fidelity audio output with rich emotional expressioneleven_multilingual_v2
or eleven_flash_v2_5
Both support up to 32 languageseleven_turbo_v2_5
Good balance between quality and speedUse case
eleven_multilingual_v2
Ideal for professional content, audiobooks & video narration.eleven_flash_v2_5
, eleven_flash_v2
, eleven_multilingual_v2
, eleven_turbo_v2_5
or eleven_turbo_v2
Perfect for real-time conversational applicationseleven_multilingual_sts_v2
Specialized for Speech-to-Speech conversionModel ID | Character limit | Approximate audio duration |
---|---|---|
eleven_flash_v2_5 | 40,000 | ~40 minutes |
eleven_flash_v2 | 30,000 | ~30 minutes |
eleven_turbo_v2_5 | 40,000 | ~40 minutes |
eleven_turbo_v2 | 30,000 | ~30 minutes |
eleven_multilingual_v2 | 10,000 | ~10 minutes |
eleven_multilingual_v1 | 10,000 | ~10 minutes |
eleven_english_sts_v2 | 10,000 | ~10 minutes |
eleven_english_sts_v1 | 10,000 | ~10 minutes |
Plan | Concurrency Limit (Multilingual v2) | Concurrency Limit (Turbo & Flash) | STT Concurrency Limit | Priority level |
---|---|---|---|---|
Free | 2 | 4 | 10 | 3 |
Starter | 3 | 6 | 15 | 4 |
Creator | 5 | 10 | 25 | 5 |
Pro | 10 | 20 | 50 | 5 |
Scale | 15 | 30 | 75 | 5 |
Business | 15 | 30 | 75 | 5 |
Enterprise | Elevated | Elevated | Elevated | Highest |
current-concurrent-requests
and maximum-concurrent-requests
which you can use to monitor your concurrency.
How endpoint requests are made impacts concurrency limits: