OpenAI TTS voices 2026 — 6 options vs EasyVoice's 56-voice catalog
OpenAI's text-to-speech API offers six voices in 2026: alloy, echo, fable, onyx, nova, and shimmer. All six are English-primary with limited multilingual capability. EasyVoice's 2026 voice catalog has 56 voices: 46 multilingual Kokoro-82M voices across 8 languages (American English, British English, Spanish, French, Hindi, Italian, Japanese, and Portuguese), and 10 dedicated Arabic MSA (Modern Standard Arabic) voices from the Supertonic model — 56 voices across 9 languages total. The breadth difference — 6 voices versus 56 — is not cosmetic. It represents a fundamental architectural choice: a narrow set of premium voices versus a broad roster designed for multilingual coverage and tonal variety. This page catalogs the full EasyVoice voice roster as of 2026, maps the six OpenAI voices to their closest EasyVoice equivalents, and explains when voice breadth matters for your TTS application.
5,000 characters per day free, no credit card. Pro $9.99/mo unlimited vs OpenAI $15/1M (tts-1) / $30/1M (tts-1-hd).
How many voices does EasyVoice have in 2026?
EasyVoice has 56 voices as of 2026: 46 multilingual voices powered by the Kokoro-82M model across 8 languages, and 10 Arabic-language voices powered by the Supertonic model — 56 voices across 9 languages total. The 46 Kokoro voices cover American English (20 voices: 11 female af_ + 9 male am_), British English (8 voices: 4 female bf_ + 4 male bm_), Spanish (3), French (1), Hindi (4), Italian (2), Japanese (5), and Portuguese (3). The 10 Supertonic voices are dedicated Arabic MSA (Modern Standard Arabic) voices — male and female — representing a separate purpose-built Arabic TTS engine rather than a multilingual model with Arabic capability bolted on.
The six OpenAI voices — alloy, echo, fable, onyx, nova, shimmer — are all English-primary. OpenAI offers no dedicated Arabic voice; its multilingual capability is best-effort English-model generalization to other languages. For Arabic, Japanese, or Hindi TTS at production quality, EasyVoice's language-specific voices represent a meaningful quality difference, not just more options.
The 2026 EasyVoice voice catalog — language and count matrix
By language and voice count as of 2026-06-30 — the 46 Kokoro-82M multilingual voices span 8 accents (American English carries the deepest bench at 20 voices across neutral, warm, energetic, and authoritative registers), and the 10 Arabic MSA voices run on a dedicated Supertonic engine (5 male / 5 female). OpenAI TTS ships six English-primary voices (alloy, echo, fable, onyx, nova, shimmer) with no language-specific additions announced as of 2026-06-30, so the 56-vs-6 breadth gap is a product decision on EasyVoice and a hard limit on OpenAI:
EasyVoice 2026 voice catalog by language (46 Kokoro-82M multilingual + 10 Arabic Supertonic = 56 voices across 9 languages).
| Language | Voices | Engine |
|---|---|---|
| American English | 20 | Kokoro-82M |
| British English | 8 | Kokoro-82M |
| Spanish | 3 | Kokoro-82M |
| French | 1 | Kokoro-82M |
| Hindi | 4 | Kokoro-82M |
| Italian | 2 | Kokoro-82M |
| Japanese | 5 | Kokoro-82M |
| Portuguese | 3 | Kokoro-82M |
| Arabic (MSA) | 10 | Supertonic |
| Total — 9 languages | 56 | 2 engines |
OpenAI voice to EasyVoice mapping — 2026 catalog
If you're migrating from OpenAI TTS, the six OpenAI voices map to EasyVoice equivalents as follows: alloy → af_alloy (warm female, closest direct sonic match); echo → am_echo (neutral American male, news-reader register); fable → bm_fable (British male storytelling, Received Pronunciation); onyx → am_onyx (deep baritone, cinematic register); nova → af_nova (bright female, expressive); shimmer → bf_lily (soft female, airy British register).
Beyond the six direct equivalents, EasyVoice's catalog includes voices without an OpenAI analog: af_heart (warm American female, optimized for long-form narration), am_adam (authoritative American male), bm_lewis (contemporary British male), and 40+ additional voices across all supported languages. A developer migrating from OpenAI starts with the six direct matches and then has access to 50 additional options to optimize for tone, register, and language.
Arabic MSA — 10 dedicated voices vs OpenAI's generalized approach
The 10 Arabic voices in EasyVoice's 2026 catalog are powered by a separate synthesis engine (Supertonic) specifically trained for Modern Standard Arabic. The voices are labeled ar_m1 through ar_m5 (male) and ar_f1 through ar_f5 (female). These are not Kokoro-82M voices with Arabic language code — they are a different model architecture with different synthesis characteristics, optimized for Arabic phonology and prosody.
OpenAI's TTS API does not include Arabic-specific voices. Its multilingual capability applies the English-trained model to Arabic text, which produces intelligible output for many use cases but misses Arabic prosodic patterns (sentence-final lengthening, emphatic consonant emphasis, vowel reduction patterns) that native-speaker Arabic voice synthesis handles correctly. For Arabic IVR, Arabic content narration, or Arabic customer service, the quality difference between a purpose-built Arabic voice and a multilingual English model doing its best is perceptible to native-speaker listeners.
How often is the EasyVoice voice catalog updated?
The EasyVoice voice catalog as documented here reflects the production catalog as of 2026-06-30. Voice additions are published when new Kokoro model weights become available or when new language-specific engines pass production quality gates. The Arabic Supertonic voices were added as a dedicated engine (replacing the earlier best-effort Arabic output from Kokoro) after the Phase 16 Arabic bake-off verified Supertonic's WER superiority for MSA.
Switch to EasyVoice if you need Arabic, Japanese, Hindi, Portuguese, French, Italian, or Spanish native-speaker-quality voices, or if voice breadth and tonal variety matter for your application — 56 voices across 9 languages give you meaningful per-use-case optimization that 6 options cannot. Stay on OpenAI if English is your only language and you need one of the six specific OpenAI voices with no migration cost — the six direct EasyVoice equivalents are close but not bit-for-bit identical to OpenAI's proprietary voice models.
Voice mapping table — OpenAI → EasyVoice
Click any EasyVoice voice name to reach its dedicated voice page with audio preview. All six voices below are on the free tier.
| OpenAI voice | EasyVoice equivalent | Tone characterization | When to use |
|---|---|---|---|
alloy | Alloyaf_alloy | Warm female mid-range — the closest direct sonic match in the EasyVoice catalog. Bright but unhurried, identical register to OpenAI's default voice. | Default narration, accessibility read-aloud, neutral chatbot responses, product onboarding voiceover. Use this as your starting-point swap if your app was built on OpenAI's default voice and you don't want to change personality during migration. |
echo | Echoam_echo | Neutral American male, slightly above OpenAI baseline pitch. Clean and unornamented — the 'news reader' register rather than the audiobook narrator. | News-style narration, dispatcher / IVR prompts, transactional confirmations (booking confirmations, order updates), enterprise B2B onboarding. Pair with am_adam for variety when one product needs two male voices. |
fable | Fablebm_fable | British male storytelling voice — measured pace, slight literary lilt, Received Pronunciation. The natural fable-narrator fit OpenAI's voice was named for. | Audiobook narration, fiction read-aloud, podcast intros that want a British register, history and documentary voiceover, children's stories. For a contemporary British male with less literary register, see bm_lewis. |
onyx | Onyxam_onyx | Deep American male baritone — the heaviest voice in either vendor's catalog. Slow cadence, dramatic weight, cinematic register. | Movie-trailer-style ads, dramatic narration, video game cutscene voiceover, brand films, gravitas-heavy product launches. Avoid for conversational chatbot use (too dramatic for short prompts); pair with am_michael for variety in the same product. |
nova | Novaaf_nova | Bright female with energy — matches the 'nova' brand signal: confident, modern, slightly higher placed than af_alloy. The most expressive female in the OpenAI-equivalent set. | Social media ads, hook-driven short-form video voiceover, energetic product demos, marketing email audio versions, YouTube channel intros for lifestyle / fitness / tech reviewer formats. |
shimmer | Lilybf_lily | Soft female with airy upper register — captures the 'shimmer' brightness OpenAI's voice was named for, but with a subtle British placement that lends warmth without losing brightness. | Meditation and wellness apps, ASMR-adjacent narration, lullabies and bedtime stories, gentle product onboarding for premium consumer apps, lifestyle podcast cold opens. For an American-female alternative with similar airiness, try af_jessica. |
Sample scripts for blind A/B testing
Paste each into both vendor APIs (OpenAI's audio.speech.create and EasyVoice's /api/v1/audio/speech) to compare blind.
alloy ↔ af_alloy (Alloy)"Welcome to your daily summary. Here are the three updates worth your attention this morning, in two minutes or less."
echo ↔ am_echo (Echo)"Your appointment is confirmed for Thursday at three p.m. We've sent the details to your inbox. Reply 'reschedule' to change the time."
fable ↔ bm_fable (Fable)"It was the kind of morning the village had not seen in a generation — quiet, golden, and so still that the bells of St. Mary's seemed to sound from another century."
onyx ↔ am_onyx (Onyx)"In a world where every second counts, the difference between hesitation and action is measured in heartbeats. This is your moment."
nova ↔ af_nova (Nova)"Three things you didn't know your phone could do — number two will save you ten minutes every morning, no joke."
shimmer ↔ bf_lily (Lily)"Take a slow breath in… and let it go. You're exactly where you need to be. Listen for the next two minutes — that's all this takes."
Voices to try on the free tier
Every voice below is callable via the same voice parameter — preview audio samples and read the full character profile.
Frequently asked questions
How many voices does EasyVoice have in 2026?▾
EasyVoice has 56 voices in 2026: 46 multilingual voices powered by Kokoro-82M across 8 languages (American English 20, British English 8, Spanish 3, French 1, Hindi 4, Italian 2, Japanese 5, Portuguese 3) and 10 dedicated Arabic MSA voices powered by the Supertonic model (5 male, 5 female). OpenAI TTS offers 6 voices, all English-primary, with no dedicated Arabic, Hindi, Japanese, or Portuguese options.
Which EasyVoice voice is closest to OpenAI 'nova'?▾
af_nova is the closest EasyVoice match to OpenAI nova — bright female voice with energy and expression, the most expressive female option in the OpenAI-equivalent set. The match is directional: af_nova captures the nova register (upbeat, modern, slightly higher-placed than neutral) but is produced by the Kokoro-82M model rather than OpenAI's proprietary synthesis stack, so the voice is similar in character, not bit-for-bit identical.
Are the 10 Arabic voices the same model as the multilingual voices?▾
No. The 10 Arabic voices (ar_m1 through ar_m5 male, ar_f1 through ar_f5 female) use the Supertonic synthesis engine, which is a separate model specifically trained for Modern Standard Arabic phonology and prosody. The 46 multilingual voices use the Kokoro-82M engine. Supertonic handles Arabic consonant emphasis, vowel patterns, and sentence-final prosody patterns that a generalist multilingual model cannot replicate at the same quality level.
How often is the EasyVoice voice catalog updated?▾
The catalog is updated when new Kokoro model weights become available or when new language-specific engines pass production quality gates. The 2026 catalog reflects the production state as of 2026-06-30 (46 Kokoro multilingual + 10 Supertonic Arabic = 56 total). The Arabic Supertonic voices replaced a prior best-effort Arabic output from the multilingual Kokoro model after a formal quality bake-off confirmed Supertonic's superiority for MSA synthesis.
Does EasyVoice support Spanish and French TTS in 2026?▾
Yes. EasyVoice's 2026 catalog includes 3 Spanish voices (ef_dora, em_alex, em_santa), 1 French voice (ff_siwis), 4 Hindi voices, and 3 Portuguese voices, all powered by Kokoro-82M. These are native-quality voices for each language — not English-model generalization. OpenAI TTS supports these languages via its multilingual capability on the same six English-primary voices, without language-specific voice options.
Related OpenAI migration guides
OpenAI TTS voices, mapped to free Kokoro alternatives
Map OpenAI's 6 voices (alloy, echo, fable, onyx, nova, shimmer) to closest EasyVoice Kokoro voices. Side-by-side tones, when-to-use, sample scripts. Free.
OpenAI TTS API reference — audio.speech.create mapped to EasyVoice
OpenAI TTS API reference mapped to EasyVoice. audio.speech.create params, error codes 401/400/429/5xx, request/response shapes. OpenAI-compatible endpoint.
Migrate from OpenAI TTS to EasyVoice in 5 lines
OpenAI TTS to EasyVoice migration guide: 5-line code diff in Python + JS. Model, voice, response_format mapping. Streaming compatible. $9.99 flat vs $15/1M.
Vendor comparison: EasyVoice vs OpenAI TTS
Side-by-side feature comparison covering voices, languages, pricing tiers, free limits, API surface, and the why-people-look / where-each-wins breakdown.
Developer-focused OpenAI migration in /tts-api
The developer-onboarding angle of the same migration — request body compatibility deep-dive, streaming behavior, ChatGPT plugin/Realtime API guidance, and the official OpenAI SDK constraint.
Start migrating off OpenAI TTS today
5,000 characters per day free, no credit card. Pro $9.99/mo unlimited replaces OpenAI's $15-$300/mo bills once you cross 666K characters per month.