Skip to main content

Documentation Index

Fetch the complete documentation index at: https://docs.beyondwords.io/llms.txt

Use this file to discover all available pages before exploring further.

Overview

BeyondWords allows you to generate a custom AI voice from a text prompt, then use this voice to create audio and video. This means you can create voices to perfectly suit your needs, without the need for recordings or approvals. All generated voices are powered by ElevenLabs.

Generate a voice

1

Go to organization settings

In the BeyondWords dashboard, click your avatar then select “Organization”.Then, open the “Custom voices” page.
2

Start creating a voice

Click ”+ Voice” → “Generate voice” → “Continue”. 
3

Enter a prompt

Enter the prompt to generate your voice. (See our voice prompting guidelines below.)Once complete, click “Continue”.
4

Add a custom preview script (optional)

To add a custom preview script, enable the toggle then enter your script. (See our preview script guidelines below.)Otherwise, leave the toggle disabled.Once complete, click “Generate voice”.
5

Review voice options

Once generation is complete, you’ll see three voice options. Click the play button next to each voice to hear a preview.Select your preferred voice, then click “Continue”.
If none of the options are suitable, go back to adjust your prompt and/or preview script.
6

Add voice details

Enter the following voice details: 
  • Voice name: Choose a label you will recognize when picking this voice in your projects
  • Language and accent: Choose how this voice should be labeled for language and accent (if relevant) in your projects
  • Gender: Male, female, or neutral
Once complete, click “Save voice”.
7

Preview your voice and set its scope

Use the preview section to hear how your voice sounds across different voice models, languages, and accents.Once you’re ready, click “Done”.
8

Use your generated voice

The generated voice will appear in the “Custom voices” table and in voice selection menus across all your projects.

Voice prompting guidelines

A well-written prompt and preview script are key to generating the best possible voice for your use case. The prompt should describe the key attributes of the voice you want to create. A good prompt is specific and consistent—vague or contradictory descriptions tend to produce unreliable results. Stick to concrete, observable qualities: how the voice sounds (pitch, tone, resonance), how it speaks (pace, rhythm, emphasis), and who it sounds like (persona, age, accent). Avoid abstract descriptors like “professional” or “engaging” that don’t translate into a distinct vocal quality. We recommend using the following template:
Native <Language and accent>. <Gender>, <Age>. <Quality level>.
Persona: <2–5 words>. Emotion: <2–3 adjectives>.
<1–2 sentences about tone, pacing, and pitch>
AttributeDescriptionExamples
Language and accentThe native language and accent that shape the voice’s characterNative American English, Native Brazilian Portuguese, Native Gulf Arabic
GenderThe perceived gender of the speakerFemale, male, gender-neutral
AgeThe age or age range of the speaker, which affects vocal maturity, texture, and energy20–25, middle-aged
Audio qualityHow polished and produced the voice soundsBroadcast-quality, studio-quality
PersonaThe role or identity of the speaker, which helps shape delivery style and intentNews anchor, financial correspondent, policy analyst
EmotionThe emotional delivery of the voiceCalm, composed, measured, authoritative, assured
ToneThe physical quality of the voice, shaped by pitch, resonance, and textureWarm and resonant, clear and crisp, smooth and neutral
PacingThe speed and rhythm of speechMeasured and deliberate, steady with natural rhythm, slightly brisk but clear
PitchHow high or low the voice soundsMid-range, slightly low-pitched, low and resonant
Broad accents (e.g., American English, Brazilian Portuguese) are more reliable than specific regional varieties (e.g., Glaswegian, Northeastern Brazilian). For specific accents, try voice cloning.

Example voice prompt

Here’s an example of a well-structured voice prompt:
Native English, American. Female, 35–45. Broadcast quality. Persona: seasoned journalist narrator. Emotion: composed, analytical, subtly wry. Warm, mid-pitched timbre with a clear, articulate delivery and precise diction. Speaks at a measured, deliberate pace with confident cadence and restrained emphasis, prioritizing clarity and insight over drama. Maintains a calm, authoritative tone with occasional light dryness, as if guiding the listener through complex ideas with quiet confidence.

Preview script guidelines

The preview script can influence the voice’s energy, pacing, and emotional delivery, so it’s important to use text that reflects your intended use case. We recommend writing around 80 words in the same style and language as your content. Consider copying and pasting a representative section from an existing article.

FAQs

You can generate voices in 74 languages: Afrikaans, Arabic, Armenian, Assamese, Azerbaijani, Belarusian, Bengali, Bosnian, Bulgarian, Catalan, Cebuano, Chichewa, Croatian, Czech, Danish, Dutch, English, Estonian, Filipino, Finnish, French, Galician, Georgian, German, Greek, Gujarati, Hausa, Hebrew, Hindi, Hungarian, Icelandic, Indonesian, Irish, Italian, Japanese, Javanese, Kannada, Kazakh, Kirghiz, Korean, Latvian, Lingala, Lithuanian, Luxembourgish, Macedonian, Malay, Malayalam, Mandarin Chinese, Marathi, Nepali, Norwegian, Pashto, Persian, Polish, Portuguese, Punjabi, Romanian, Russian, Serbian, Sindhi, Slovak, Slovenian, Somali, Spanish, Swahili, Swedish, Tamil, Telugu, Thai, Turkish, Ukrainian, Urdu, Vietnamese, and Welsh.Your language selection may limit your access to certain voice models.
Yes, generated voices are multilingual. They can speak all the languages available within your chosen voice model.
You can generate as many voices and voice previews as needed. Credits are only used when you generate audio or video in a project.
If your generated voice doesn’t sound as expected, it may be because your prompt is unclear or overly complex. We recommend experimenting with different prompts until you get the desired result.Remember that ElevenLabs may be unable to generate voices with highly specific regional accents. For that use case, we recommend voice cloning.