Professional voice cloning

Voice cloning is only available to Enterprise customers.

Introduction

Professional voice cloning lets you train a highly realistic model of a voice. We achieve this by training a dedicated model on a large set of scripted speaker data.

A professional voice clone will mirror the speaker data it is trained on. For an optimal clone, we require speakers to record a tailored script based on your content to help you achieve your desired speaking style. It's important that unwanted artefacts or sounds are not present during the recording. Otherwise, the model will replicate unwanted features, resulting in a subpar voice clone.

Professional voice cloning is currently available in 62 languages.

To speak to a member of our team about professional voice cloning please book a meeting.

Custom script

To help capture your ideal speaking style, we’ll use our Script Generator build a custom script based on your articles. Articles serve as a natural example of the content that the voice clone will be used to narrate, making it easier for the speaker to achieve your desired speaking style. The number of articles selected (typically between 20-50) will depend on their features, length and voice language.

Voice cloning process

Professional voice clones can require between 1-2 hours of speaker data depending on the language, training can take up to 48 hours.

Last updated