Voice cloning (audio) vs Brand voice (text) — different features
Voice Cloning is an audio feature. Train ContentRyte on a sample of your voice, then it generates audio in your voice forever.
Brand Voice is a text feature. Train ContentRyte on your written articles, then it generates text in your writing style.
Both are part of the AI Multimedia + Workspace tiers.
Setting up voice cloning
- Sidebar → Brand Voice → + Clone Voice
- Upload a 30-second to 3-minute audio sample of your voice speaking clearly (no background noise, no music)
- Name the voice (e.g., "Rahul - Podcast")
- Click Train
- After 30–60 seconds, the voice model is ready
Using your cloned voice
In the Text-to-Speech generator:
- Select your cloned voice from the dropdown
- Paste the text you want narrated
- Click Generate
- The MP3 plays back in your cloned voice
Use it for podcasts, video voiceovers, audio versions of articles, custom on-hold messages.
Setting up Brand Voice (text)
- Open a Workspace
- Click Brand Voice → Train
- Paste 3–5 of your best-written articles
- Click Save
From that point forward, every article generated in this Workspace writes in your text style — sentence length, vocabulary, opinion structure, transitions, formatting preferences.
How many voices / styles can I train?
- MegaBundle: 5 cloned voices, 1 brand voice per Workspace
- Complete Bundle: unlimited
- FE Commercial: limited (typically 1 of each, intended for trial)
Was this article helpful?
That’s Great!
Thank you for your feedback
Sorry! We couldn't be helpful
Thank you for your feedback
Feedback sent
We appreciate your effort and will try to fix the article