Voice Changer: Recorded voice to an AI voice

Murf's Voice Changer offers the exciting functionality of swapping a recorded voice with an AI voice. The Murf Studio offers 100+ human-sounding voices across accents, ages and gender, to suit any requirement. All of them are high quality, professional voices built for multimedia content. The Voice Changer is a unique feature with the powerful ability to transform a recording made at home or in an office into studio-quality audio.
 
A high quality speech audio from Murf can elevate the presentation and the overall finish of a variety of projects, from software demos, tutorials and e-learning modules, to web-based applications which may involve multiple screens with instructions.
  • E-learning voiceover - A tutorial might involve a presentation with a number of slides, with accompanied voice. The recorded audio may include surrounding noise, or vary in pitch, speed and volume. It may have been addressed to a specific set of people, or recorded in a particular situation, like a live class. A great way to disseminate this resource, already available with audio and video, to a wider audience, is to replace the recorded voice with a clean, uniform, accent neutral AI voice. A further advantage is the ability to edit long pauses, extraneous words and sounds like ‘ah, oh..’ and control the pace of delivery according to the needs of the project.
  • Software demo voiceover- A software demo, on the other hand, might have multiple screens as well as interactions within one screen or between screens. If including instructions or how-tos, the steps will need to be clearly articulated and demonstrated in order to be useful to those watching it. If the initial demo has been recorded with a voice while also capturing the screen interactions, the Voice Changer at Murf Studio can help smoothen the voice output to match the screens, eliminate unplanned delays and also add any other content that might have been missed out while recording.
If you are concerned about losing the personal touch that comes with the spontaneity of freestyle speech, or you are not sure about how to choose the AI voice for your project, read this blogpost about how to pick the perfect voice.
 
Murf’s AI voices are natural sounding human voices. In addition to being diverse, all voices can be further refined using features like pitch, emphasis, pauses and even pronunciation hints for proper nouns, technical words or phrases in another language. Converting a recorded voice to an AI voice with Murf has benefits beyond sounding better and removing background noise.
 
If you are concerned about losing the personal touch that comes with the spontaneity of freestyle speech, or you are not sure about how to choose the AI voice for your project, read this blogpost about how to pick the perfect voice.
 
Murf’s AI voices are natural sounding human voices. In addition to being diverse, all voices can be further refined using features like pitch, emphasis, pauses and even pronunciation hints for proper nouns, technical words or phrases in another language. Converting a recorded voice to an AI voice with Murf has benefits beyond sounding better and removing background noise.
 
Here’s a step-by-step guide on using the Voice Changer:
 
1. Upload recording
In Murf Studio you can upload a video or an audio file, whichever is available. For easier management of the media, a video uploaded with recorded audio will appear as two separate tracks. Do note that the the video format should be a .mp4 file. Audio files should be in .mp3 format.
 
Upload Audio
 
 
Upload Video
 

2. Speech to Text

When the audio and video tracks are split, they can be viewed in the tracker at the bottom of the Studio screen, separately. The audio is also converted into text: it will be split into smaller blocks, with pauses called out. The recorded audio of each text block can be played by clicking on the button on the top. 

 

 

The module-based splitting of text not only makes it easier to edit, it also allows the voice rendering feature to be applied to individual blocks of text, thereby using less rendering time.

3. Retain recorded voice

When the text is voiced in recorded audio (to know this see the icon to the left of the text - if it matches your login icon on the top right hand corner of the Studio, then it’s the recorded voice), the words can be muted or trimmed, but new words cannot be added. When muting text, the overall timing of the audio remains the same, so a pause of sorts will be heard in place of the muted speech (highlighted in light grey text). With trimming, both the word(s) and the corresponding audio time are removed, so the regenerated recording will have the speech before and after the word or words trimmed out. The overall duration of the recording will also be reduced accordingly.
To view the options available for a recorded voice, select the converted text and right click. Feel free to experiment, replay and redo - the edits can be undone by placing the cursor at the muted or trimmed text, right click and select - undo edits.
 

Recorded Pauses

The pauses that have been separated from the recorded audio can be removed. To do this, select the pause unit in the text block and the menu under the Voices tab will show a white trash can against an orange background. Click on it to delete the pause.
 
 
If you would like to shorten or lengthen the pause, here’s a pro tip: add a blank text block after the recorded voice, put your cursor in the box ‘Enter text here to generate voiceover’, and under the Voices tab, select the drop down under Add Pause. You can select a fixed duration, from Extra Weak (250ms) to Extra Strong (1.2s), or you can enter the exact duration you have in mind.
 

4. Change to AI Voice

If you want to edit the text that has been converted from recorded speech, it has to be changed to an AI voice. To do this, select the block you want to edit, go to the AI Changer tab and select Convert to AI voice. A default AI voice from the Murf Studio will be applied, and the text in the box can now be edited.

5. Edit script

The text can now be edited as easily as in any word document. To add more text at the end, click on the ‘+’ button below, and an empty block of text will appear. You can type out the script, or copy and paste the text from another source, with all the formatting removed. Each time text is added, the sentences will get split into separate units within a single text block.
 
 
 
To include text in a separate line within a block, select the line before or after which you would like add a new block, and click on the ‘+’ at the left hand corner of the top menu under the Voices tab.
 
 
To remove lines within a text block, select the lines you want to delete and click on the orange trash can at the top, between the + add paragraph and the language options.
 
Murf also has an inbuilt grammar and spell check assistant that is enabled by default, and will underline spelling errors, and highlight grammatical inconsistencies. To disable it, toggle the < > button under the Settings tab.
 
 
Insider tip: To add a new text in-between existing text blocks, add it in the end, then select the block in the tracker at the bottom of the screen. Right click, then select to move left or right. Use a new text block when you want to apply a feature to it that is different from the other blocks, like another voice, or a separate image / video.
 

6. Pick the perfect voice

Experiment with as many voices as you like using a single block of text, before applying it to the entire audio. Check out this article on choosing the perfect voiceover.
Pro Tip: If you want to try different voices for different sentences, it is possible! Under the Settings tab, disable the ‘Retain Voice’ button and now each sentence can be changed to a different voice. Once you pick the voice that suits your needs from the Murf Studio, you can enable the ‘Retain voice’ button and now when you select a voice for any of the text, it will be applied to all the text in that block.
 
 

7. Refine the voiceover

At this point you might have changed all the recorded voice into an AI voice, or only a part of it. You might have chosen only one voice for the entire video, or a mix of voices. Murf offers multiple features to help refine the tone and the voice presentation as a whole. Select the text block for which you want to edit the voiceover, and under the Voices tab, work with the pitch, speed and pause buttons. Remember, if the ‘Retain Voice’ button under Settings is enabled, the changes you choose will apply to all the lines in the text block. To apply to one sentence at a time, disable this button.
 
The following features are available under the Voices tab:
  1. Pitch: change the pitch of speech from half an octave below to half an octave above. Want to go back to how it all was? Click Reset.
  2. Speed: Options are available in multiples of 0.1, from zero to twice the speed. Experiment with a single unit of text within a block to save on rendering time.
  3. Pause: Click on the drop down under Add Pause. You can select a fixed duration, from Extra Weak (250ms) to Extra Strong (1.2s), or you can enter the exact duration you have in mind.

8. Advanced Voice Features

Depending on the AI voices you have chosen, some of them will have additional functionalities. These will also be visible under the Voices tab:
  1. Volume: From mute (0) to 1.5 times the default volume, in multiples of 0.1
  2. Emphasis: This allows for a specific part of the line to have an upward or a downward intonation. This is a useful feature for mimicking real speech, in dialogues, or when expressing an emotion. When used effectively, emphasis can also help break the monotony of a continuous voiceover.
  3. Pronunciation - If your voiceover has technical words, phrases or text that has a specific pronunciation, you can set this to your liking through the options in the Pronunciation menu, under the Voices tab. Select the text block with the word or phrase in question, and each word will appear highlighted separately.
 

Click on the word to be pronounced differently and enter the sounds as they need to be pronounced under the head ‘Pronunciation’. For more complex sounds, or for those familiar with Phonemes, enable the advanced button, and the Phonemes button which now appears next to the Pronunciation box. Now enter the phonemes as per the IPA standard for the selected word.

If you would like this setting to be applied to the word throughout the project, enable the Project-level button. If only for this instance, disable it.
 
 

Finishing touches

Murf also offers 50-odd soundtracks, tagged by emotion (happy, soothing, upbeat, to name a few) and categorised by type of project (e-learning, advertisement, podcast etc). You can also add your own soundtrack, but do take care that it is not protected under any copyright, so as to be able to use it and later share your project with others, as well. To add your own soundtrack, click on Upload soundtrack under the Audio/Music tab.
 
Syncing the audio and the video is the final piece of the editing puzzle that can make all the difference to the smoothness of the project itself. The tracker at the bottom gives you a bird’s eye view of the separate tracks, and you can move them around. To do this, select an audio or a video / image block and you will see the black outline, then right click to move left, right, or to delete the block altogether. When you select this track the Video/Image track expands automatically in the top menu of the Studio.
 
 
The time length of the audio and video blocks can also be adjusted. Increasing the audio time will automatically include pauses, and the updated time of the block can be seen at the top < / >. If you trim the text block in the tracker to a point when its duration is lesser than the time taken for the audio, you will see an alert message at the top of the respective text block. Make sure that the time is adequate for all the speech in the text block to be vocalised.

Error warning if length of voice block < voiceover size

 

Adequate length of voice block

 

9. Build and render

At this point, if you have finished putting together the audio track or the audio-video you had in mind, it’s time to get a preview. Click on the Build Video / Build Audio button (with the pickaxe to its left) above the picture screen on the right hand side of your project dashboard. Use the Quick Render option for a cursory check of the output, at lower quality. The final HD render will take more time, and provides a high quality, downloadable video or audio render.
The final render size can also be customised by platform.
 

10. Download and share

Once the final HD render is complete the Download and Share buttons will appear next to the Render button. To download, you need a paid plan. You can share the rendered video through murf with a free account, though, and Murf offers public sharing, restricted sharing and secure sharing as well.
 
To summarise, Murf’s Voice Changer can be the difference between a home recording and a professional sounding project.