Transform voice recordings into high-quality, lifelike AI voices using Murf’s Voice Changer. With just a few parameters, you can change the speaker’s voice, adjust pitch and speed, and insert custom pauses, all while automatically preserving the original speaker’s rhythm, tone, and accent.
Try this capability in the API Reference Playground. Simply generate your API key and start exploring the API.
You can generate your API key from the Murf API Dashboard and optionally set it as an environment variable.
If you’re using Python, you can install Murf’s Python SDK using the following command:
A link to the audio file will be returned in the response. You can use this link to download the audio file and use it wherever you need it. The audio file will be available for download for 24 hours after generation.
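As a minimal sketch of working with the returned link, the Python snippet below downloads the file using only the standard library and checks whether a link is still inside the 24-hour window. The function names, and the idea of recording the generation time yourself, are assumptions made for illustration; they are not part of Murf’s SDK.

```python
from datetime import datetime, timedelta, timezone
from typing import Optional
import urllib.request

# The 24-hour availability window comes from the text above.
LINK_LIFETIME = timedelta(hours=24)


def link_is_still_valid(generated_at: datetime, now: Optional[datetime] = None) -> bool:
    """Return True if fewer than 24 hours have passed since generation.

    `generated_at` is a timezone-aware timestamp that the caller records
    when the response arrives (an assumption of this sketch).
    """
    if now is None:
        now = datetime.now(timezone.utc)
    return now - generated_at < LINK_LIFETIME


def download_audio(url: str, dest_path: str) -> str:
    """Save the audio found at `url` to `dest_path` and return the path."""
    with urllib.request.urlopen(url) as response, open(dest_path, "wb") as out:
        out.write(response.read())
    return dest_path
```

Because the link expires, it is usually safer to download the file soon after generation and keep your own copy than to store the link for later use.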
The Voice Changer endpoint offers powerful speech transformation capabilities, controlled through key parameters such as voiceId, pitch, speed, and pauses.
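The following sketch shows one way to assemble these options before sending a request. The helper `build_voice_changer_params` is hypothetical. The key names are taken from the feature names in the text above and may not match the exact field names the API or SDK expects, so check the API Reference before relying on them. Pauses are left out because their format is not described here.

```python
from typing import Any, Dict, Optional


def build_voice_changer_params(
    voice_id: str,
    pitch: Optional[int] = None,
    speed: Optional[int] = None,
) -> Dict[str, Any]:
    """Collect request options, leaving out any that were not set.

    Key names mirror the feature names in the text (an assumption);
    omitted options are left for the service to default.
    """
    params: Dict[str, Any] = {"voiceId": voice_id}
    if pitch is not None:
        params["pitch"] = pitch
    if speed is not None:
        params["speed"] = speed
    return params
```

Sending only the options you actually set keeps requests small and lets the service apply its own defaults for the rest.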
The maximum allowed input length is 3 minutes per request.
The API accepts the following input audio formats: WAV, MP3, ALAW, ULAW, FLAC.
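The length and format limits above can be checked locally before uploading, which avoids a rejected request. The sketch below is illustrative only: `check_input` is a hypothetical helper, it assumes the format can be inferred from the file extension (which may not hold for ALAW and ULAW files), and it expects the caller to supply the duration in seconds.

```python
from pathlib import Path
from typing import List

# Limits taken from the text above.
MAX_INPUT_SECONDS = 3 * 60
SUPPORTED_INPUT_FORMATS = {"wav", "mp3", "alaw", "ulaw", "flac"}


def check_input(path: str, duration_seconds: float) -> List[str]:
    """Return a list of problems; an empty list means the input looks acceptable."""
    problems: List[str] = []
    # Assumption: the extension reflects the actual audio format.
    extension = Path(path).suffix.lower().lstrip(".")
    if extension not in SUPPORTED_INPUT_FORMATS:
        problems.append(f"unsupported input format: {extension or 'none'}")
    if duration_seconds > MAX_INPUT_SECONDS:
        problems.append(
            f"input is {duration_seconds:.0f}s long; the limit is {MAX_INPUT_SECONDS}s"
        )
    return problems
```

For example, `check_input("take1.wav", 120)` returns an empty list, while a 200-second file produces a length problem.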
Our system supports the following output formats: WAV (Default), MP3, FLAC, ALAW, and ULAW. The Voice Changer endpoint offers the same range of sample rates and channel types as the Speech Synthesis endpoint, allowing users to optimize output quality based on their specific needs.
The maximum input length is 3 minutes. Files longer than this will be rejected.
Yes. The system automatically retains the original speaker’s prosody and accent, meaning their rhythm, tone, pacing, and regional speech patterns are preserved in the transformed voice for natural-sounding results. This behavior is always on and does not need to be configured manually.