Streaming
Murf TTS API supports real-time streaming capabilities, allowing developers to generate and play text-to-speech (TTS) audio dynamically as it is being generated in real-time, reducing the time-to-first-byte. This ensures minimal latency, making it ideal for conversational AI, real-time applications, and voice-enabled assistants.
In addition to HTTP streaming endpoint, Murf TTS will soon support Websocket streaming which enables bidirectional streaming for real-time audio generation.
Quickstart
Streaming enables returning raw audio bytes (e.g., MP3 data) directly over HTTP using chunked transfer encoding. This allows clients to process or play audio incrementally as it is generated. This section focuses on how streaming works for requests made to the Text to Speech API.
Getting Started
Generate an API key here. Store the key in a secure location, as you’ll need it to authenticate your requests. You can optionally save the key as an environment variable in your terminal.
FAQs
What features/ SSML are supported in streaming mode?
Yes, we support tags to control voice styles, pitch and pauses.
Is streaming TTS priced differently from non-streaming TTS?
No, streaming TTS is priced the same as TTS. It is billed per character.
Are there concurrency or rate limits?
Yes. For streaming, concurrency for free and PAYG plan is limited to one. We provide custom concurrency for enterprise plans. Contact sales for scaling needs.
Are the same voices and languages available as in the batch API?
All the voices and languages supported in TTS are available via streaming. A full list is available in our docs.
What audio format will this support
It supports wav and mp3 formats.