Streaming
Murf TTS API supports real-time streaming capabilities, allowing developers to generate and play text-to-speech (TTS) audio dynamically as it is being generated in real-time, reducing the time-to-first-byte. This ensures minimal latency, making it ideal for conversational AI, real-time applications, and voice-enabled assistants.
New (Beta): Pass model = FALCON to use our Falcon model in text-to-speech streaming endpoints, designed for ultra-low latency (~130 ms).
In addition to HTTP streaming endpoint, Murf TTS supports Websocket streaming which enables bidirectional streaming for real-time audio generation.
Quickstart
Streaming enables returning raw audio bytes (e.g., MP3 data) directly over HTTP using chunked transfer encoding. This allows clients to process or play audio incrementally as it is generated. This section focuses on how streaming works for requests made to the Text to Speech API.
Getting Started
Generate an API key here. Store the key in a secure location, as you’ll need it to authenticate your requests. You can optionally save the key as an environment variable in your terminal.
Falcon Supported Voices
English - US & Canada
English - India
Hindi - India
Bengali - India
Endpoint & Concurrency Overview
The Global Router automatically picks the nearest region automatically.The concurrency limit is 15 for the US-East region and 2 for all other regions. To get higher concurrency, use the US-East endpoint directly or contact us to increase limits for regional endpoints.
Available Regions
Use the region closest to your users for the lowest latency.
FAQs
What is Falcon (Beta)?
Falcon is our fastest streaming model (~130 ms latency) optimized for real-time interactions.
Who should use Falcon (Beta)?
Use Falcon when your top priority is ultra-low latency. Typical fits include:
- Conversational agents & live support where snappy turn-taking matters.
- Real-time apps (IVR, gaming, tutoring, assistive tech) that stream audio as users speak.
- Interruptible/barge-in experiences and interactive demos or prototyping.
How do I enable Falcon?
Include model = FALCON in your request (HTTP or WebSocket). If omitted, the default streaming model is used.
What features/ SSML are supported in streaming mode?
Yes, we support tags to control voice styles, pitch and pauses.
Are the same voices and languages available as in the batch API?
All the voices and languages supported in TTS are available via streaming.A full list is available in our docs.
What audio format will this support
- We support MP3, FLAC, WAV, ALAW, ULAW, OGG, and PCM.
- If you need to transmit audio as text, you can Base64-encode any of these.