Falcon (Beta)

Falcon is an ultra-fast, scalable, and reliable speech synthesis model built for real-time conversational AI. Designed for production environments, Falcon delivers natural, human-like speech with ~130ms time-to-first-audio and scales effortlessly to thousands of concurrent sessions.

Character consumption is free while Falcon remains in Beta.

Ideal for

  • Customer support voice agents: Enable fast, natural conversations that feel human, reducing wait times and improving user satisfaction.
  • Debt servicing and collections: Automate payment reminders, due date alerts, and customer interactions with clear, compliant voice communication.
  • Healthcare assistants: Support patients with empathetic, fast, and accurate voice responses for scheduling, triage, or medication info.
  • Sales leads qualification: Serve as the first point of contact for inbound and outbound leads, qualifying them efficiently before handover to human agents.
  • Virtual assistants and chatbots: Power smooth voice interactions across apps and devices with instant response times.
  • Voice IVR systems: Deliver dynamic, real-time responses for inbound and outbound calls in high-volume environments.

Why Falcon

Falcon is built for enterprise Conversational AI applications where speed, scale and natural speech quality cannot be compromised.

Ultra-low Latency

Falcon model is optimized for real-time use cases where responsiveness is critical. With time-to-first-audio under 130ms, it ensures conversations feel seamless, natural, and instant. Deployments near your data center ensure lower latency while meeting privacy and data residency requirements.

Enterprise scale concurrency

Built for large-scale deployments, Falcon can support 10,000+ concurrent calls in parallel without compromising audio quality or stability. This makes it ideal for enterprises running high-volume customer interactions.

Multinative speech

Falcon voices can seamlessly switch between multiple languages within a single sentence while preserving natural pronunciation for each language. For example, a customer support agent could effortlessly switch between English and Spanish, or Hindi and English, just like a bilingual human speaker.

Falcon Supported Voices

Voice IDSupported LocalesVoice Styles
Matthewen-US (English - US & Canada)Conversation
Zionen-US (English - US & Canada)Conversational
Kenen-US (English - US & Canada)Conversation
Riveren-US (English - US & Canada)Conversation
Emilyen-US (English - US & Canada)Narration
Voice IDSupported LocalesVoice Styles
Anishaen-IN (English - India)Conversation
Voice IDSupported LocalesVoice Styles
Namritahi-IN (Hindi - India)Conversation
Voice IDSupported LocalesVoice Styles
Amarabn-IN (Bengali - India)Conversation

Expressive speech

Falcon reproduces the intonation, rhythm, and natural pauses of human speech, creating conversations that feel authentic and empathetic. Its speech patterns are tailored for conversational AI, virtual assistants, and customer-facing agents.

Unparalleled pronunciation accuracy

With 99.37% accuracy, Falcon ensures clarity across industries where precision matters, from financial transactions and healthcare communication to legal and compliance-driven conversations.

Data Residency

Murf offers data residency through isolated regional environments for the streaming & websocket TTS API, allowing Enterprise customers to choose where their text and audio processing takes place.

Available Regions

Use the region closest to your users for the lowest latency.

Region (City/Area)Endpoint
US-Easthttps://us-east.api.murf.ai/v1/speech/stream
US-Westhttps://us-west.api.murf.ai/v1/speech/stream
Indiahttps://in.api.murf.ai/v1/speech/stream
Canadahttps://ca.api.murf.ai/v1/speech/stream
South Koreahttps://kr.api.murf.ai/v1/speech/stream
UAEhttps://me.api.murf.ai/v1/speech/stream
Japanhttps://jp.api.murf.ai/v1/speech/stream
Australiahttps://au.api.murf.ai/v1/speech/stream
EU (Central)https://eu-central.api.murf.ai/v1/speech/stream
UKhttps://uk.api.murf.ai/v1/speech/stream
South America (São Paulo)https://sa-east.api.murf.ai/v1/speech/stream

The Global Router automatically picks the nearest region automatically.The concurrency limit is 15 for the US-East region and 2 for all other regions. To get higher concurrency, use the US-East endpoint directly or contact us to increase limits for regional endpoints.