The Falcon TTS Server is the core engine of Murf’s on-prem text-to-speech system. It processes text input and generates speech locally, delivering fast, high-quality output without relying on external APIs. Running Falcon within your own infrastructure ensures low-latency performance, data privacy, and full operational control.
This guide provides step-by-step instructions for deploying the Murf TTS (Text-to-Speech) service on an AWS EC2 instance using Docker with GPU acceleration.

Check if drivers are already installed:
If you see GPU information displayed, drivers are already installed and you can skip to step 3.
If drivers are NOT installed, install them:
After reboot, verify installation:
Run nvidia-smi to verify the installation.
AWS GPU instances launched with Deep Learning AMIs or GPU-optimized AMIs usually have drivers pre-installed.
Install Docker if not already installed:
Test that Docker can access the GPU:
You should see your GPU listed with driver information.
Murf team will provide you with the Docker image for the Falcon TTS Server. You will be provided with a docker pull command to pull the image.
Create environment variables for your deployment:
Important: Replace your-secure-secret-key-here with your actual production secret key.
Run the Docker container with GPU support:
Command breakdown:
-d : Run in detached mode (background)--gpus all : Enable access to all available GPUs-e TTS_MASTER_SECRET : Pass the master secret for authentication-e LLA_ENDPOINT : (Optional) Specify custom LLA server endpoint--name murf-tts : Name the container “murf-tts”-p 80:8000 : Map host port 80 to container port 8000--restart unless-stopped : Automatically restart container unless manually stoppedLast parameter: Docker image URIMonitor the startup logs to ensure the service initializes correctly:
This should show the startup logs and the service should be ready to use.
Option 1: Health Check via Browser Open your browser and navigate to:
You should see:
Option 2: Interactive Audio Test Page Navigate to the test page:
This provides a web interface to:
Option 3: API Documentation
View the interactive API documentation:
This opens the FastAPI Swagger UI for exploring all available endpoints.