Murf provides dedicated request capacity based on your plan. Each plan includes specific limits for concurrency and WebSocket connections. As your application scales, you can upgrade your plan to increase capacity.
Below is a summary of the concurrent TTS request limits for each plan:
Concurrency refers to the maximum number of generation requests that can be processed simultaneously. For all non-streaming endpoints, this is defined as the number of active requests at any given time.
Our TTS API supports streaming via both HTTP and WebSocket connections. Concurrency for streaming is defined by the number of unique context IDs active at a given time:
If the number of active contexts exceeds your concurrency limit, new context IDs will be rejected, and an error message will be returned.
WebSocket limits define the number of parallel WebSocket connections allowed at given time. Each plan supports up to 10X the streaming concurrency limit in parallel WebSocket connections.
The limits are designed to maintain system performance and ensure a consistent experience for all users. By adhering to the limits and following best practices, you can integrate the Murf API smoothly and efficiently into your applications. If you have additional questions or need guidance on managing API limits, please drop a message in our discord channel or contact our support team.