Stream Speech
Returns the generated audio as a streamed response.
Headers
Request
The text to be synthesised, e.g. ‘Hello there [pause 1s] friend’.
Use the GET /v1/speech/voices endpoint to find supported voiceIds.
Channel type of the generated audio. Valid values: STEREO, MONO
Format of the generated audio file. Valid values: MP3, WAV
Specifies the language for the generated audio, enabling a voice to speak in multiple languages natively. Only available in the Gen2 model. Valid values: “en-US”, “en-UK”, “es-ES”, etc. Use the GET /v1/speech/voices endpoint to retrieve the list of available voices and languages.
Pitch of the voiceover
Speed of the voiceover
Sample rate of the generated audio, in Hz. Valid values: 8000, 24000, 44100, 48000
The voice style to be used for voiceover generation.
Higher values add more variation in pause, pitch, and speed to the voice. Only available in the Gen2 model.
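The request fields above list a few closed sets of valid values, so a client can check them before sending anything. The following is a minimal sketch of such a check, not the official client. The JSON field names (`text`, `voiceId`, `channelType`, `format`, `sampleRate`), the helper name `build_speech_payload`, and the default values are assumptions made for illustration, since this page does not state them.

```python
# Valid values copied from the parameter descriptions on this page.
VALID_CHANNELS = {"STEREO", "MONO"}
VALID_FORMATS = {"MP3", "WAV"}
VALID_SAMPLE_RATES = {8000, 24000, 44100, 48000}


def build_speech_payload(text, voice_id, channel="MONO",
                         audio_format="MP3", sample_rate=44100):
    """Validate the inputs and return a JSON-serialisable request body.

    The dictionary keys and the defaults are hypothetical; check the
    request schema of the API before relying on them.
    """
    if not text:
        raise ValueError("text must not be empty")
    if channel not in VALID_CHANNELS:
        raise ValueError(f"channel must be one of {sorted(VALID_CHANNELS)}")
    if audio_format not in VALID_FORMATS:
        raise ValueError(f"format must be one of {sorted(VALID_FORMATS)}")
    if sample_rate not in VALID_SAMPLE_RATES:
        raise ValueError(f"sample rate must be one of {sorted(VALID_SAMPLE_RATES)}")
    return {
        "text": text,
        "voiceId": voice_id,          # assumed key name
        "channelType": channel,       # assumed key name
        "format": audio_format,       # assumed key name
        "sampleRate": sample_rate,    # assumed key name
    }
```

Calling `build_speech_payload("Hello there [pause 1s] friend", "some-voice-id")` returns a dictionary that can be passed to an HTTP client as the JSON body, while an unsupported sample rate such as 16000 raises `ValueError` before any request is made.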
Response
OK
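Because the response is streamed, the audio arrives in pieces rather than as one complete file. A rough sketch of how a client might consume it is shown here, assuming the HTTP library exposes the response body as an iterable of byte chunks (for example, `Response.iter_content` in `requests` when the call is made with `stream=True`). The helper name `save_audio_stream` is hypothetical.

```python
import io
from typing import BinaryIO, Iterable


def save_audio_stream(chunks: Iterable[bytes], sink: BinaryIO) -> int:
    """Write streamed audio chunks to a binary sink as they arrive.

    Returns the total number of bytes written. Empty chunks, which some
    HTTP clients emit as keep-alives, are skipped.
    """
    total = 0
    for chunk in chunks:
        if not chunk:
            continue
        sink.write(chunk)
        total += len(chunk)
    return total


# Stand-in for a real network stream: three chunks, one of them empty.
buffer = io.BytesIO()
written = save_audio_stream([b"ID3", b"", b"\x00\x01"], buffer)
```

In real use the sink would typically be a file opened with `open("speech.mp3", "wb")`, or an audio player that accepts data incrementally, so playback can begin before synthesis finishes.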