Stream Speech
Returns a streaming output of generated audio
Headers
api-key
Request
This endpoint expects an object.
text
The text that is to be synthesised. e.g. ‘Hello there [pause 1s] friend’
voiceId
Use the GET /v1/speech/voices API to find supported voiceIds. You can use either the voiceId (e.g. en-US-natalie) or just the voice actor’s name (e.g. natalie).
channelType
Valid values: STEREO, MONO
format
Format of the generated audio file. Valid values: MP3, WAV
multiNativeLocale
Specifies the language for the generated audio, enabling a voice to speak in multiple languages natively. Only available in the Gen2 model. Valid values: “en-US”, “en-UK”, “es-ES”, etc. Use the GET /v1/speech/voices endpoint to retrieve the list of available voices and languages.
pitch
Pitch of the voiceover
rate
Speed of the voiceover
sampleRate
Valid values are 8000, 24000, 44100, 48000
style
The voice style to be used for voiceover generation.
variation
Higher values will add more variation in terms of Pause, Pitch, and Speed to the voice. Only available for Gen2 model.
Response
Ok