Returns a url to the generated audio file along with other associated properties.
The text that is to be synthesised. e.g. ‘Hello there [pause 1s] friend’
This parameter allows specifying the duration (in seconds) for the generated audio. If the value is 0, this parameter will be ignored. Only available for Gen2 model.
Valid values: STEREO, MONO
Set to true to receive audio in response as a Base64 encoded string instead of a url.
Format of the generated audio file. Valid values: MP3, WAV, FLAC, ALAW, ULAW
Valid values: GEN1, GEN2. Use GEN2 to generate audio using new and advanced model. Outputs from Gen 2 will sound better, but different from the old model
Specifies the language for the generated audio, enabling a voice to speak in multiple languages natively. Only available in the Gen2 model. Valid values: “en-US”, “en-UK”, “es-ES”, etc. Use the GET /v1/speed/voices endpoint to retrieve the list of available voices and languages.
Pitch of the voiceover
An object used to define custom pronunciations.
Example 1: {“live”:{“type”: “IPA”, “pronunciation”: “laɪv”}}.
Example 2: {“2022”:{“type”: “SAY_AS”, “pronunciation”: “twenty twenty two”}}
Speed of the voiceover
Valid values are 8000, 24000, 44100, 48000
The voice style to be used for voiceover generation.
Higher values will add more variation in terms of Pause, Pitch, and Speed to the voice. Only available for Gen2 model.
Ok
Number of characters consumed so far in the current billing cycle.
Remaining number of characters available for synthesis in the current billing cycle.