Original Text in Word Durations
A new parameter, wordDurationsAsOriginalText, has been added to the /v1/speech/generate endpoint.
By default, the wordDurations object in the API response contains normalized text. When you set wordDurationsAsOriginalText to true, the response will instead include the original, un-normalized text from your request. This allows for a direct mapping between your input text and the corresponding word-level timestamps.
This feature is currently available for English only.
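As a rough illustration of the mapping described above, the following Python sketch builds a request body with wordDurationsAsOriginalText enabled and then lines up returned words with character offsets in the input text. Only the endpoint path, the wordDurations object, and the wordDurationsAsOriginalText flag come from this entry. The text and voiceId request fields, the word, startMs, and endMs entry fields, and the sample entries are assumptions made for illustration, so check the API reference for the actual schema.

```python
import json


def build_request_body(text: str, voice_id: str) -> str:
    """Serialize a JSON body for POST /v1/speech/generate.

    "text" and "voiceId" are assumed field names; only
    "wordDurationsAsOriginalText" is taken from this changelog entry.
    """
    body = {
        "text": text,
        "voiceId": voice_id,
        "wordDurationsAsOriginalText": True,
    }
    return json.dumps(body)


def align_to_input(text: str, word_durations: list) -> list:
    """Map each returned word to its character span in the input text.

    Assumes each entry looks like {"word": ..., "startMs": ..., "endMs": ...}
    (hypothetical field names). Returns (start, end, start_ms, end_ms) tuples.
    """
    spans = []
    cursor = 0
    for entry in word_durations:
        word = entry["word"]
        # Search forward from the previous match so repeated words
        # resolve to successive occurrences.
        start = text.find(word, cursor)
        if start == -1:
            continue  # word not found verbatim; skip rather than guess
        end = start + len(word)
        spans.append((start, end, entry["startMs"], entry["endMs"]))
        cursor = end
    return spans


text = "Dr. Smith paid $5."

# Hand-written sample, not real API output: with the flag set to true,
# the words are expected to match the input text verbatim.
sample_word_durations = [
    {"word": "Dr.", "startMs": 0, "endMs": 300},
    {"word": "Smith", "startMs": 300, "endMs": 700},
    {"word": "paid", "startMs": 700, "endMs": 1000},
    {"word": "$5.", "startMs": 1000, "endMs": 1600},
]

spans = align_to_input(text, sample_word_durations)
```

With normalized text (the default), a token such as "$5." would presumably come back in expanded form and the verbatim search would fail, which is why the flag makes this kind of highlighting or captioning logic simpler.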