Original Text in Word Durations

A new parameter, wordDurationsAsOriginalText, has been added to the /v1/speech/generate endpoint.

By default, the wordDurations object in the API response contains normalized text. When you set wordDurationsAsOriginalText to true, the response will instead include the original, un-normalized text from your request. This allows for a direct mapping between your input text and the corresponding word-level timestamps.

This feature is currently available for English only.