Audio Speech Recogition (ASR) Model
'whisper-large-v3' = Multilingual high quality, 'whisper-large-v3-turbo' = Multilingual fast with minimal impact on quality, good balance, 'distil-whisper-large-v3-en' = English only, fastest with also slight impact on quality
Timestamp Granularities
The level of detail of time measurement in the timestamps.
Language