Auto transcribe
Auto transcribe is our most popular option for creating captions. When selected, CaptionHub sends your audio to a speech recognition provider, which returns a raw transcript. CaptionHub then uses its proprietary Natural Captions technology to process the transcript into aligned, human-readable captions.
CaptionHub currently supports five speech recognition providers:
- Speechmatics
- Amazon Transcribe
- Scriptix (Whisper)
- ElevenLabs ASR (Scribe)
- ByteDance ASR (beta)
Each provider offers slightly different functionality, all of which is supported within CaptionHub. To change your default transcription provider, navigate to Team Settings > Transcription.
To use Auto transcribe, select it from either the Create captions dialogue box or the Replace original captions dialogue box. Youβll then need to choose the language spoken in your video.

Supported languages
You can view the languages supported by each transcription engine here.
Custom dictionaries
Depending on your subscription, you may have access to custom dictionaries. Custom dictionaries help bias speech recognition towards predefined terminology such as names, places, products, or brand-specific language.
To use a custom dictionary:
- Create the dictionary in Team Settings
- Select it in the transcription dialogue before submitting your media for transcription