Model Catalog

Ready-to-use configurations for 1-click deployments!


distil-large-v3

A distilled version of the Whisper large-v3 model that runs 6.3 times faster and performs to within 1% WER (word error rate) of large-v3 on long-form audio.

GPU 1x Nvidia L4
$0.8 / hour

whisper-large-v3

A newer version of the Whisper large model, with improved performance across a wide variety of languages and 10% to 20% fewer errors than Whisper large-v2.

GPU 1x Nvidia T4
$0.5 / hour

whisper-large-v3-turbo

A fine-tuned version of a pruned Whisper large-v3. It is 8x faster than the original, at the cost of a minor degradation in quality.

GPU 1x Nvidia T4
$0.5 / hour