Whisper: Speech Recognition Trained on Web-Scale Weak Supervision
Whisper showed that large, diverse, weakly supervised audio data can produce robust multilingual speech recognition and translation models.
Topics
Models for transcribing, translating, and understanding spoken audio.
Whisper showed that large, diverse, weakly supervised audio data can produce robust multilingual speech recognition and translation models.