OpenAI's Whisper is a speech to text, or automatic speech recognition model. It is a "weakly supervised" encoder-decoder transformer trained on 680,000 hours of audio. Not only can it transcribe English, it can transcribe 96 other languages along with also being able to translate from those languages to English.
Neural Networks from Scratch book: https://nnfs.io
Channel membership: / @sentdex
Discord: / discord
Reddit: / sentdex
Support the content: https://pythonprogramming.net/support...
Twitter: / sentdex
Instagram: / sentdex
Facebook: / pythonprogramming.net
Twitch: / sentdex