Universal-2: The Most Powerful Speech-to-Text Ever | Demo & Tutorial

Published: 07 November 2024
Channel: AssemblyAI


Universal-2 is a next-generation speech-to-text model that pushes beyond the traditional word error rate (WER) metric. It was built in just 6 months on top of Universal-1's industry-leading performance.
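
For readers new to WER, here is a minimal Python sketch of the standard definition: word-level edit distance (substitutions + deletions + insertions) divided by the number of reference words. The function name and the lowercase/whitespace normalization are assumptions made for illustration, not AssemblyAI's evaluation code. It shows why WER alone can miss formatting quality: once text is normalized, casing errors no longer count.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length."""
    # Illustrative normalization (an assumption): lowercase, split on whitespace.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


# A spoken number transcribed as a digit counts as one substitution.
print(word_error_rate("call me at five", "call me at 5"))
# Casing differences vanish after normalization, so WER does not see them.
print(word_error_rate("Call Doctor Smith", "call doctor smith"))
```

Because normalized WER hides casing and punctuation errors, results such as proper-noun, alphanumeric, and formatting accuracy are reported separately.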

Key results:
24% better at recognizing proper nouns
21% improvement in alphanumeric accuracy
15% improvement in text formatting
73% of users prefer Universal-2 compared to Universal-1
Overall, a more accurate and robust model, especially on complex real-world speech
Sets new standards across human and technical benchmarks

Architecture:
Smart architecture choices prioritized over simply scaling model size
Universal-2 uses a 660M parameter Conformer RNN-T model
Built an innovative all-neural formatting pipeline
Solved critical challenges like repeated token handling in RNN-T

Announcement Landing Page: https://www.assemblyai.com/universal-2
Try it yourself: https://www.assemblyai.com/playground
Google Colab: https://colab.research.google.com/dri...

▬▬▬▬▬▬▬▬▬▬▬▬ CONNECT ▬▬▬▬▬▬▬▬▬▬▬▬

🖥️ Website: https://www.assemblyai.com
🐦 Twitter: / assemblyai
🦾 Discord: / discord
▶️ Subscribe: https://www.youtube.com/c/AssemblyAI?...
🔥 We're hiring! Check our open roles: https://www.assemblyai.com/careers

▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬

#MachineLearning #DeepLearning