GPT-4o Low Latency Screen to Voice Tutorial - SUPER IMPRESSIVE OCR!

Опубликовано: 24 Октябрь 2024
на канале: All About AI
11,479
358

GPT-4o Low Latency Screen to Voice Tutorial - SUPER IMPRESSIVE OCR!

👊 Become a member and get access to GitHub and Code:
   / allaboutai  

🤖 Great AI Engineer Course:
https://scrimba.com/learn/aiengineer?...

🔥 Open GitHub Repos:
https://github.com/AllAboutAI-YT/easy...

📧 Join the newsletter:
https://www.allabtai.com/newsletter/

🌐 My website:
https://www.allabtai.com

Today we recap my livestream where i built a low latency screen to voice reader with great ocr capabilites. This will look at the screen, answer any question or explain a problem, with pretty low latency pre new voice mode from GPT4o.

00:00 GPT4o Screen to Voice Intro
00:57 GPT4o Flowchart
01:42 Lets Build The Screen Reader
06:05 First Test
07:05 Lets Build The Voice
09:48 Second Test with Voice
10:32 Adding Control Key
11:05 Final Tests