“LLAMA2 supercharged with vision & hearing?!” | Multimodal 101 tutorial

Опубликовано: 16 Октябрь 2024
на канале: AI Jason

27,889

Explore Multimodal language model, like LLaVA, which enables you reach GPT4 level multimodal abilities, unlock use cases like chat with images

🔗 Links
Join my community: https://www.skool.com/ai-builder-club...
Follow me on twitter: / jasonzhou1993
Join my AI email list: https://www.ai-jason.com/
My discord: / discord
LLaVA link: https://llava-vl.github.io/

⏱️ Timestamps
0:00 Intro
1:03 What is multimodal?
1:23 LLaVA model
2:08 Demo
3:35 Use case: Product development
5:17 Use case: Content curation
6:27 Use case: Medical
7:07 Use case: Captcha
8:09 Use case: Robots

👋🏻 About Me
My name is Jason Zhou, a product designer who shares interesting AI experiments & products. Email me if you need help building AI apps! [email protected]

#gpt #autogpt #ai #artificialintelligence #tutorial #stepbystep #openai #llm #largelanguagemodels #largelanguagemodel #chatgpt #multimodality #gpt4 #multimodal #llama2 #llama #llava #machinelearning