Explore Multimodal language model, like LLaVA, which enables you reach GPT4 level multimodal abilities, unlock use cases like chat with images 🔗 Links - Join my community: https://www.skool.com/ai-builder-club/about - Follow me on twitter: https://twitter.com/jasonzhou1993 - Join my AI email list: https://www.ai-jason.com/ - My discord: https://discord.gg/eZXprSaCDE - LLaVA link: https://llava-vl.github.io/ ⏱️ Timestamps 0:00 Intro 1:03 What is multimodal? 1:23 LLaVA model 2:08 Demo 3:35 Use case: Product development 5:17 Use case: Content curation 6:27 Use case: Medical 7:07 Use case: Captcha 8:09 Use case: Robots 👋🏻 About Me My name is Jason Zhou, a product designer who shares interesting AI experiments & products. Email me if you need help building AI apps! ask@ai-jason.com #gpt #autogpt #ai #artificialintelligence #tutorial #stepbystep #openai #llm #largelanguagemodels #largelanguagemodel #chatgpt #multimodality #gpt4 #multimodal #llama2 #llama #llava #machinelearning

Ralph-loop 2.0? The real autonomous coder is coming...
20.3K views

New AI coding paradiagm - OpenAI Symphony
42.1K views

Okay, this unleashed my agent
19.0K views

wtf is Harness Engineer & why is it important
84.8K views

How to prompt Gemini 3.1 for Epic animations
24.4K views

Anthropic killed Tool calling
206.1K views