π€ Want your team maximizing Claude? I run 1:1 and team AI workshops for companies doing $1M+ per year: https://aibuilder.academy/yt/YOvxh_ma5qE Multimodal embeddings represent multiple data modalities in the same vector space. Here, I discuss how they are developed and two example use cases: 0-shot classification and image search. Resources: π° Blog: https://medium.com/towards-data-science/multimodal-embeddings-an-introduction-5dc36975966f?sk=8b7b6b81b3e890192aafeda15492c7de π» GitHub Repo: https://github.com/ShawhinT/YouTube-Blog/tree/main/multimodal-ai References: [1] BERT: https://arxiv.org/abs/1810.04805 [2] ViT: https://arxiv.org/abs/2010.11929 [3] CLIP: https://arxiv.org/abs/2103.00020 [4] Though2Text: https://arxiv.org/abs/2410.07507 [5] A Simple Framework for Contrastive Learning of Visual Representations: https://arxiv.org/abs/2002.05709 Introduction - 0:00 What are embeddings? - 1:01 Multimodal Embeddings - 5:08 Contrastive Learning - 6:56 Contrastive Learning (Details) - 8:16 Example 1: 0-shot Image Classification - 15:17 Example 2: Image Search - 19:50 What's Next? - 22:47

The 8 Claude Skills Running My Business
1.2K views

How to Use Claude Better than 99% of Founder-CEOs
798 views

Claude Cowork Explained in 29 Minutes (for non-coders)
1.7K views

How I Taught Claude To Edit My YouTube Videos
4.5K views

How to Automate Anything with Claude (4-Step Framework)
4.4K views

Claude Code for SWE Teams: Building a Shared AI Coding Toolkit
1.9K views