🤝 Want your team maximizing Claude? I run 1:1 and team AI workshops for companies doing $1M+ per year: https://aibuilder.academy/yt/slJqu3N16Xc This is the 2nd video in a larger series on reinforcement learning (RL) with LLMs. Here, I discuss 3 ways people are using RL to train modern LLMs and AI agents. ▶️ Series Playlist: https://www.youtube.com/playlist?list=PLz-ep5RbHosU_UY8NtZAMaraz74sMHo2W References [1] https://youtu.be/3vFISl7qMFI [2] arXiv:2203.02155 [cs.CL] [3] https://youtu.be/6yIMb0K-aS4 [4] https://youtu.be/uaZ3yRdYg8A [5] arXiv:2509.16679 [cs.CL] [6] arXiv:2509.04501 [cs.CL] [7] https://youtu.be/7xTGNNLPyMI [8] arXiv:2212.08073 [cs.CL] [9] arXiv:2501.12948 [cs.CL] [10] https://youtu.be/gEDl9C8s_-4 Introduction - 0:00 Reinforcement Learning (RL) - 0:17 RL with LLMs - 1:29 How LLMs are Trained - 3:25 3 Ways to RL with LLMs - 6:15 Way 1: RLHF - 6:43 Way 2: RLAIF - 9:40 Way 3: RLVR - 13:28 Limitations - 18:19 What's Next? - 19:58

The 8 Claude Skills Running My Business
1.2K views

How to Use Claude Better than 99% of Founder-CEOs
798 views

Claude Cowork Explained in 29 Minutes (for non-coders)
1.7K views

How I Taught Claude To Edit My YouTube Videos
4.5K views

How to Automate Anything with Claude (4-Step Framework)
4.4K views

Claude Code for SWE Teams: Building a Shared AI Coding Toolkit
1.9K views