Vigyata.AI
Is this your channel?

Netflix ML System Design [Explained] Video Recommendation System | Retriever Ranker Models

1.6K views· 61 likes· 42:22· Aug 23, 2025

🛍️ Products Mentioned (12)

Netflix ML System Design [Explained] Video Recommendation System | Retriever Ranker Models 💬 Comment below if you have questions 👍 Like | 🔄 Share | 🔔 Subscribe for more GenAI projects 📌 Source code & updates: GitHub → https://github.com/simranjeet97 ✍️ Read detailed blogs & project breakdowns on Medium: https://bit.ly/3JGXqwc Learning Playlists GenAI Agentic AI Course [14+ Agents] https://www.youtube.com/playlist?list=PLYIE4hvbWhsAkn8VzMWbMOxetpaGp-p4k GenAI Full Course with LLM Fine-Tuning & Evaluation https://bit.ly/4bJwZla Learn RAG from Scratch with GenAI Projects https://bit.ly/3Zl47KD Latest AI / GenAI Research Papers Explained https://bit.ly/4huqEMT RAG & LLM Use-Cases in Finance Domain https://bit.ly/3AGSRQm Prompt Engineering https://bit.ly/42v376M Financial Data Analysis & Modelling https://bit.ly/3OCWI5O Artificial Intelligence Projects https://bit.ly/3L8lhEi End-to-End Data Science Project https://bit.ly/3BfC3N9 Explainable AI (XAI) https://bit.ly/3gsuIxb Face Recognition Projects https://bit.ly/2YphpHm #MLSystemDesign #AIProjects #GenAI #AgenticAI #RAG #LLM #MLOps #AIEngineering #deeplearning Disclaimer: Made with ChatGPT, only for educational purposes.

About This Video

In this video, I break down Netflix-style recommendation system design with the classic two-stage setup: a retriever and a ranker. The big idea is simple—at Netflix scale you can’t score every video for every user in real time, so you first retrieve a small candidate set fast (from millions to hundreds/thousands), and then you run a heavier ranking model to order those candidates for the final homepage rows. I walk through how to think about user signals, item features, and why “retrieval vs ranking” is not just a modeling choice—it’s a latency + cost + quality trade-off. I also connect the dots to practical engineering: what features typically go into each stage, how embeddings help with candidate generation, and why ranking models often look like CTR/watch-time predictors with richer context (device, time, session intent, freshness, diversity). If you’re building your own recommender (or even a RAG-style retrieval pipeline), this is the same system-design thinking: narrow the search space first, then spend compute where it matters. If you want to implement this end-to-end, I’ve shared my code and updates on GitHub and I publish deeper breakdowns on Medium—those links are in the description.

Frequently Asked Questions

🎬 More from FreeBirds Crew - Data Science and GenAI