Explains reinforcement learning through the lens of AlphaGo Zero, which mastered the game of Go entirely through self-play, starting from only the rules and using no human game data. Covers the core concepts of agents, environments, rewards, and policies that underpin one of AI's most powerful paradigms.