Series

Reinforcement Learning

Join my journey through Sutton & Barto’s Reinforcement Learning textbook, distilling complex concepts (MDPs, value functions, Q-learning) into intuitive explanations for BSc CS learners.

Reinforcement Learning: First Principles
Initial Concepts and Core Ideas
Feb 5, 20258 min read3
Exploration vs. Exploitation: A Deep Dive into Multi-armed Bandits
A k-armed Bandit Problem Imagine you're at a casino, faced with a row of slot machines (one-armed bandits), each with its own hidden probability of paying out. Your goal is to maximize your winnings over the night, but you don't know which machines h...
Feb 19, 202512 min read6
Demystifying Reinforcement Learning: A Beginner's Guide to the Math
Introduction Imagine teaching a computer to play chess from scratch. How would it learn which moves lead to checkmate and which lead to defeat? How would it understand the long-term consequences of capturing a pawn versus protecting its queen? This i...
Mar 4, 202512 min read4

Command Palette