Reinforcement Learning Course ↗
Completely free and open-source
Prerequisites: Python, deep learning fundamentals
modelsThe most accessible free reinforcement learning course available — covers Q-learning, deep Q-networks, policy gradient methods, PPO, and RLHF (reinforcement learning from human feedback). RLHF is the technique that makes ChatGPT and Claude behave helpfully — understanding it gives you insight into how modern LLMs are trained and aligned. Unique coverage that no other free platform provides at this depth.