Learn/Core Concept How does reinforcement learning work? Reinforcement learning teaches AI systems to make decisions by rewarding good outcomes and penalising bad ones, similar to training a pet with treats. Unlike supervised learning where we show examples of correct answers, RL agents learn through trial and error in an environment. This approach powers game-playing AIs like AlphaGo and is increasingly used for code optimisation, resource allocation, and automated systems. The Ineffable Intelligence funding shows how RL can learn without human data, making it valuable for scenarios where we can't easily define correct answers upfront. PolicyExploration |