Chapter 1: IntroductionChapter 2: Multi-armed BanditsChapter 3: Finite Markov Decision ProcessesChapter 4: Dynamic ProgrammingChapter 5: Monte Carlo MethodsChapter 6: Temporal-Difference LearningChapter 7: n-step BootstrappingChapter 8: Planning and Learning with Tabular Methods