COMP 579 Reinforcement Learning (4 credits)

Note: This is the 2021–2022 eCalendar. Update the year in your browser's URL bar for the most recent version of this page.

Offered by: Computer Science (Faculty of Science)

Overview

Computer Science (Sci): Bandit algorithms, finite Markov decision processes, dynamic programming, Monte Carlo methods, temporal-difference learning, bootstrapping, planning, approximation methods, on-policy versus off-policy learning, policy gradient methods, temporal abstraction, and inverse reinforcement learning.

Terms: Winter 2022

Instructors: Precup, Doina (Winter)

  • Prerequisite: A university-level course in machine learning, such as COMP 451 or COMP 551. Background in calculus, linear algebra, and probability at the level of MATH 222, MATH 223, and MATH 323, respectively.
