CSE 337 Reinforcement Learning (3)


Hector Munoz-Avila

Course Description

Algorithms for automated learning from interactions with the environment to optimize long-term performance. Markov decision processes, dynamic programming, temporal-difference learning, Monte Carlo reinforcement learning methods. Credit will not be given for both CSE 337 and CSE 437. Prerequisite: Math 231 and CSE 109.


Richard S. Sutton and Andrew G. Barto, "Reinforcement Learning: An Introduction (Adaptive Computation and Machine Learning)", A Bradford Book (1998), 978-0262193986

© 2014-2016 Computer Science and Engineering, P.C. Rossin College of Engineering & Applied Science, Lehigh University, Bethlehem PA 18015.