동적계획법과 강화학습 강의 정리 노트

Date Subject Note
Sep.8, 2022 Markov Decision Process Lecture 1-1 Lecture 1-2
Sep.15, 2022 Value Evaluation and Policy iteration Lecture 2-1 Lecture 2-2
Sep.22, 2022 Value Evaluation and Policy iteration Lecture 3-1 Lecture 3-2
Sep.29, 2022 Model Free Prediction / Simulaiton Lecture 4-1 Lecture 4-2
Oct.6, 2022 Model Free Policy Control Lecture 5-1 Lecture 5-2
Oct.27, 2022 Monte Carlo, TD, SARSA, Q-learning Lecture 7
Nov.10, 2022 Value Function Approximation Lecture 8
Nov.17, 2022 Value Function Approximation Lecture 8
Nov.24, 2022 Policy Gradient Method Lecture 9