資源簡介
Google DeepMind的David Silver的強化學習課程講義,包括Markov Decision Processes、Planning by Dynamic Programming、Model-Free Prediction、Model-Free Control、Function Approximation、Policy Gradient Methods、Integrating Learning and Planning、Exploration and Exploitation以及游戲案例分析。視頻:https://www.youtube.com/playlist?list=PL5X3mDkKaJrL42i_jhE4N-p6E2Ol62Ofa
代碼片段和文件信息
?屬性????????????大小?????日期????時間???名稱
-----------?---------??----------?-----??----
?????文件?????8140824??2016-03-01?05:25??Case?Study?-?RL?in?Games.pdf
?????文件?????2997953??2016-03-01?05:25??Lecture?1?Introduction?to?Reinforcement?Learning.pdf
?????文件??????835315??2016-03-01?05:25??Lecture?2?Markov?Decision?Processes.pdf
?????文件??????823976??2016-03-01?05:25??Lecture?3?Planning?by?Dynamic?Programming.pdf
?????文件?????1455589??2016-03-01?05:25??Lecture?4?Model-Free?Prediction.pdf
?????文件?????1494703??2016-03-01?05:25??Lecture?5?Model-Free?Control.pdf
?????文件?????1996806??2016-03-01?05:25??Lecture?6?Value?Function?Approximation.pdf
?????文件?????1874832??2016-03-01?05:25??Lecture?7?Policy?Gradient?Methods.pdf
?????文件?????5746478??2016-03-01?05:25??Lecture?8?Integrating?Learning?and?Planning.pdf
?????文件?????1339671??2016-03-01?05:25??Lecture?9?Exploration?and?Exploitation.pdf
-----------?---------??----------?-----??----
?????文件?????8140824??2016-03-01?05:25??Case?Study?-?RL?in?Games.pdf
?????文件?????2997953??2016-03-01?05:25??Lecture?1?Introduction?to?Reinforcement?Learning.pdf
?????文件??????835315??2016-03-01?05:25??Lecture?2?Markov?Decision?Processes.pdf
?????文件??????823976??2016-03-01?05:25??Lecture?3?Planning?by?Dynamic?Programming.pdf
?????文件?????1455589??2016-03-01?05:25??Lecture?4?Model-Free?Prediction.pdf
?????文件?????1494703??2016-03-01?05:25??Lecture?5?Model-Free?Control.pdf
?????文件?????1996806??2016-03-01?05:25??Lecture?6?Value?Function?Approximation.pdf
?????文件?????1874832??2016-03-01?05:25??Lecture?7?Policy?Gradient?Methods.pdf
?????文件?????5746478??2016-03-01?05:25??Lecture?8?Integrating?Learning?and?Planning.pdf
?????文件?????1339671??2016-03-01?05:25??Lecture?9?Exploration?and?Exploitation.pdf
評論
共有 條評論