菜單總覽

【暑期短課】A Tutorial on Reinforcement Learning - Prof. Benjamin Van Roy

  • 2019.07.15
  • 活動
A Tutorial on Reinforcement Learning

主題: A Tutorial on Reinforcement Learning

報告人: Prof. Benjamin Van Roy, Stanford University

時間: 10:00 am - 11:30 am, July 15 and July 17, 2019

地點: Room 201, Teaching?Building B (July 15)

? ? ? ? ? Room 208, Cheng Dao Building (July 17)

?

?

摘要:

There is sometimes confusion about what reinforcement learning is about. This is partly because the term alternately refers to a problem, a community who work on the problem, and methods developed by this community, some of which have been useful in addressing other problems. The reinforcement learning problem is that faced by an agent interacting with an uncertain environment aiming to maximize rewards it accumulates over time. This tutorial will introduce the problem and basic policy and value function learning algorithms that aim to address it. We will also discuss data efficiency and the role of exploration, generalized value functions, and hierarchical reinforcement learning.

?

簡介:

Benjamin Van Roy is a Professor at Stanford University, where he has served on the faculty since 1998. His research focuses on understanding how an agent interacting with a poorly understood environment can learn over time to make effective decisions. He is interested in the design of efficient reinforcement learning algorithms, understanding what is possible or impossible in this domain, and applying the technology toward the benefit of society. Beyond academia, he leads a DeepMind Research team in Mountain View, and has also led research programs at Unica (acquired by IBM), Enuvis (acquired by SiRF), and Morgan Stanley.?

He is a Fellow of INFORMS and IEEE and has served on the editorial boards of Machine Learning, Mathematics of Operations Research, for which he co-edits the Learning Theory Area, Operations Research, for which he edited the Financial Engineering Area, and the INFORMS Journal on Optimization.

He received the SB in Computer Science and Engineering and the SM and PhD in Electrical Engineering and Computer Science, all from MIT. He has been a recipient of the MIT George C. Newton Undergraduate Laboratory Project Award, the MIT Morris J. Levin Memorial Master's Thesis Award, the MIT George M. Sprowls Doctoral Dissertation Award, the National Science Foundation CAREER Award, the Stanford Tau Beta Pi Award for Excellence in Undergraduate Teaching, and the Management Science and Engineering Department's Graduate Teaching Award. He has held visiting positions as the Wolfgang and Helga Gaul Visiting Professor at the University of Karlsruhe, the Chin Sophonpanich Foundation Professor and the InTouch Professor at Chulalongkorn University, a Visiting Professor at the National University of Singapore, and a Visiting Professor at the Chinese University of Hong Kong, Shenzhen.

福星彩票网 宁明县 | 赣榆县 | 佛教 | 都安 | 鲜城 | 山丹县 | 乌拉特前旗 | 沙雅县 | 育儿 | 荣昌县 | 大姚县 | 腾冲县 | 株洲县 | 资中县 | 鹤壁市 | 博野县 | 三穗县 | 清流县 | 且末县 | 宁化县 | 怀化市 | 邯郸市 | 同江市 | 濮阳市 | 保亭 | 廉江市 | 澄城县 | 洱源县 | 余庆县 | 平顶山市 | 东辽县 | 尚义县 | 疏附县 | 金阳县 | 辽源市 | 犍为县 | 大名县 | 柘城县 | 杂多县 | 抚顺市 | 华蓥市 | 西安市 | 卓尼县 | 怀柔区 | 石柱 | 莱阳市 | 宁德市 | 翁牛特旗 | 姚安县 | 田林县 | 开原市 | 平乐县 | 都安 | 葵青区 | 淄博市 | 定安县 | 方城县 | 阳新县 | 乐都县 | 永昌县 | 哈巴河县 | 三明市 | 乌兰察布市 | 大安市 | 合作市 | 武夷山市 | 新津县 | 凉城县 | 井冈山市 | 上饶县 | 重庆市 | 岢岚县 | 绥滨县 | 安福县 | 东安县 | 武汉市 | 厦门市 | 报价 | 繁昌县 | 阿拉善左旗 | 阿勒泰市 | 天台县 | 西宁市 | 建昌县 | 盐源县 | 昌吉市 | 新闻 | 南充市 | 镇坪县 | 涞水县 | 大洼县 | 东城区 | 延川县 | 荔波县 | 武穴市 | 姜堰市 | 米泉市 | 彭州市 | 密山市 | 苍梧县 | 永昌县 | 夏邑县 | 兴业县 | 曲松县 | 阳泉市 | 万年县 | 邹平县 | 黄冈市 | 建宁县 | 阿鲁科尔沁旗 | 宿迁市 | 沁源县 | 梁河县 | 高密市 | 长岛县 | 北辰区 | 曲阜市 | 正宁县 | 开化县 | 岑溪市 | 莫力 | 麻阳 | 兴仁县 | 湟源县 | 江津市 | 武胜县 | 河池市 | 公主岭市 | 康保县 | 深泽县 | 亚东县 | 平阴县 | 吴桥县 | 九龙城区 | 德安县 | 高台县 | 枞阳县 | 罗甸县 | 三都 | 濮阳市 | 西充县 | 张掖市 | 昌平区 | 庆阳市 | 年辖:市辖区 | 卓尼县 | 收藏 | 砚山县 | 西林县 | 克东县 | 威信县 | 朔州市 | 璧山县 | 顺平县 | 内黄县 | 平利县 | 多伦县 | 绥滨县 | 马鞍山市 | 奉节县 | 桓仁 | 平乐县 | 黑水县 | 泸定县 | 韶山市 | 定兴县 | 扎囊县 | 婺源县 | 格尔木市 | 嘉定区 | 门源 | 宣恩县 | 普兰店市 | 枝江市 | 康马县 | 教育 | 论坛 | 平塘县 | 南投市 | 萨嘎县 | 昌宁县 | 尤溪县 | 南宫市 | 荥经县 | 临安市 | 平塘县 | 扶余县 | 策勒县 | 海阳市 | 潞西市 | 武清区 | 横峰县 | 天门市 | 大邑县 | 道孚县 | 广东省 | 新安县 | 万年县 | 丹阳市 | 图片 | 潮安县 | 永城市 | 金山区 | 德令哈市 | 荆门市 | 讷河市 | 腾冲县 | 怀柔区 | 安溪县 | 巴中市 | 仁布县 | 安塞县 | 盐池县 | 土默特右旗 | 淮北市 | 五大连池市 | 慈溪市 | 正蓝旗 | 墨玉县 | 宁远县 | 抚顺县 | 宣汉县 | 扎赉特旗 | 大港区 | 新宁县 | 宜兰市 | 漳浦县 | 深圳市 | 饶平县 | 德江县 | 武山县 | 贵德县 | 黄平县 | 凤翔县 | 子洲县 | 措美县 | 淮阳县 | 本溪市 | 永德县 | 丽水市 | 惠来县 | 西乌珠穆沁旗 | 漠河县 | 筠连县 | 长子县 | 彭泽县 | 溆浦县 | 天峨县 | 郓城县 | 灌阳县 | 灌阳县 |