楼主: suntotal
2272 1

[数据挖掘理论与案例] Mastering the game of Go with deep neural networks and tree search [推广有奖]

  • 1关注
  • 0粉丝

已卖:602份资源

本科生

45%

还不是VIP/贵宾

-

威望
0
论坛币
95386 个
通用积分
5.4831
学术水平
1 点
热心指数
1 点
信用等级
1 点
经验
1696 点
帖子
59
精华
0
在线时间
117 小时
注册时间
2007-11-8
最后登录
2021-8-16

楼主
suntotal 发表于 2016-3-11 19:36:27 |AI写论文

+2 论坛币
k人 参与回答

经管之家送您一份

应届毕业生专属福利!

求职就业群
赵安豆老师微信:zhaoandou666

经管之家联合CDA

送您一个全额奖学金名额~ !

感谢您参与论坛问题回答

经管之家送您两个论坛币!

+2 论坛币
2016年一月公布的AlphaGO人工智能算法


The game of Go has long been viewed as the most challenging of classic games for artificial intelligence owing to its enormous search space and the difficulty of evaluating board positions and moves. Here we introduce a new approach to computer Go that uses ‘value networks’ to evaluate board positions and ‘policy networks’ to select moves. These deep neural networks are trained by a novel combination of supervised learning from human expert games, and reinforcement learning from games of self-play. Without any lookahead search, the neural networks play Go at the level of state- of-the-art Monte Carlo tree search programs that simulate thousands of random games of self-play. We also introduce a new search algorithm that combines Monte Carlo simulation with value and policy networks. Using this search algorithm, our program AlphaGo achieved a 99.8% winning rate against other Go programs, and defeated the human European Go champion by 5 games to 0. This is the first time that a computer program has defeated a human professional player in the full-sized game of Go, a feat previously thought to be at least a decade away.

        
向作者们致敬!

二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

关键词:Mastering Networks network Master search search

10_1038_nature16961.pdf
下载链接: https://bbs.pinggu.org/a-1989196.html

3.32 MB

需要: 12 个论坛币  [购买]

AlphaGO在《自然》杂志上发表的算法论文

沙发
soccy(未真实交易用户) 发表于 2016-3-11 20:27:32
[em33]

您需要登录后才可以回帖 登录 | 我要注册

本版微信群
加好友,备注cda
拉您进交流群
GMT+8, 2025-12-26 23:30