汇聚全球视觉新闻资讯
你所在的位置:汇视网 > 关注 >快讯

AlphaGo设定赢棋可能性最大化未追求胜出数目

发布时间:2017-05-24 10:04  来源:汇视网   编辑:文辉  阅读量:17536   

5月23日,当今世界围棋第一人柯洁九,23日下午在这里执黑289手以四分之一子的微弱劣势负于计算机围棋程序"阿尔法围棋",在围棋"人机大战"三番棋中以0:1落后。

AlphaGo设定赢棋可能性最大化未追求胜出数目

AlphaGo团队在赛后接受媒体采访,对于结果,AlphaGo团队表示系统设定为赢棋可能性最大化,面临决策时会选择稳妥的路线。

Q: 这次比赛柯洁小负AlphaGo,有一种比较有脑洞的说法是AlphaGo已经不满足于仅仅获胜了,而是希望能具体地控制输赢的差距。请问AlphaGo真的达到这样的程度了吗?如果没有的话,还有多久才能做到?

Demis Hassabis: So AlphaGo always tries to maximize its probability of winning rather than to maximize the size of the winning margin. So whenever we see it has a decision to make, it will always try to pick the more certain path… that it thinks is a more certain path to victory with less risk. So often in positions that’s what we see the tradeoff that AlphaGo is making is to decide about how certain it is about the margin of victory and how likely the probability of victory. David, if you want to add anything to that.

AlphaGo总是尽量将赢棋的可能性最大化而不是将赢的目数最大化。我们看到它每次面临决策的时候,总是会选择它自己认为更稳妥、风险更小的路线。在它的落子中我们能看到AlphaGo在判断赢得的目数有多稳妥和胜出的可能性时所做出的权衡。

David Silver: So…it’s a very interesting question. The way AlphaGo works is as Demis said, it maximizes the probability of winning the game. This means that we program into AlphaGo a goal. That goal is in match what we really want it to do, which is to try and win games of Go. You could imagine other objectives being applied, such as maximizing the gap, the margin of victory, but this is not the objective that we chose for AlphaGo to play in the game of Go. So if you really focus on victory, then it leads to these behaviors where AlphaGo will try to win, and in doing so, it may give up a number of points in favor of actually just reducing any risks it may perceives, even if that risk seems to be very small.

很有趣的问题。AlphaGo的决策过程就像是Demis所说的那样,它最大化赢棋的可能性。意思就是我们给AlphaGo植入了一个目标,这个目标才是我们想要它在比赛中做到的,也就是赢得比赛。你可以想象有其他的目标被设定进去,比如将胜出的目数最大化,但是这不是我们为AlphaGo选定的目标。当你把赢棋作为中心的时候,就会导致AlphaGo在争取赢棋时的一些行为,它可能会放弃一些目数以求降低它感知到的风险,即使这个风险非常小。

棋局回顾:

·人机大战首局柯洁执黑先行 在传统开局中求变化·AlphaGo中盘阶段显示实力 柯洁遇考验陷入长考·AlphaGo大局清晰占主动 柯洁孤注一掷图谋大龙·柯洁官子阶段苦觅逆转良机 AlphaGo144手略意外

嘉宾讲棋:

·党毅飞、范蔚菁解析人机大战 柯洁 VS AlphaGo(1) ·党毅飞、范蔚菁解析人机大战 柯洁 VS AlphaGo(2) ·党毅飞、范蔚菁解析人机大战 柯洁 VS AlphaGo(3) ·党毅飞、范蔚菁解析人机大战 柯洁 VS AlphaGo(4) ·党毅飞、范蔚菁解析人机大战 柯洁 VS AlphaGo(5) ·党毅飞、范蔚菁解析人机大战 柯洁 VS AlphaGo(6)

郑重声明:此文内容为本网站转载企业宣传资讯,目的在于传播更多信息,与本站立场无关。仅供读者参考,并请自行核实相关内容。

相关搜索热词:
下一篇: ATP男子单打排名