Research press release







A new version of the AlphaGo computer program is able to teach itself to rapidly master the classic strategy game Go, starting from a blank slate and without human input, reports a paper published in Nature this week. The new program, called AlphaGo Zero, defeated its predecessor (which defeated Go champion Lee Sedol in a tournament in March 2016) by 100 games to 0.

A grand challenge for artificial intelligence is to develop an algorithm that learns challenging concepts from a blank slate and with superhuman proficiency. To beat world-champion human players at Go, a previous version of AlphaGo was trained through a combination of supervised learning based on millions of human expert moves and reinforcement learning from self-play. That version of AlphaGo was trained over several months and required multiple machines and 48 TPUs (specialized chips for neural network training).

Here, David Silver, Julian Schrittwieser, Karen Simonyan, Demis Hassabis and colleagues introduce AlphaGo Zero, which learns solely from the games that it plays against itself, starting from random moves, with only the board and pieces as inputs and without human data. AlphaGo Zero uses a single neural network, which is trained to predict the program’s own move selection and the winner of its games, improving with each iteration of self-play. The new program uses a single machine and 4 TPUs.

After a few days of training, including almost 5 million games of self-play, AlphaGo Zero could outperform humans and defeat all previous versions of AlphaGo. As the program trained, it independently discovered some of the same game principles that took humans thousands of years to conceptualize and also developed novel strategies that provide new insights into this ancient game.

doi: 10.1038/nature24270

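"Nature Research Highlights" is a translation of a release prepared by the Nature press office for members of the media. For more accurate and detailed information, please be sure to consult the original paper.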
