alphago

AlphaZero generalized to learn more games by itself

From this Cornell University page, Google’s AlphaZero algorithm has been generalized to learn new games given only the game rules: “In this paper, we generalise this approach into a single AlphaZero algorithm that can achieve, tabula rasa, superhuman performance in many challenging domains.

AlphaGo Zero whips AlphaGo, 100-0

NPR reports that a new version of Google’s AlphaGo Zero software became a Go master by learning to play the game only by playing itself, i.e., only by using reinforcement learning (as opposed to supervised learning). Per the report in Nature.com, “AlphaGo Zero achieved superhuman performance, winning 100–0 against the previously published, champion-defeating AlphaGo.”