Tim Wheeler has a good tutorial on how AlphaGo Zero works.
According to this Cornell University page, Google’s AlphaGo Zero approach has been generalized into AlphaZero, an algorithm that learns new games given only the game rules: “In this paper, we generalise this approach into a single AlphaZero algorithm that can achieve, tabula rasa, superhuman performance in many challenging domains.”
deepmind.com has a great new article titled “AlphaGo Zero: Learning from scratch.”
NPR reports that AlphaGo Zero, a new version of Google’s AlphaGo software, became a Go master solely by playing against itself, i.e., by using only reinforcement learning (as opposed to supervised learning). Per the report on Nature.com, “AlphaGo Zero achieved superhuman performance, winning 100–0 against the previously published, champion-defeating AlphaGo.”