Order:
  1. Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm.David Silver, Thomas Hubert, Julian Schrittwieser, Ioannis Antonoglou, Matthew Lai, Arthur Guez, Marc Lanctot, Laurent Sifre, Dharshan Kumaran, Thore Graepel, Timothy Lillicrap, Karen Simonyan & Demis Hassabis - 2017 - .
    No categories
     
    Export citation  
     
    Bookmark   9 citations  
  2.  21
    The Hanabi challenge: A new frontier for AI research.Nolan Bard, Jakob N. Foerster, Sarath Chandar, Neil Burch, Marc Lanctot, H. Francis Song, Emilio Parisotto, Vincent Dumoulin, Subhodeep Moitra, Edward Hughes, Iain Dunning, Shibl Mourad, Hugo Larochelle, Marc G. Bellemare & Michael Bowling - 2020 - Artificial Intelligence 280 (C):103216.
  3.  6
    Negotiating team formation using deep reinforcement learning.Yoram Bachrach, Richard Everett, Edward Hughes, Angeliki Lazaridou, Joel Z. Leibo, Marc Lanctot, Michael Johanson, Wojciech M. Czarnecki & Thore Graepel - 2020 - Artificial Intelligence 288 (C):103356.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  4.  6
    Algorithms for computing strategies in two-player simultaneous move games.Branislav Bošanský, Viliam Lisý, Marc Lanctot, Jiří Čermák & Mark H. M. Winands - 2016 - Artificial Intelligence 237:1-40.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark