Imitating play from game trajectories: Temporal difference learning versus preference learning (2012)
Attributed to: UCT for Games and Beyond (funded by EPSRC)
Abstract
No abstract provided
Bibliographic Information
Digital Object Identifier: http://dx.doi.org/10.1109/cig.2012.6374141
Publication URI: http://dx.doi.org/10.1109/cig.2012.6374141
Type: Conference/Paper/Proceeding/Abstract