Imitating play from game trajectories: Temporal difference learning versus preference learning (2012)

First Author: Runarsson T
Attributed to: UCT for Games and Beyond funded by EPSRC

Abstract

No abstract provided

Bibliographic Information

Digital Object Identifier: http://dx.doi.org/10.1109/cig.2012.6374141

Publication URI: http://dx.doi.org/10.1109/cig.2012.6374141

Type: Conference/Paper/Proceeding/Abstract