On the Identification and Mitigation of Weaknesses in the Knowledge Gradient Policy for Multi-Armed Bandits (2016)
Attributed to: Lancaster University - Equipment Account (funded by EPSRC)
Abstract
No abstract provided
Bibliographic Information
Digital Object Identifier: http://dx.doi.org/10.1017/s0269964816000279
Publication URI: http://dx.doi.org/10.1017/s0269964816000279
Type: Journal Article/Review
Parent Publication: Probability in the Engineering and Informational Sciences
Issue: 2