ON THE IDENTIFICATION AND MITIGATION OF WEAKNESSES IN THE KNOWLEDGE GRADIENT POLICY FOR MULTI-ARMED BANDITS (2016)

First Author: Edwards J
Attributed to:  Lancaster University - Equipment Account funded by EPSRC

Abstract

No abstract provided

Bibliographic Information

Digital Object Identifier: http://dx.doi.org/10.1017/s0269964816000279

Publication URI: http://dx.doi.org/10.1017/s0269964816000279

Type: Journal Article/Review

Parent Publication: Probability in the Engineering and Informational Sciences

Issue: 2