The Mirage of Action-Dependent Baselines in Reinforcement Learning (2018)
Attributed to:
Machine Learning for Hearing Aids: Intelligent Processing and Fitting
funded by
EPSRC
Abstract
No abstract provided
Bibliographic Information
Digital Object Identifier: http://dx.doi.org/10.48550/arxiv.1802.10031
Publication URI: https://arxiv.org/abs/1802.10031
Type: Journal Article/Review
Parent Publication: arXiv e-prints