The Mirage of Action-Dependent Baselines in Reinforcement Learning (2018)

First Author: Tucker George

Attributed to: Machine Learning for Hearing Aids: Intelligent Processing and Fitting funded by EPSRC

No abstract provided

Type: Journal Article/Review

Parent Publication: arXiv e-prints