The Mirage of Action-Dependent Baselines in Reinforcement Learning (2018)

First Author: George Tucker

Abstract

No abstract provided

Bibliographic Information

Digital Object Identifier: http://dx.doi.org/10.48550/arxiv.1802.10031

Publication URI: https://arxiv.org/abs/1802.10031

Type: Journal Article/Review

Parent Publication: arXiv e-prints