Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning (2017)

First Author: Gu, Shixiang

Abstract

No abstract provided

Bibliographic Information

Publication URI: http://arxiv.org/abs/1706.00387v1

Type: Journal Article/Review

Parent Publication: arXiv e-prints