Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning (2017)

First Author: Gu S

Abstract

No abstract provided

Bibliographic Information

Publication URI: http://arxiv.org/abs/1706.00387v1

Type: Working Paper

Volume: arXiv1706.00387v1