Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning (2017)

First Author: Gu Shixiang

Attributed to: Autonomous behaviour and learning in an uncertain world funded by EPSRC

No abstract provided

Type: Journal Article/Review

Parent Publication: arXiv e-prints