Q-PROP: SAMPLE-EFFICIENT POLICY GRADIENT WITH AN OFF-POLICY CRITIC (2017)

First Author: G, X

Abstract

No abstract provided

Bibliographic Information

Publication URI: https://openreview.net/forum?id=SJ3rcZcxl

Type: Conference/Paper/Proceeding/Abstract