📣 Help Shape the Future of UKRI's Gateway to Research (GtR)

We're improving UKRI's Gateway to Research and are seeking your input! If you would be interested in being interviewed about the improvements we're making and to have your say about how we can make GtR more user-friendly, impactful, and effective for the Research and Innovation community, please email gateway@ukri.org.

Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic (2017)

First Author: Shixiang Gu

Attributed to: Machine Learning for Hearing Aids: Intelligent Processing and Fitting funded by EPSRC

Abstract

No abstract provided

Bibliographic Information

Publication URI: https://openreview.net/group?id=ICLR.cc/2017/conference

Type: Conference/Paper/Proceeding/Abstract