Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic (2017)
Attributed to:
Unifying audio signal processing and machine learning: a fundamental framework for machine hearing
funded by
EPSRC
Abstract
No abstract provided
Bibliographic Information
Publication URI: https://openreview.net/group?id=ICLR.cc/2017/conference
Type: Conference/Paper/Proceeding/Abstract