On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems (2016)
Attributed to: Open Domain Statistical Spoken Dialogue Systems (funded by EPSRC)
Abstract
No abstract provided
Bibliographic Information
Digital Object Identifier: http://dx.doi.org/10.48550/arxiv.1605.07669
Publication URI: http://dx.doi.org/10.48550/arxiv.1605.07669
Type: Journal Article/Review
Parent Publication: arXiv e-prints