On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems (2016)

First Author: Su Pei-Hao
Attributed to:  Open Domain Statistical Spoken Dialogue Systems funded by EPSRC

Abstract

No abstract provided

Bibliographic Information

Digital Object Identifier: http://dx.doi.org/10.48550/arxiv.1605.07669

Publication URI: http://dx.doi.org/10.48550/arxiv.1605.07669

Type: Journal Article/Review

Parent Publication: arXiv e-prints