On-line Active Reward Learning for Policy Optimisation in Spoken Dialogue Systems (2016)

First Author: Su Pei-Hao

Attributed to: Open Domain Statistical Spoken Dialogue Systems funded by EPSRC

No abstract provided

Type: Journal Article/Review

Parent Publication: arXiv e-prints