Natural actor and belief critic Reinforcement algorithm for learning parameters of dialogue systems modelled as POMDPs (2011)

First Author: JurcĂ­cek F

Abstract

No abstract provided

Bibliographic Information

Digital Object Identifier: http://dx.doi.org/10.1145/1966407.1966411

Publication URI: http://dx.doi.org/10.1145/1966407.1966411

Type: Journal Article/Review

Parent Publication: ACM Transactions on Speech and Language Processing

Issue: 3