Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process (2022)
Attributed to:
Statistical Methods in Offline Reinforcement Learning
funded by
EPSRC
Abstract
No abstract provided
Bibliographic Information
Digital Object Identifier: http://dx.doi.org/10.1080/01621459.2022.2110878
Publication URI: http://dx.doi.org/10.1080/01621459.2022.2110878
Type: Journal Article/Review
Parent Publication: Journal of the American Statistical Association
Issue: 545