A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes (2022)
Attributed to:
Statistical Methods in Offline Reinforcement Learning
funded by
EPSRC
Abstract
No abstract provided
Bibliographic Information
Publication URI: https://proceedings.mlr.press/v162/shi22f/shi22f.pdf
Type: Conference/Paper/Proceeding/Abstract
Volume: 162