A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes (2022)

First Author: Shi C

Abstract

No abstract provided

Bibliographic Information

Publication URI: https://proceedings.mlr.press/v162/shi22f/shi22f.pdf

Type: Conference/Paper/Proceeding/Abstract

Volume: 162