A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes (2022)

First Author: Shi C

Attributed to: Statistical Methods in Offline Reinforcement Learning funded by EPSRC

Abstract

No abstract provided

Bibliographic Information

Publication URI: https://proceedings.mlr.press/v162/shi22f/shi22f.pdf

Type: Conference/Paper/Proceeding/Abstract

Volume: 162