Explicit Explore, Exploit, or Escape ($$E^4$$): near-optimal safety-constrained reinforcement learning in polynomial time (2022)

First Author: Bossens D

Attributed to: UKRI Trustworthy Autonomous Systems Hub funded by SPF

Abstract

No abstract provided

Bibliographic Information

Digital Object Identifier: http://dx.doi.org/10.1007/s10994-022-06201-z

Publication URI: http://dx.doi.org/10.1007/s10994-022-06201-z

Type: Journal Article/Review

Parent Publication: Machine Learning

Issue: 3