Explicit Explore, Exploit, or Escape ($$E^4$$): near-optimal safety-constrained reinforcement learning in polynomial time (2022)
Attributed to:
UKRI Trustworthy Autonomous Systems Hub
funded by
SPF
Abstract
No abstract provided
Bibliographic Information
Digital Object Identifier: http://dx.doi.org/10.1007/s10994-022-06201-z
Publication URI: http://dx.doi.org/10.1007/s10994-022-06201-z
Type: Journal Article/Review
Parent Publication: Machine Learning
Issue: 3