Learning rewards from exploratory demonstrations using probabilistic temporal ranking (2023)
Attributed to:
UKRI Trustworthy Autonomous Systems Node in Governance and Regulation
funded by
SPF
Abstract
No abstract provided
Bibliographic Information
Digital Object Identifier: http://dx.doi.org/10.1007/s10514-023-10120-w
Publication URI: http://dx.doi.org/10.1007/s10514-023-10120-w
Type: Journal Article/Review
Parent Publication: Autonomous Robots
Issue: 6