Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime (2022)
Attributed to:
FAIR: Framework for responsible adoption of Artificial Intelligence in the financial seRvices industry
funded by
EPSRC
Abstract
No abstract provided
Bibliographic Information
Publication URI: https://proceedings.mlr.press/v162/leahy22a.html
Type: Conference/Paper/Proceeding/Abstract