Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime (2022)

First Author: Leahy J

Attributed to: FAIR: Framework for responsible adoption of Artificial Intelligence in the financial seRvices industry funded by EPSRC

No abstract provided

Type: Conference/Paper/Proceeding/Abstract