Convergence of Policy Gradient for Entropy Regularized MDPs with Neural Network Approximation in the Mean-Field Regime (2022)

Abstract

No abstract provided

Bibliographic Information

Publication URI: https://proceedings.mlr.press/v162/leahy22a.html

Type: Conference/Paper/Proceeding/Abstract