Linear-Quadratic Mean-Field Reinforcement Learning: Convergence of Policy Gradient Methods René Carmona , Mathieu Laurière , Zongjun Tan Sep 1, 2019 PDF Cite