- www.policygradientbook.org
In Chapter 2, we introduce the idea of policy gradient methods, where the policy is optimized directly by following the gradient of a performance function. We also showcase REINFORCE, the simplest policy gradient algorithm...
Relevance: 21.218203