log in  |  register  |  feedback?  |  help  |  web accessibility
Logo
MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning by van der Pol et al. (2020)
Joshua McClellan - UMD
Tuesday, February 15, 2022, 5:00-6:00 pm Calendar
  • You are subscribed to this talk through .
  • You are watching this talk through .
  • You are subscribed to this talk. (unsubscribe, watch)
  • You are watching this talk. (unwatch, subscribe)
  • You are not subscribed to this talk. (watch, subscribe)
Abstract

Paper: MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning by van der Pol et al. (2020)

URL: https://proceedings.neurips.cc/paper/2020/hash/2be5f9c2e3620eb73c2972d7552b6cb5-Abstract.html

Paper Abstract: "This paper introduces MDP homomorphic networks for deep reinforcement learning. MDP homomorphic networks are neural networks that are equivariant under symmetries in the joint state-action space of an MDP. Current approaches to deep reinforcement learning do not usually exploit knowledge about such structure. By building this prior knowledge into policy and value networks using an equivariance constraint, we can reduce the size of the solution space. We specifically focus on group-structured symmetries (invertible transformations). Additionally, we introduce an easy method for constructing equivariant network layers numerically, so the system designer need not solve the constraints by hand, as is typically done. We construct MDP homomorphic MLPs and CNNs that are equivariant under either a group of reflections or rotations. We show that such networks converge faster than unstructured baselines on CartPole, a grid world and Pong."

For more information and our full schedule, see our website (https://leesharma.com/physics-ai-reading-group/)

This talk is organized by Lee Sharma