MDP homomorphic networks: Group symmetries in reinforcement learning

Open Access
Authors
Publication date 2021
Host editors
  • H. Larochelle
  • M. Ranzato
  • R. Hadsell
  • M.F. Balcan
  • H. Lin
Book title 34th Concerence on Neural Information Processing Systems (NeurIPS 2020)
Book subtitle online, 6-12 December 2020
ISBN
  • 9781713829546
Series Advances in Neural Information Processing Systems
Event Advances in Neural Information Processing Systems 2020
Volume | Issue number 6
Pages (from-to) 4199-4210
Publisher San Diego, CA: Neural Information Processing Systems Foundation
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
This paper introduces MDP homomorphic networks for deep reinforcement learning. MDP homomorphic networks are neural networks that are equivariant under symmetries in the joint state-action space of an MDP. Current approaches to deep reinforcement learning do not usually exploit knowledge about such structure. By building this prior knowledge into policy and value networks using an equivariance constraint, we can reduce the size of the solution space. We specifically focus on group-structured symmetries (invertible transformations). Additionally, we introduce an easy method for constructing equivariant network layers numerically, so the system designer need not solve the constraints by hand, as is typically done. We construct MDP homomorphic MLPs and CNNs that are equivariant under either a group of reflections or rotations. We show that such networks converge faster than unstructured baselines on CartPole, a grid world and Pong.
Document type Conference contribution
Note With supplemental file
Language English
Published at https://papers.nips.cc/paper/2020/hash/2be5f9c2e3620eb73c2972d7552b6cb5-Abstract.html
Other links https://www.proceedings.com/59066.html
Downloads
Supplementary materials
Permalink to this page
Back