MDP homomorphic networks: Group symmetries in reinforcement learning

E. Van Der Pol; D. Worrall; H. Van Hoof; F. Oliehoek; M. Welling

MDP homomorphic networks: Group symmetries in reinforcement learning

Authors	E. Van Der Pol D. Worrall H. Van Hoof F. Oliehoek M. Welling
Publication date	2021
Host editors	H. Larochelle M. Ranzato R. Hadsell M.F. Balcan H. Lin
Book title	34th Concerence on Neural Information Processing Systems (NeurIPS 2020)
Book subtitle	online, 6-12 December 2020
ISBN	9781713829546
Series	Advances in Neural Information Processing Systems
Event	Advances in Neural Information Processing Systems 2020
Volume \| Issue number	6
Pages (from-to)	4199-4210
Publisher	San Diego, CA: Neural Information Processing Systems Foundation
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	This paper introduces MDP homomorphic networks for deep reinforcement learning. MDP homomorphic networks are neural networks that are equivariant under symmetries in the joint state-action space of an MDP. Current approaches to deep reinforcement learning do not usually exploit knowledge about such structure. By building this prior knowledge into policy and value networks using an equivariance constraint, we can reduce the size of the solution space. We specifically focus on group-structured symmetries (invertible transformations). Additionally, we introduce an easy method for constructing equivariant network layers numerically, so the system designer need not solve the constraints by hand, as is typically done. We construct MDP homomorphic MLPs and CNNs that are equivariant under either a group of reflections or rotations. We show that such networks converge faster than unstructured baselines on CartPole, a grid world and Pong.
Document type	Conference contribution
Note	With supplemental file
Language	English
Published at	https://papers.nips.cc/paper/2020/hash/2be5f9c2e3620eb73c2972d7552b6cb5-Abstract.html (Accepted author manuscript)
Other links	https://www.proceedings.com/59066.html
Downloads	NeurIPS-2020-mdp-homomorphic-networks-group-symmetries-in-reinforcement-learning-Paper (Accepted author manuscript)
Supplementary materials	NeurIPS-2020-mdp-homomorphic-networks-group-symmetries-in-reinforcement-learning-Supplemental
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

MDP homomorphic networks: Group symmetries in reinforcement learning