Learning binary trees by argmin differentiation

Open Access
Authors Valentina Zantedeschi, Matt Kusner, Vlad Niculae
Publication date 2021
Journal Proceedings of Machine Learning Research
Event 38th International Conference on Machine Learning
Volume 139
Pages (from-to) 12298-12309
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
We address the problem of learning binary decision trees that partition data for some downstream task. We propose to learn discrete parameters (i.e., for tree traversals and node pruning) and continuous parameters (i.e., for tree split functions and prediction functions) simultaneously using argmin differentiation. We do so by sparsely relaxing a mixed-integer program for the discrete parameters, to allow gradients to pass through the program to continuous parameters. We derive customized algorithms to efficiently compute the forward and backward passes. This means that our tree learning procedure can be used as an (implicit) layer in arbitrary deep networks, and can be optimized with arbitrary loss functions. We demonstrate that our approach produces binary trees that are competitive with existing single tree and ensemble approaches, in both supervised and unsupervised settings. Further, apart from greedy approaches (which do not have competitive accuracies), our method is faster to train than all other tree-learning baselines we compare with.
Document type Article
Note International Conference on Machine Learning, 18-24 July 2021, Virtual. With supplementary file.
Language English
Published at https://proceedings.mlr.press/v139/zantedeschi21a.html
Downloads
zantedeschi21a (Final published version)
Supplementary materials