Relative Representations Topological and Geometric Perspectives

Open Access
Authors
Publication date 2024
Journal Proceedings of Machine Learning Research
Event 2nd Edition of the Workshop on Unifying Representations in Neural Models, UniReps 2024
Volume | Issue number 285
Pages (from-to) 219-231
Number of pages 12
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract

Relative representations are an established approach to zero-shot model stitching, consisting of a non-trainable transformation of the latent space of a deep neural network. Based on insights of topological and geometric nature, we propose two improvements to relative representations. First, we introduce a normalization procedure in the relative transformation, resulting in invariance to non-isotropic rescalings and permutations. The latter coincides with the symmetries in parameter space induced by common activation functions. Second, we propose to deploy topological densification when fine-tuning relative representations, a topological regularization loss encouraging clustering within classes. We provide an empirical investigation on a natural language task, where both the proposed variations yield improved performance on zero-shot model stitching.

Document type Article
Note Proceedings of UniReps: the Second Edition of the Workshop on Unifying Representations in Neural Models, 14 December 2024, Vancouver Convention Center, Vancouver, Canada. - Revised version of paper available at ArXiv.
Language English
Published at https://doi.org/10.48550/arXiv.2409.10967
Published at https://proceedings.mlr.press/v285/garcia-castellanos24a.html
Other links https://www.scopus.com/pages/publications/105014754343
Downloads
garcia-castellanos24a-1 (Final published version)
2409.10967v3 (Other version)
Permalink to this page
Back