Information distance in multiples

Authors
Publication date 04-2011
Journal IEEE Transactions on Information Theory
Volume | Issue number 57 | 4
Pages (from-to) 2451-2456
Number of pages 6
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract

Information distance is a parameter-free similarity measure based on compression, used in pattern recognition, data mining, phylogeny, clustering and classification. The notion of information distance is extended from pairs to multiples (finite lists). We study maximal overlap, metricity, universality, minimal overlap, additivity and normalized information distance in multiples. We use the theoretical notion of Kolmogorov complexity which for practical purposes is approximated by the length of the compressed version of the file involved, using a real-world compression program.

Document type Article
Language English
Published at https://doi.org/10.1109/TIT.2011.2110130
Other links https://www.scopus.com/pages/publications/79952852160
Permalink to this page
Back