Large Scale Metric Learning for Distance-Based Image Classification on Open Ended Data Sets

Open Access
Authors
Publication date 2013
Host editors
  • G.M. Farinella
  • S. Battiato
  • R, Cipolla
Book title Advanced topics in computer vision
ISBN
  • 9781447155195
Series Advances in computer vision and pattern recognition
Pages (from-to) 243-276
Publisher London: Springer
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Many real-life large-scale datasets are open-ended and dynamic: new images are continuously added to existing classes, new classes appear over time, and the semantics of existing classes might evolve too. Therefore, we study large-scale image classification methods that can incorporate new classes and training images continuously over time at negligible cost. To this end, we consider two distance-based classifiers, the k-nearest neighbor (k-NN) and nearest class mean (NCM) classifiers. Since the performance of distance-based classifiers heavily depends on the used distance function, we cast the problem into one of learning a low-rank metric, which is shared across all classes. For the NCM classifier, we introduce a new metric learning approach, and we also introduce an extension to allow for richer class representations.

Experiments on the ImageNet 2010 challenge dataset, which contains over one million training images of thousand classes, show that, surprisingly, the NCM classifier compares favorably to the more flexible k-NN classifier. Moreover, the NCM performance is comparable to that of linear SVMs which obtain current state-of-the-art performance. Experimentally we study the generalization performance to classes that were not used to learn the metrics. Using a metric learned on 1,000 classes, we show results for the ImageNet-10K dataset which contains 10,000 classes, and obtain performance that is competitive with the current state-of-the-art, while being orders of magnitude faster.
Document type Chapter
Language English
Published at https://doi.org/10.1007/978-1-4471-5520-1_9
Downloads
MensinkATCV2013 (Submitted manuscript)
Permalink to this page
Back