Visual Word Ambiguity

J.C. van Gemert; C.J. Veenman; A.W.M. Smeulders; J.M. Geusebroek

doi:https://doi.org/10.1109/TPAMI.2009.132

Visual Word Ambiguity

Authors	J.C. van Gemert C.J. Veenman A.W.M. Smeulders J.M. Geusebroek
Publication date	2010
Journal	IEEE Transactions on Pattern Analysis and Machine Intelligence
Volume \| Issue number	32 \| 7
Pages (from-to)	1271-1283
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	This paper studies automatic image classification by modeling soft assignment in the popular codebook model. The codebook model describes an image as a bag of discrete visual words selected from a vocabulary, where the frequency distributions of visual words in an image allow classification. One inherent component of the codebook model is the assignment of discrete visual words to continuous image features. Despite the clear mismatch of this hard assignment with the nature of continuous features, the approach has been successfully applied for some years. In this paper, we investigate four types of soft assignment of visual words to image features. We demonstrate that explicitly modeling visual word assignment ambiguity improves classification performance compared to the hard assignment of the traditional codebook model. The traditional codebook model is compared against our method for five well-known data sets: 15 natural scenes, Caltech-101, Caltech-256, and Pascal VOC 2007/2008. We demonstrate that large codebook vocabulary sizes completely deteriorate the performance of the traditional model, whereas the proposed model performs consistently. Moreover, we show that our method profits in high-dimensional feature spaces and reaps higher benefits when increasing the number of image categories.
Document type	Article
Language	English
Published at	https://doi.org/10.1109/TPAMI.2009.132 (Final published version)
Downloads	vanGemertTPAMI2010 (Submitted manuscript)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Visual Word Ambiguity