Modelling Visual Properties and Visual Context in Multimodal Semantics

Authors	C. Davis L. Bulat A. Vero E. Shutova
Publication date	12-2018
Book title	Visually Grounded Interaction and Language (ViGIL)
Book subtitle	NeurIPS 2018 Workshop, Montreal, Canada. Accepted Papers
Event	Visually Grounded Interaction and Language (ViGIL)
Article number	1
Number of pages	6
Publisher	NIPS
Organisations	Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Abstract	Multimodal semantic models that extend linguistic representations with additional perceptual input have proved successful in a range of natural language processing (NLP) tasks. However, existing research has extracted visual features from complete images, and has not examined how different kinds of visual information impact performance. We construct multimodal models that differentiate between internal visual properties of the objects and their external visual context. We evaluate the models on the task of decoding brain activity associated with the meanings of nouns, demonstrating their advantage over those based on complete images.
Document type	Conference contribution
Language	English
Published at	https://nips2018vigil.github.io/static/papers/accepted/1.pdf (Final published version)
Downloads	1-12 (Final published version)
Permalink to this page

Back

UvA-DARE