Be Different to Be Better (BD2BB)

Creators
  • R.B. Bernardi
Publication date 12-05-2021
Description BD2BB is a novel language and vision benchmark that requires multimodal models to combine complementary information from the two modalities. Recently, impressive progress has been made in developing universal multimodal encoders suitable for virtually any language and vision task. However, current approaches often require them to combine redundant information provided by language and vision. Inspired by real-life communicative contexts, we propose a novel task where either modality is necessary but not sufficient to make a correct prediction.
Publisher GitHub
Organisations
  • Interfacultary Research - Institute for Logic, Language and Computation (ILLC)
Document type Dataset
Related publication Be Different to Be Better!
Other links https://sites.google.com/view/bd2bb/home https://github.com/sandropezzelle/bd2bb