VL-KGE: Vision-Language Models Meet Knowledge Graph Embeddings

Open Access
Authors
Publication date 2026
Book title WWW '26
Book subtitle Proceedings of the ACM Web Conference 2026 : April 13-17, 2026, Dubai, United Arab Emirates
ISBN (electronic)
  • 9798400723070
Event 35th ACM Web Conference, WWW 2026
Pages (from-to) 7552-7563
Number of pages 12
Publisher New York, NY: Association for Computing Machinery
Organisations
  • Faculty of Economics and Business (FEB) - Amsterdam Business School Research Institute (ABS-RI)
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
  • Faculty of Economics and Business (FEB)
Abstract

Real-world multimodal knowledge graphs (MKGs) are inherently heterogeneous, modeling entities that are associated with diverse modalities. Traditional knowledge graph embedding (KGE) methods excel at learning continuous representations of entities and relations, yet they are typically designed for unimodal settings. Recent approaches extend KGE to multimodal settings but remain constrained, often processing modalities in isolation, resulting in weak cross-modal alignment, and relying on simplistic assumptions such as uniform modality availability across entities. Vision - Language Models (VLMs) offer a powerful way to align diverse modalities within a shared embedding space. We propose Vision - Language Knowledge Graph Embeddings (VL-KGE), a framework that integrates cross-modal alignment from VLMs with structured relational modeling to learn unified multimodal representations of knowledge graphs. Experiments on WN9-IMG and two novel fine art MKGs, WikiArt-MKG-v1 and WikiArt-MKG-v2, demonstrate that VL-KGE consistently improves over traditional unimodal and multimodal KGE methods in link prediction tasks. Our results highlight the value of VLMs for multimodal KGE, enabling more robust and structured reasoning over large-scale heterogeneous knowledge graphs.

Document type Conference contribution
Language English
Published at https://doi.org/10.1145/3774904.3792677
Published at https://dl.acm.org/doi/10.1145/3774904.3792677
Other links https://www.scopus.com/pages/publications/105038592950
Downloads
3774904.3792677 (Final published version)
Permalink to this page
Back