Structural Properties as Proxy for Semantic Relevance in RDF Graph Sampling

L. Rietveld; R. Hoekstra; S. Schlobach; C. Guéret

doi:https://doi.org/10.1007/978-3-319-11915-1_6

Authors	L. Rietveld; R. Hoekstra; S. Schlobach; C. Guéret
Publication date	2014
Host editors	P. Mika; T. Tudorache; A. Bernstein; C. Welty; C. Knoblock; D. Vrandečić; P. Groth; N. Noy; K. Janowicz; C. Goble
Book title	The Semantic Web – ISWC 2014
Book subtitle	13th International Semantic Web Conference, Riva del Garda, Italy, October 19-23, 2014: proceedings
ISBN	9783319119144
ISBN (electronic)	9783319119151
Series	Lecture Notes in Computer Science
Event	International Semantic Web Conference (ISWC 2014)
Volume | Issue number	2
Pages (from-to)	81-96
Publisher	Cham: Springer
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI); Faculty of Law (FdR) - Leibniz Center for Law (FdR)
Abstract	The Linked Data cloud has grown to become the largest knowledge base ever constructed. Its size is now turning into a major bottleneck for many applications. In order to facilitate access to this structured information, this paper proposes an automatic sampling method targeted at maximizing answer coverage for applications using SPARQL querying. The approach presented in this paper is novel: no similar RDF sampling approaches exist. Additionally, the concept of creating a sample aimed at maximizing SPARQL answer coverage is unique. We empirically show that the relevance of triples for sampling (a semantic notion) is influenced by the topology of the graph (purely structural), and can be determined without prior knowledge of the queries. Experiments show a significantly higher recall of topology-based sampling methods over random and naive baseline approaches (e.g., up to 90% for Open-BioMed at a sample size of 6%).
Document type	Conference contribution
Language	English
Published at	https://doi.org/10.1007/978-3-319-11915-1_6 (Final published version)

UvA-DARE

Digital Academic Repository
