Meet Charles, big data query advisor

Open Access
Authors
Publication date 2013
Book title 6th Biennial Conference on Innovative Data Systems Research: CIDR 2013: January 6-9, 2013, Asilomar, California
Event CIDR 2013
Pages (from-to) 94
Publisher CIDRB
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
In scientific data management and business analytics, the most informative queries are a holy grail. Data collection becomes increasingly simpler, yet data exploration gets significantly harder. Exploratory querying is likely to return an empty or an overwhelming result set. On the other hand, data mining algorithms require extensive preparation, ample time and do not scale well.
In this paper, we address this challenge at its core, i.e., how to query the query space associated with a given database. The space considered is formed by conjunctive predicates. To express them, we introduce the Segmentation Description Language (SDL). The user provides a query. Charles, our query advisory system, breaks its extent into meaningful segments and returns the subsequent SDL descriptions. This provides insight into the set described and offers the user directions for further exploration.
We introduce a novel algorithm to generate SDL answers. We evaluate them using four orthogonal criteria: homogeneity, simplicity, breadth, and entropy. A prototype implementation has been constructed and the landscape of follow-up research is sketched.
Document type Conference contribution
Language English
Published at http://www.cidrdb.org/cidr2013/Papers/CIDR13_Paper94.pdf
Downloads
CIDR13_Paper94 (Final published version)
Permalink to this page
Back