Just-In-Time Data Distribution for Analytical Query Processing

Authors
Publication date 2012
Host editors
  • T. Morzy
  • T. Härder
  • R. Wrembel
Book title Advances in Databases and Information Systems
Book subtitle 16th East European Conference, ADBIS 2012, Poznań, Poland, September 18-21, 2012: proceedings
ISBN
  • 9783642330735
ISBN (electronic)
  • 9783642330742
Series Lecture Notes in Computer Science
Event 16th Advances in Databases and Information Systems
Pages (from-to) 209-222
Publisher Heidelberg: Springer
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
Distributed processing commonly requires data spread across machines using a priori static or hash-based data allocation. In this paper, we explore an alternative approach that starts from a master node in control of the complete database, and a variable number of worker nodes for delegated query processing. Data is shipped just-in-time to the worker nodes using a need to know policy, and is being reused, if possible, in subsequent queries. A bidding mechanism among the workers yields a scheduling with the most efficient reuse of previously shipped data, minimizing the data transfer costs.
Just-in-time data shipment allows our system to benefit from locally available idle resources to boost overall performance. The system is maintenance-free and allocation is fully transparent to users. Our experiments show that the proposed adaptive distributed architecture is a viable and flexible alternative for small scale MapReduce-type of settings.
Document type Conference contribution
Language English
Published at https://doi.org/10.1007/978-3-642-33074-2_16
Permalink to this page
Back