Just-In-Time Data Distribution for Analytical Query Processing

M. Ivanova; M. Kersten; F. Groffen

doi:https://doi.org/10.1007/978-3-642-33074-2_16

Just-In-Time Data Distribution for Analytical Query Processing

Authors	M. Ivanova M. Kersten F. Groffen
Publication date	2012
Host editors	T. Morzy T. Härder R. Wrembel
Book title	Advances in Databases and Information Systems
Book subtitle	16th East European Conference, ADBIS 2012, Poznań, Poland, September 18-21, 2012: proceedings
ISBN	9783642330735
ISBN (electronic)	9783642330742
Series	Lecture Notes in Computer Science
Event	16th Advances in Databases and Information Systems
Pages (from-to)	209-222
Publisher	Heidelberg: Springer
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	Distributed processing commonly requires data spread across machines using a priori static or hash-based data allocation. In this paper, we explore an alternative approach that starts from a master node in control of the complete database, and a variable number of worker nodes for delegated query processing. Data is shipped just-in-time to the worker nodes using a need to know policy, and is being reused, if possible, in subsequent queries. A bidding mechanism among the workers yields a scheduling with the most efficient reuse of previously shipped data, minimizing the data transfer costs. Just-in-time data shipment allows our system to benefit from locally available idle resources to boost overall performance. The system is maintenance-free and allocation is fully transparent to users. Our experiments show that the proposed adaptive distributed architecture is a viable and flexible alternative for small scale MapReduce-type of settings.
Document type	Conference contribution
Language	English
Published at	https://doi.org/10.1007/978-3-642-33074-2_16 (Final published version)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Just-In-Time Data Distribution for Analytical Query Processing