Just-In-Time Data Distribution for Analytical Query Processing
| Authors |
|
|---|---|
| Publication date | 2012 |
| Host editors |
|
| Book title | Advances in Databases and Information Systems |
| Book subtitle | 16th East European Conference, ADBIS 2012, Poznań, Poland, September 18-21, 2012: proceedings |
| ISBN |
|
| ISBN (electronic) |
|
| Series | Lecture Notes in Computer Science |
| Event | 16th Advances in Databases and Information Systems |
| Pages (from-to) | 209-222 |
| Publisher | Heidelberg: Springer |
| Organisations |
|
| Abstract |
Distributed processing commonly requires data spread across machines using a priori static or hash-based data allocation. In this paper, we explore an alternative approach that starts from a master node in control of the complete database, and a variable number of worker nodes for delegated query processing. Data is shipped just-in-time to the worker nodes using a need to know policy, and is being reused, if possible, in subsequent queries. A bidding mechanism among the workers yields a scheduling with the most efficient reuse of previously shipped data, minimizing the data transfer costs.
Just-in-time data shipment allows our system to benefit from locally available idle resources to boost overall performance. The system is maintenance-free and allocation is fully transparent to users. Our experiments show that the proposed adaptive distributed architecture is a viable and flexible alternative for small scale MapReduce-type of settings. |
| Document type | Conference contribution |
| Language | English |
| Published at | https://doi.org/10.1007/978-3-642-33074-2_16 |
| Permalink to this page | |