Data declustering methods of parallel databases have significant effect on the performance of the parallel algorithms on the databases. If the characteristics of the declustering methods of the parallel database can be adequately exploited in designing parallel algorithms for implementing the database operations in parallel database systems, high performance parallel algorithms could be resulted. Unfortunately, the advantages of the underlying declustering methods of the parallel databases are rarely considered in existent parallel algorithms on the parallel databases. The focus of this paper is on how to utilize the advantages of the underlying declustering methods of the databases in designing parallel join algorithms. A set of parallel join algorithms based on the CMD declustering method are proposed. Theoretical and experimental results show that the proposed algorithms are more efficient than other parallel join algorithms.
1 李建中.并行数据库的查询并行化技术与物理设计方法.软件学报,1994,5(10):1~10
(Li Jian-zhong. Parallelization techniques for query processing and data declustering methods. Journal of Softwate, 1994,5(10):1~10)
2 李建中.并行数据操作算法和查询优化技术.软件学报,1994,5(10):11~23
(Li Jian-zhong. Parallel data operation algorithms and query optimization techniques. Journal of Software, 1994,5(10):11~23)
3 BoralH et al. Join on a cube: analysis, simulation and implementation. In: Kitsuregawa M, Tanaka H eds. Database Machines and Knowledge Base Machines. Boston: Kluwer, 1988. 61~74
4 Schneider D A, DeWitt D J. A performance evaluation of parallel in algorithms in a shared-nothing multiprocessor environment. In: Maier D ed. Proceedings of the ACM SIGMOD'89, USA, 1989. Baltimore: ACM Press, 1989. 110~121
5 Kitsuregawa M, Tanaka H, Motooka T. Application of hash to data base machine and its architecture. New Generation Computing, 1983,1(1):25~39
6 Omiecinski E R, Lin E T. Hash-based and index-based join algorithms for cube and ring connected multicomputers. IEEE Transactions on Knowledge and Data Engineering, 1989,1(3):329~343
7 Kitsuregawa M, Nakayama M, Takagi M. The effect of bucket size tuning in the dynamic hybrid GRACE hash join method. In: Proceedings of the International Conference on Very Large Data Bases'89. San Mateo: Morgan Kaufmann Publishers, Inc., 1989. 257~266
8 Kitsuregawa M, Ogawa Y. Bucket spreading parallel hash: a new bobust, parallel hash join method for data skew in the super database computer (SDC). In: Proceedings of the International Conference on Very Large Data Bases'90. Palo Alto: Morgan Kaufmann Publishers, Inc., 1990. 210~221
9 Hua K A, Lee C. Handling data skew in multiprocessor database computer systems using partition tuning. In: Proceedings of the of the International Conference on Very Large Data Bases'91, Barcelona, 1991. San mateo: Morgan Kaufmann publishers, Inc., 1991. 525~536
10 Li Jian-zhong, Jaideep Srivastava, Doron Rotem. CMD: a multidimensional declustering method for parallel database systems. In: Proceedings of the 18th International Conference on Very Large Data Bases Conference. Canada, 1992
11 Li Jian-zhong. Range query procession in multi-disk systems. Journal of Computer Science and Technology, 1992,7(4):316~327.
12 Stonebraker M. The case for shared nothing. Database Engineering, 1986,9(1):17~24