Data Placement Strategy for MapReduce Cluster Environment

doi:10.13328/j.cnki.jos.004807

微信服务号

微信订阅号

2025-5-13- 4

Home > Archive>Volume 26, Issue 8, 2015 >2056-2073. DOI:10.13328/j.cnki.jos.004807

PDF HTML XML Export Cite reminder

Data Placement Strategy for MapReduce Cluster Environment
DOI:
                        10.13328/j.cnki.jos.004807
                    
Author:
                        XUN Ya-LingXUN Ya-Ling
School of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan 030024, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
ZHANG Ji-FuZHANG Ji-Fu
School of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan 030024, China
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site
QIN XiaoQIN Xiao
Department of Computer Science and Software Engineering, Auburn University, USA)
Find this author on CNKI
Find this author on BaiDu
Search for this author on this site

                    
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

As an effective programming model for large-scale data-intensive applications, MapReduce has been widely and successfully applied in the field of parallel and distributed computing, and has the characteristics of good fault-tolerance and easy to implement and extend. Because MapReduce extends computing to the nodes of large-scale cluster system, reasonable placement of processing data has become one of the key factors affecting the performance of MapReduce cluster system, including energy efficiency, resource utilization, communications and I/O throughput, response time, and reliability. This study first analyzes characteristics of the default data placement strategy of Hadoop, which is a typical implementation of MapReduce programming model. Next, it investigates popular data placement strategies for MapReduce cluster computing environments. Finally, it presents future research directions in the area of data placement strategies for MapReduce-based cluster computing systems.

Key words:data placement;MapReduce;programming model;energy-efficient;load balancing

Get Citation

荀亚玲,张继福,秦啸. MapReduce集群环境下的数据放置策略.软件学报,2015,26(8):2056-2073

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:April 08,2014
Revised:December 21,2014
Adopted:
Online: August 05,2015
Published:

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address：4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code：100190
Phone：010-62562563 Fax：010-62562533 Email：jos@iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

微信服务号

微信订阅号

Get Citation

Share

微信扫一扫：分享

Article Metrics

History