P2P持久存储研究
作者:
基金项目:

Supported by the National Natural Science Foundation of China under Grant No.90412008 (国家自然科学基金); the National Basic Research Program of China under Grant No.2004CB318204 (国家重点基础研究发展计划(973)); the Research Fund for the Doctoral Program of Higher Education of China under Grant No.20060001044 (国家教育部博士点基金)


Study on Durable Peer-to-Peer Storage Techniques
  • 摘要
  • | |
  • 访问统计
  • |
  • 参考文献 [56]
  • |
  • 相似文献 [20]
  • |
  • 引证文献
  • | |
  • 文章评论
    摘要:

    P2P(peer-to-peer)的组织模式已经成为新一代互联网应用的重要形式,它为应用带来了更好的扩展性、容错性和高性能.P2P存储系统一直是研究界所关注的热点,被认为是P2P最具前途的应用之一.数据的持久存储是制约P2P存储系统发展的关键问题,也是其研究的难点.综述了P2P存储系统及数据持久存储相关技术的研究现状.首先概述了P2P存储系统的基本技术组成及其在不同应用环境中的优势,并介绍了数据冗余、数据分发、错误检测和冗余数据维护等多种持久存储的基本技术.在一个P2P存储系统研究框架下,介绍了目前知名的P2P存储系统及其使用的持久存储技术.对各种技术进行了详细综述和对比讨论,分析了各种技术的适应环境及优劣,指出了存在的问题和未来研究的方向.

    Abstract:

    Peer-to-Peer (P2P) has been one of the most important architectures for Internet applications for its inherent scalability, fault tolerance, and high performance. The research of P2P storage systems is one of the hottest issues, and P2P storage system is regarded as one of the most promising P2P applications. However, to provide durable data storage is not trivial work and sets great barrier to real deployed systems. This paper surveys the P2P storage systems and techniques for durable storage. This paper first introduces the basic components of a durable P2P storage system and the advantages by using P2P architecture, and then introduces a series of techniques for durable storage including data redundancy, data placement, failure detection and data maintenance. Following the research framework, some typical P2P storage systems and their techniques are introduced. By a detailed comparison, the pros and cons of the techniques are discussed. In the end, the problems in current research and some future research issues are outlined.

    参考文献
    [1]Zhang Z,Lin S,Lian Q,Jin C.RepStore:A self-managing and self-tuning storage backend with smart bricks.In:Proc.of the Int'l Conf.on Autonomic Computing.2004.122-129.http://ieeexplore.ieee.org/xpl/freeabs_all.jsp-arnumber=1301355&fromcon
    [2]Stoica I,Morris R,Karger D,Kaashoek M,Balakrishnan H.Chord:A scalable peer-to-peer lookup service for internet applications.Proc.of the 2001 SIGCOMM Conf.,2001,31(4):149-160.
    [3]Zhao B,Kubiatowicz J,Joseph A.Tapestry:An infrastructure for fault-tolerant wide-area location and routing.Technical Report,UCB//CSD-01-1141,Berkeley Computer Science Division,University of California,2001.
    [4]Ratnasamy S,Francis P,Handley M,Karp R,Schenker S.A scalable content-addressable network.In:Proc.of the ACM SIGCOMM Symp.on Communication,Architecture,and Protocols.ACM SIGCOMM,2001.161-172.http://www.acm.org/sigs/ sigcomm/sigcomm/sigcomm2001/p13-ratnasamy.pdf
    [5]Rowstron A,Druschel P.Pastry:Scalable,distributed object location and routing for large-scale peer-to-peer systems.In:Proc.of the IFIP/ACM Int'l Conf.on Distributed Systems Platforms (Middleware).2001.329-350.http://citeseer.ist.psu.edu/ rowstron01pastry.html
    [6]Maymounkov P,Mazieres D.Kademlia:A peer-to-peer information system based on the XOR metric.In:Proc.of the 1st Int'l Workshop on Peer-to-Peer Systems.2002.258-263.http://citeseer.ist.psu.edu/maymounkov02kademlia.html
    [7]Schlosser M,Sintek M,Decker S,Nejdl W.HyperCuP-Hypercubes,ontologies and efficient search on P2P networks.In:Proc.of the Int'l Workshop on Agents and Peer-to-Peer Computing.2002.112-124.http://citeseer.ist.psu.edu/532386.html
    [8]Mitzenmacher M.Digital fountains:A survey and look forward.In:Proc.of the Information Theory Workshop.2004.271-276.http://ieeexplore.ieee.org/xpls/abs_all.jsp-arnumber=1405313
    [9]Plank J.A tutorial on reed-solomon coding for fault-tolerance in RAID-like systems.Software Practice and Experience,1997,27(9):995-1012.
    [10]Chun B,Dabek F,Haeberlen A,Sit E,Weatherspoon H,Kaashoek M,Kubiatowicz J,Morris R.Efficient replica maintenance for distributed storage systems.In:Proc.of the 3rd Symp.on Networked Systems Design and Implementation.2006.45-58.http://oceanstore.cs.berkeley.edu/publications/papers/pdf/carbonite06.pdf
    [11]Li J,Dabek F,Berkeley U,MIT M.F2F:Reliable storage in open networks.In:Proc.of the 5th Int'l Workshop on Peer-to-Peer Systems.2006.http://iptps06.cs.ucsb.edu/papers/Li-F2F06.pdf
    [12]PlanetLab.2007.http://www.planet-lab.org/
    [13]Wells C.The oceanstore archive:Goals,structures,and self-repair[MS Thesis].UC Berkeley:University of California,2002.
    [14]Kubiatowicz J,Wells C,Zhao B,Bindel D,Chen Y,Czerwinski S,Eaton P,Geels D,Gummadi R,Rhea S.OceanStore:An architecture for global-scale persistent storage.In:Proc.of the 9th Int'l Conf.on Architectural Support for Programming Languages and Operating Systems.2000.190-201.
    [15]Rhea S,Eaton P,Geels D,Weatherspoon H,Zhao B,Kubiatowicz J.Pond:The OceanStore prototype.In:Proc.of the 2nd USENIX Conf.on File and Storage Technologies.2003.1-14.http://www.oceanstore.org/publications/papers/pdf/fast2003-pond.pdf
    [16]Bhagwan R,Tati K,Cheng Y,Savage S,Voelker G.Total recall:System support for automated availability management.In:Proc.of the 1st ACM/Usenix Symp.on Networked Systems Design and Implementation.2004.http://citeseer.ist.psu.edu/ bhagwan04total.html
    [17]Yang M,Zhang Z,Li X,Dai Y.An empirical study of free-riding behavior in the maze P2P file-sharing system.In:Proc.of the 4th Int'l Workshop on Peer-to-Peer Systems.2005.http://iptps05.cs.cornell.edu/PDFs/CameraReady_171.pdf
    [18]Gnutella.2005.http://rfc-gnutella.sourceforge.net/
    [19]Napster.2001.http://www.napster.com/
    [20]Overnet.2006.http://www.overnet.com
    [21]KaZaA.2006.http://www.kazaa.com
    [22]Yang M,Zhao B,Dai Y,Zhang Z.Deployment of a large scale peer-to-peer social network.In:Proc.of the 1st Workshop on Real,Large Distributed Systems.2004.http://research.microsoft.com/asia/dload_files/group/system/maze.pdf
    [23]Blake C,Rodrigues R.High availability,scalable storage,dynamic peer networks:Pick two.In:Proc.of the 9th Workshop on Hot Topics in Operating Systems.2003.1-6.http://pdos.csail.mit.edu/~cblake/p2p-scl.pdf
    [24]Upstore.2006.http://upstore.grids.cn
    [25]Dabek F,Kaashoek M,Karger D,Morris R,Stoica I.Wide-Area cooperative storage with CFS.In:Proc.of the 18th ACM Symp.on Operating Systems Principles.2001.202-215.http://portal.acm.org/citation.cfm-id=502034.502054
    [26]Stribling J.OverCite:A cooperative digital research library.In:Proc.of the 4th Int'l Workshop on Peer-to-Peer Systems.2005.http://citeseer.ist.psu.edu/stribling05overcite.html
    [27]Druschel P,Rowstron A.PAST:A large-scale,persistent peer-to-peer storage utility.In:Proc.of the 8th IEEE Workshop on Hot Topics in Operating Systems.2001.http://ieeexplore.ieee.org/xpl/freeabs_all.jsp-arnumber=990064&fromcon
    [28]Adya A,Wattenhofer R,Bolosky W,Castro M,Cermak G,Chaiken R,Douceur J,Howell J,Lorch J,Theimer M.Farsite:Federated,available,and reliable storage for an incompletely trusted environment.In:Proc.of the 5th Symp.on Operating Systems Design and Implementation.2002.1-12.https://research.microsoft.com/Farsite/OSDI2002.pdf
    [29]Zhang Z,Lian Q,Lin S,Chen W,Chen Y,Jin C.BitVault:A highly reliable distributed data retention platform.Technical Report,MSR-TR-2005-179,Microsoft Research,2005.
    [30]Bhagwan R,Savage S,Voelker G.Understanding availability.In:Proc.of the 2nd Int'l Workshop on Peer-to-Peer Systems.2003.http://www.springerlink.com/content/ehcfgw36n3j1ypr6/
    [31]Zhang Z,Lian Q,Chen Y.XRing:Achieving high-performance routing adaptively in structured P2P.Technical Report,MSR-TR-2004-93,Microsoft Research,2004.
    [32]Lian Q,Chen W,Zhang Z.On the impact of replica placement to the reliability of distributed brick storage systems.In:Proc.of the 25th IEEE Int'l Conf.on Distributed Computing Systems.2005.187-196.http://ieeexplore.ieee.org/xpls/abs_all.jsp-arnumber= 1437083
    [33]Zheng W,Hu J,Li M.Granary:Architecture of object oriented Internet storage service.In:Proc.of the IEEE Int'l Conf.on E-Commerce Technology for Dynamic E-Business.2004.294-297.http://ieeexplore.ieee.org/xpls/abs_all.jsp-arnumber=1388340
    [34]Hu J,Li M,Zheng W,Wang D,Ning N,Dong H.SmartBoa:Constructing P2P overlay network in the heterogeneous Internet using irregular routing tables.In:Proc.of the 3rd Int'l Workshop on Peer-to-Peer Systems.2004.http://iptps04.cs.ucsd.edu/papers/ hu-smartboa.pdf
    [35]Lunar.2005.http://maze.pku.edu.cn/lunar.htm
    [36]eMule.2007.http://emule.org/
    [37]Tian J,Yang Z,Dai Y.SEC:A practical secure erasure coding scheme for peer-to-peer storage system.In:Proc.of the 14th Symp.on Storage System and Technology.2006.210-222.http://upstore.grids.cn/publications/SEC.pdf
    [38]Batten C,Barr K,Saraf A,Trepetin S.pStore:A secure peer-to-peer backup system.Technical Report,LCS Technical Memo 632,MIT Laboratory for Computer Science,2002.http://www.mit.edu/~cbatten/work/pstore-tr02.pdf
    [39]Cox L,Murray C,Noble B.Pastiche:Making backup cheap and easy.In:Proc.of the 5th Symp.on Operating Systems Design and Implementation.New York:ACM Press,2002.285-298.http://www.eecs.umich.edu/~lpcox/osdi02.pdf
    [40]Ghemawat S,Gobioff H,Leung S.The google file system.In:Proc.of the 19th ACM Symp.on Operating Systems Principles.2003.29-43.http://portal.acm.org/citation.cfm-id=945445.945450
    [41]Weatherspoon H,Kubiatowicz J.Erasure coding vs.replication:A quantitative comparison.In:Proc.of the 1st Int'l Workshop on Peer-to-Peer Systems.2002.328-337.http://www.springerlink.com/content/e1kmcf729e6updgm/
    [42]Bhagwan R,Savage S,Voelker G.Replication strategies for highly available peer-to-peer storage systems.Technical Report,CS2002-0726,UCSD,2002.
    [43]Lin W,Chiu D,Lee Y.Erasure code replication revisited.In:Proc.of the 4th Int'l Conf.on Peer-to-Peer Computing.2004.90-97.http://ieeexplore.ieee.org/xpls/abs_all.jsp-arnumber=1334935
    [44]Rodrigues R,Liskov B.High availability in DHTs:Erasure coding vs.replication.In:Proc.of the 4th Int'l Workshop on Peer-to-Peer Systems.2005.226-239.http://citeseer.ist.psu.edu/729877.html
    [45]Tian J,Dai Y,Wang H,Yang M.Understanding session durability in peer-to-peer storage system.In:Proc.of the Int'l Conf.on Computational Science.2006.428-435.http://upstore.grids.cn/publications/iccs06_lncs.pdf
    [46]Douceur J,Wattenhofer R.Competitive hill-climbing strategies for replica placement in a distributed file system.In:Proc.of the 15th Int'l Conf.on Distributed Computing.2001.48-62.http://www.springerlink.com/content/g4rjk02tpn71mh5m/
    [47]Ramanathan M.Increasing object availability in peer-to-peer systems.In:Proc.of the 18th Int'l Parallel and Distributed Processing Symp.2004.129-133.http://ieeexplore.ieee.org/iel5/9132/28950/01303097.pdf-arnumber=1303097
    [48]Schwarz TJE,Xin Q,Miller EL.Availability in global peer-to-peer storage systems.In:Proc.of the 6th Workshop on Distributed Data and Structures.2004.http://citesee.ist.psu.edu/schwarz04availability.html
    [49]Tati K,Voelker G.On object maintenance in peer-to-peer systems.In:Proc.of the 5th Int'l Workshop on Peer-to-Peer Systems.2006.http://www.cs.ucsd.edu/~voelker/pubs/objmaint-iptps06.pdf
    [50]Weatherspoon H,Chun B,So C,Kubiatowicz J.Long-Term data maintenance in wide-area storage systems:A quantitative approach.Technical Report,UCB/CSD-05-1404,Berkeley Computer Science Division,University of California,2005.
    [51]Ramabhadran S,Pasquale J.Analysis of long-running replicated systems.In:Proc.of the 25th IEEE Annual Conf.on Computer Communications.2006.http://www-cse.ucsd.edu/~pasquale/Papers/infocom06.pdf
    [52]Sit E,Haeberlen A,Dabek F,Chun B,Weatherspoon H,Morris R,Kaashoek M,Kubiatowicz J.Proactive replication for data durability.In:Proc.of the 5th Int'l Workshop on Peer-to-Peer Systems.2006.http://iptps06.cs.ucsb.edu/papers/Sit-tempo.pdf
    [53]Guha S,Daswani N,Jain R.An experimental study of the skype peer-to-peer voip system.In:Proc.of the 5th Int'l Workshop on Peer-to-Peer Systems.2006.http://citeseer.ist.psu.edu/guha06experimental.html
    [54]Haeberlen A,Mislove A,Post A,Druschel P.Fallacies in evaluating decentralized systems.In:Proc.of the 5th Int'l Workshop on Peer-to-Peer Systems.2006.http://iptps06.cs.ucsb.edu/papers/Post-fallacies06.pdf
    [55]Tian J,Dai Y.Understanding the dynamic of peer-to-peer systems.In:Proc.of the 6th Int'l Workshop on Peer-to-Peer Systems.2007.http://research.microsoft.com/workshops/IPTPS2007/papers/TianDai.pdf
    [56]Utard G,Vernois A.Data durability in peer to peer storage systems.In:Proc.of the IEEE Int'l Symp.on Cluster Computing and the Grid.2004.90-97.http://ieeexplore.ieee.org/xpls/abs_all.jsp-arnumber=1336553
    网友评论
    网友评论
    分享到微博
    发 布
引用本文

田敬,代亚非. P2P持久存储研究.软件学报,2007,18(6):1379-1399

复制
分享
文章指标
  • 点击次数:7606
  • 下载次数: 9491
  • HTML阅读次数: 0
  • 引用次数: 0
历史
  • 收稿日期:2006-12-13
  • 最后修改日期:2007-01-24
文章二维码
您是第19728414位访问者
版权所有:中国科学院软件研究所 京ICP备05046678号-3
地址:北京市海淀区中关村南四街4号,邮政编码:100190
电话:010-62562563 传真:010-62562533 Email:jos@iscas.ac.cn
技术支持:北京勤云科技发展有限公司

京公网安备 11040202500063号