This paper discusses how to completely debug nondeterministic MPI/PVM parallel programs. Because of this nondeterminism, a bug observed in one execution may not recur in later executions during a cyclic debugging session. Based on the FIFO communication model of MPI/PVM, an implementation of the record-and-replay technique is presented. Moreover, users are given an easy way to completely debug their programs by covering all possible execution paths through controllable replay. Compared with other solutions, the proposed method incurs much less temporal and spatial overhead. The implementation has been completed on two kinds of message-passing architectures: one is the Dawning-2000 super server (developed by the National Research Center for Intelligent Computing Systems of China), whose single-processor (PowerPC) nodes are interconnected by a custom-built wormhole mesh network; the other is a cluster of workstations (PowerPC/AIX) built at the National High Performance Computing Center at Hefei.
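The idea behind record and replay over FIFO channels can be illustrated with a small simulation. This is a minimal sketch, not the paper's implementation: it assumes that the only source of nondeterminism is a wildcard receive (one that accepts a message from any sender), and that messages from a single sender arrive in FIFO order. Under those assumptions, logging just the sender chosen by each wildcard receive is enough to reproduce a run, and supplying a different sender order steers the program down another path. The names `Mailbox`, `record_run`, and `replay_run` are hypothetical and do not come from MPI, PVM, or the paper.

```python
import random
from collections import deque


class Mailbox:
    """Hypothetical model of one receiver with a FIFO queue per sender."""

    def __init__(self, senders):
        self.queues = {s: deque() for s in senders}

    def send(self, sender, payload):
        self.queues[sender].append(payload)

    def recv_any(self, rng):
        # Wildcard receive: any sender with a pending message may match,
        # which is the source of nondeterminism in this model.
        ready = [s for s, q in self.queues.items() if q]
        sender = rng.choice(ready)
        return sender, self.queues[sender].popleft()

    def recv_from(self, sender):
        # Directed receive: FIFO order within one channel fixes the message.
        return sender, self.queues[sender].popleft()


def make_mailbox():
    # Three senders, two messages each (illustrative workload).
    box = Mailbox([1, 2, 3])
    for sender in (1, 2, 3):
        for i in range(2):
            box.send(sender, f"msg{i} from {sender}")
    return box


def record_run(seed):
    """Run with wildcard receives and log only the matched sender ids."""
    box = make_mailbox()
    rng = random.Random(seed)
    trace, received = [], []
    for _ in range(6):
        sender, payload = box.recv_any(rng)
        trace.append(sender)
        received.append(payload)
    return trace, received


def replay_run(trace):
    """Re-execute, turning each wildcard receive into a directed one."""
    box = make_mailbox()
    return [box.recv_from(sender)[1] for sender in trace]


if __name__ == "__main__":
    trace, received = record_run(seed=7)
    # Replaying the recorded sender order reproduces the original run.
    print(replay_run(trace) == received)
    # Controllable replay: a hand-chosen order explores a different path.
    print(replay_run([3, 3, 2, 2, 1, 1]))
```

Because each trace entry is a single sender id rather than a message body, the log stays small, which is consistent with the low spatial overhead the abstract claims for approaches of this kind.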