Abstract: The Internet provides an open, dynamic, and uncertain environment. A composite service in this environment faces more failure threats caused by resource faults and component failures. Existing failure recovery measures can improve the availability of a composite service but, at the same time, have a negative impact on service performance. In this paper, an analytical model that accounts for failure recovery is proposed to quantitatively analyze the performance of a composite service in terms of failure probability and recovery ability, by synthesizing the fault taxonomy and the recovery strategies for service failures. Queueing Petri nets (QPNs) are used to model the occurrence of failures and the corresponding recovery handling processes, focusing on the service execution process under the retry and replacement strategies. The inner structures of the server node and link QPN models, which take failure and recovery impact into account, are described in detail and depicted. On this basis, a performance model of a composite service with decentralized execution is given, and five interaction relations are taken into account. Finally, the QPME tool is used to simulate and compare the performance measures under different failure rates, proportions of failure types, and recovery strategies. The numerical results show that this method can quantitatively analyze the impact of failure recovery measures on service performance, and can also help provide rational solutions for implementing failure recovery measures for information service systems in uncertain network environments.