Survey of State-of-the-art Distributed Tracing Technology

doi:10.13328/j.cnki.jos.006047


Volume 31, Issue 7, 2020, pp. 2019-2039

Author: YANG Yong, LI Ying, WU Zhong-Hai
Affiliation: School of Software and Microelectronics, Peking University, Beijing 102600, China; National Engineering Research Center for Software Engineering, Peking University, Beijing 100871, China
Fund Project: Key R&D Project of Guangdong Province (2020B010164003)

Abstract:

As distributed computing and distributed systems are widely applied in many areas, improving the efficiency of system operations to guarantee the stability and reliability of the services these systems provide has gained considerable attention from both academia and industry. However, system operation tasks face tough challenges due to the large scale, intricate structures and dependencies, continuous updating, and concurrent service requests of distributed systems. Previous component-, node-, process-, and thread-centric monitoring and tracing methods are not sufficient to support system operation tasks such as fault diagnosis, performance optimization, and system understanding in a distributed system. Distributed tracing was proposed to address this issue. It identifies all the events belonging to the same request and causally correlates them. Distributed tracing thus depicts the behavior of a distributed system precisely and at fine granularity, in a service-request-centric or workflow-centric way, which is critical to improving the efficiency of system operations. This paper presents a comprehensive survey of existing research on and applications of distributed tracing technology. A research framework is proposed, and existing research achievements in this field are compared and analyzed within this framework from four perspectives: acquiring tracing data, identifying the events from the same request, determining the causal relationships among these events, and representing the request execution path. Research on applying distributed tracing technology to system operation tasks such as fault diagnosis and performance optimization is then briefly introduced. Finally, the data dependency issue, the generality issue, and the evaluation metrics issue of distributed tracing are discussed, and future research directions in distributed tracing technology are outlined.
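
The core steps named in the abstract (identifying the events of one request, relating them causally, and representing the execution path) can be illustrated with a minimal Python sketch. It assumes one common design, in which every event already carries a request identifier and a pointer to the event that caused it. The `Event` class, its field names, and the sample service names are hypothetical and are not taken from the paper.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Event:
    trace_id: str             # hypothetical request identifier
    span_id: str              # hypothetical identifier of this event
    parent_id: Optional[str]  # span_id of the causing event, None for the root
    name: str


def group_by_request(events):
    """Identify the events that belong to the same request."""
    groups = defaultdict(list)
    for event in events:
        groups[event.trace_id].append(event)
    return dict(groups)


def causal_children(events):
    """Map each parent span_id to the events it directly caused."""
    children = defaultdict(list)
    for event in events:
        children[event.parent_id].append(event)
    return children


def execution_path(events):
    """Render one request's events as an indented call tree (depth-first)."""
    children = causal_children(events)
    lines = []

    def visit(event, depth):
        lines.append("  " * depth + event.name)
        for child in children.get(event.span_id, []):
            visit(child, depth + 1)

    for root in children.get(None, []):
        visit(root, 0)
    return lines


# Interleaved events from two concurrent requests (made-up sample data).
EVENTS = [
    Event("req-1", "a", None, "frontend"),
    Event("req-2", "x", None, "frontend"),
    Event("req-1", "b", "a", "auth"),
    Event("req-1", "c", "a", "orders"),
    Event("req-2", "y", "x", "search"),
    Event("req-1", "d", "c", "database"),
]

for trace_id, request_events in group_by_request(EVENTS).items():
    print(trace_id)
    for line in execution_path(request_events):
        print("  " + line)
```

The sketch covers only the case where identifiers are propagated with each request. Systems that must infer request membership or causality from unannotated logs need different techniques.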

Key words: distributed tracing; fault diagnosis; distributed system

Citation:

YANG Yong, LI Ying, WU Zhong-Hai. Survey of State-of-the-art Distributed Tracing Technology. Journal of Software, 2020, 31(7): 2019-2039.


History

Received: May 30, 2019
Revised: September 4, 2019
Online: April 21, 2020
Published: July 6, 2020

Copyright: Institute of Software, Chinese Academy of Sciences