High-precision Data Race Detection Method for Large Scale Programs
Author:
Affiliation:

Fund Project:

National Natural Science Foundation of China (62032010)

  • Article
  • | |
  • Metrics
  • |
  • Reference [36]
  • |
  • Related [20]
  • | | |
  • Comments
    Abstract:

    With the development of techniques, the uncertainty in software systems is continuously increasing. Data race is a typical bug in current programs, which is a classic type of uncertainty programs. Despite significant progress in recent years, the important problem of practical static race detection remains open. Previous static techniques either suffer from a high false positive rate due to the compromise of precision, or scalability issues caused by a highly precise analysis. This paper presents GUARD, a staged approach to resolve this paradox. First, it performs a lightweight context-sensitive data access analysis, based on the value flow of a program, to identify the candidate data race subpaths instead of the whole program paths. Second, may-happen-in-parallel (MHP) analysis is employedto identify whether two data accesses in a program may execute concurrently. This stage is scalable, due to the design of the thread flow graph (TFG), which encodes thread information to query MHP relationship of the subpaths. Finally, for each subpath whose two data accesses are MHP, the heavyweight path-sensitive analysis is appliedto verify the feasibility of the data races. The evaluation demonstrates that GUARD can finish checking industrial-sized projects, up to 1.3MLoC, in 1 870s with an average false positive rate of 16.0%. Moreover, GUARD is faster than the state-of-the-art techniques with the average speedup 6.08X and significantly fewer false positives. Besides, GUARD has found 12 new race bugs in real-world programs. All of them are reportedtothe developers and 8 of them have been confirmed.

    Reference
    [1] Savage S, Burrows M, Nelson G, Sobalvarro P, Anderson T. Eraser:A dynamic data race detector for multithreaded programs. ACM Trans. on Computer Systems (TOCS), 1997,15(4):391-411.
    [2] CVE security vulnerabilities related to CWE (Common Weakness Enumeration) 362. 2020. https://www.cvedetails.com/vulnerability-list/cweid-362/vulnerabilities.html
    [3] Chen D, Jiang Y, Xu C, Ma X, Lu J. Testing multithreaded programs via thread speed control. In:Proc. of the 26th ACM Joint Meeting on European Software Engineering Conf. and Symp. on the Foundations of Software Engineering (ESEC/FSE). ACM, 2018. 15-25.
    [4] Pratikakis P, Foster JS, Hicks M. LOCKSMITH:Context-sensitive correlation analysis for race detection. ACM SIGPLAN Notices, 2006,41(6):320-331.
    [5] Vojdani V, Apinis K, Rõtov V, Seidl H, Vene V, Vogler R. Static race detection for device drivers:The Goblint approach. In:Proc. of the 31st IEEE/ACM Int'l Conf. on Automated Software Engineering (ASE). IEEE, 2016. 391-402.
    [6] Voung JW, Jhala R, Lerner S. RELAY:Static race detection on millions of lines of code. In:Proc. of the 6th Joint Meeting of the European Software Engineering Conf. and the ACM SIGSOFT Symp. on the Foundations of Software Engineering (ESEC/FSE). ACM, 2007. 205-214.
    [7] Wang Y, Wang L, Yu T, Zhao J, Li X. Automatic detection and validation of race conditions in interrupt-driven embedded software. In:Proc. of the 26th ACM SIGSOFT Int'l Symp. on Software Testing and Analysis (ISSTA). ACM, 2017. 113-124.
    [8] Zhou Q, Li L, Wang L, Xue J, Feng X. May-happen-in-parallel analysis with static vector clocks. In:Proc. of the Int'l Symp. on Code Generation and Optimization. ACM, 2018. 228-240.
    [9] Huang J, Rajagopalan AK. Precise and maximal race detection from incomplete traces. ACM SIGPLAN Notices, 2016,51(10):462-476.
    [10] Huang J, Zhang C, Dolby J. CLAP:Recording local executions to reproduce concurrency failures. ACM SIGPLAN Notices, 2013, 48(6):141-152.
    [11] Machado N, Lucia B, Rodrigues L. Concurrency debugging with differential schedule projections. ACM SIGPLAN Notices, 2015,50(6):586-595.
    [12] Zhang T, Jung C, Lee D. ProRace:Practical data race detection for production use. ACM SIGPLAN Notices, 2017,52(4):149-162.
    [13] Blackshear S, Gorogiannis N, O'Hearn PW, Sergey I. RacerD:Compositional static race detection. In:Proc. of the ACM on Programming Languages (OOPSLA). 2018. 1-28.
    [14] Di P, Sui Y, Ye D, Xue J. Region-based may-happen-inparallel analysis for C programs. In:Proc. of the 44th Int'l Conf. on Parallel Processing (ICPP). IEEE, 2015. 889-898.
    [15] Zhan S, Huang J. ECHO:Instantaneous in situ race detection in the IDE. In:Proc. of the 24th ACM SIGSOFT Int'l Symp. on Foundations of Software Engineering (FSE). ACM, 2016. 775-786.
    [16] Dillig I, Dillig T, Aiken A. Sound, complete and scalable path-sensitive analysis. ACM SIGPLAN Notices, 2008,43(6):270-280.
    [17] Shi Q, Xiao X, Wu R, Zhou J, Fan G, Zhang C. Pinpoint:Fast and precise sparse value flow analysis for million lines of code. In:Proc. of the 39th ACM SIGPLAN Conf. on Programming Language Design and Implementation (PLDI). ACM, 2018. 693-706.
    [18] Sui Y, Xue J. SVF:Interprocedural static value-flow analysis in LLVM. In:Proc. of the 25th Int'l Conf. on Compiler Construction. ACM, 2016. 265-266.
    [19] Agarwal S, Barik R, Sarkar V, Shyamasundar RK. May-happen-in-parallel analysis of X10 programs. In:Proc. of the 12th ACM SIGPLAN Symp. on Principles and Practice of Parallel Programming. ACM, 2007. 183-193.
    [20] Albert E, Genaim S, Gordillo P. May-happen-in-parallel analysis for asynchronous programs with inter-procedural synchronization. In:Proc. of the Int'l Static Analysis Symp. Springer-Verlag, 2015. 72-89.
    [21] Barik R. Efficient computation of may-happen-in-parallel information for concurrent Java programs. In:Proc. of the Int'l Workshop on Languages and Compilers for Parallel Computing. Springer-Verlag, 2005. 152-169.
    [22] Naumovich G, Avrunin GS, Clarke LA. An efficient algorithm for computing MHP information for concurrent Java programs. In:Proc. of the Joint Meeting of the European Software Engineering Conf. and the ACM SIGSOFT Symp. on the Foundations of Software Engineering (ESEC/FSE). Springer-Verlag, 1999. 338-354.
    [23] Sankar A, Chakraborty S, Nandivada VK. Improved MHP analysis. In:Proc. of the 25th Int'l Conf. on Compiler Construction. ACM, 2016. 207-217.
    [24] Cherem S, Princehouse L, Rugina R. Practical memory leak detection using guarded value-flow analysis. ACM SIGPLAN Notices, 2007,42(6):480-491.
    [25] The LLVM Compiler Infrastructure. 2020. http://llvm.org
    [26] Moura LD, Bjørner N. Z3:An efficient SMT solver. In:Proc. of the Int'l Conf. on Tools and Algorithms for the Construction and Analysis of Systems. Springer-Verlag, 2008. 337-340.
    [27] Writing an LLVM Pass. 2020. http://llvm.org/docs/WritingAnLLVMPass.html
    [28] Barney B. POSIX threads programming. 2017. https://computing.llnl.gov/tutorials/pthreads/
    [29] Thread-C++ Reference. 2020. http://www.cplusplus.com/reference/thread/thread/
    [30] Xie Y, Aiken A. Saturn:A SAT-based tool for bug detection. In:Proc. of the Int'l Conf. on Computer Aided Verification (CAV). Springer-Verlag, 2005. 139-143.
    [31] Joshi S, Shyamasundar RK, Aggarwal SK. A new method of MHP analysis for languages with dynamic barriers. In:Proc. of the 26th IEEE Int'l Parallel and Distributed Processing Symp. Workshops & PhD Forum (IPDPSW). IEEE, 2012. 519-528.
    [32] Sui Y, Di P, Xue J. Sparse flow-sensitive pointer analysis for multithreaded programs. In:Proc. of the Int'l Symp. on Code Generation and Optimization. ACM, 2016. 160-170.
    [33] Engler D, Ashcraft K. RacerX:Effective, static detection of race conditions and deadlocks. ACM SIGOPS Operating Systems Review, 2003,37(5):237-252.
    [34] Li Y, Liu B, Huang J. SWORD:A scalable whole program race detector for Java. In:Proc. of the 41st Int'l Conf. on Software Engineering:Companion Proc. IEEE, 2019. 75-78.
    [35] Naik M, Aiken A, Whaley J. Effective static race detection for Java. In:Proc. of the 27th ACM SIGPLAN Conf. on Programming Language Design and Implementation (PLDI). ACM, 2006. 308-319.
    [36] Radoi C, Dig D. Practical static race detection for Java parallel loops. In:Proc. of the Int'l Symp. on Software Testing and Analysis (ISSTA). ACM, 2013. 178-190.
    Cited by
    Comments
    Comments
    分享到微博
    Submit
Get Citation

高凤娟,王豫,周金果,徐安孜,王林章,吴荣鑫,张川,苏振东.高精度的大规模程序数据竞争检测方法.软件学报,2021,32(7):2039-2055

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:September 05,2020
  • Revised:October 26,2020
  • Online: January 22,2021
  • Published: July 06,2021
You are the first2033286Visitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063