Runtime predictive analysis
Runtime predictive analysis (or predictive analysis) is a runtime verification technique in computer science for detecting property violations in program executions inferred from an observed execution. An important class of predictive analysis methods has been developed for detecting concurrency errors (such as data races) in concurrent programs, where a runtime monitor is used to predict errors which did not happen in the observed run, but can happen in an alternative execution of the same program. The predictive capability comes from the fact that the analysis is performed on an abstract model extracted online from the observed execution, which admits a class of executions beyond the observed one.[1]
Overview
Informally, given an execution , predictive analysis checks errors in a reordered trace of . is called feasible from if any program that can generate can also generate .
In the context of concurrent programs, a predictive technique is sound if it only predicts concurrency errors in feasible executions of the causal model of the observed trace. Assuming the analysis has no knowledge about the source code of the program, the analysis is complete (also called maximal[2][3]) if the inferred class of executions contains all executions that have the same program order and communication order prefix of the observed trace.
Applications
Predictive analysis has been applied to detect a wide class of concurrency errors, including:
- Data races
- Deadlocks[4][5]
- Atomicity violations[6]
- Order violations, e.g., use-after-free errors[7]
Implementation
As is typical with dynamic program analysis, predictive analysis first instruments the source program. At runtime, the analysis can be performed online, in order to detect errors on the fly. Alternatively, the instrumentation can simply dump the execution trace for offline analysis. The latter approach is preferred for expensive refined predictive analyses that require random access to the execution trace or take more than linear time.
Incorporating data and control-flow analysis
Static analysis can be first conducted to gather data and control-flow dependence information about the source program, which can help construct the causal model during online executions. This allows predictive analysis to infer a larger class of executions based on the observed execution. Intuitively, a feasible reordering can change the last writer of a memory read (data dependence) if the read, in turn, cannot affect whether any accesses execute (control dependence).[8][9]
Approaches
Partial order based techniques
Partial order based techniques are most often employed for online race detection. At runtime, a partial order over the events in the trace is constructed, and any unordered pairs of critical events are reported as races. Many predictive techniques for race detection are based on the happens-before relation or a weakened version of it. Such techniques can typically be implemented efficiently with vector clock algorithms, allowing only one pass of the whole input trace as it is being generated, and are thus suitable for online deployment. [10] [11] [12]
SMT-based techniques
SMT encodings allow the analysis to extract a refined causal model from an execution trace, as a (possibly very large) mathematical formula. Furthermore, control-flow information can be incorporated into the model. SMT-based techniques can achieve soundness and completeness (also called maximal causality[3] [2]), but has exponential-time complexity with respect to the trace size. In practice, the analysis is typically deployed to bounded segments of an execution trace, thus trading completeness for scalability. [8] [13] [14] [15]
Other techniques
In the context of data race detection, sound polynomial-time predictive analyses have been developed, with good, close to maximal predictive capability. [16]
Tools
Here is a partial list of tools that use predictive analyses to detect concurrency errors, sorted alphabetically.
- "Rapid".: a lightweight framework for implementing dynamic race detection engines.
- "RoadRunner".: a dynamic analysis framework designed to facilitate rapid prototyping and experimentation with dynamic analyses for concurrent Java programs.
- "RV-Predict".: SMT-based predictive race detection.
- "UFO".: SMT-based predictive use-after-free detection.
References
- "Runtime Predictive Analysis". November 10, 2008.
- Şerbănuţă, Traian Florin; Chen, Feng; Roşu, Grigore (2013). "Maximal Causal Models for Sequentially Consistent Systems". 7687: 136–150. doi:10.1007/978-3-642-35632-2_16. hdl:2142/27708. ISSN 0302-9743. Cite journal requires
|journal=
(help) - Huang, Jeff (2015). "Stateless model checking concurrent programs with maximal causality reduction": 165–174. doi:10.1145/2737924.2737975. Cite journal requires
|journal=
(help) - Kalhauge, Christian Gram; Palsberg, Jens (2018). "Sound deadlock prediction". Proceedings of the ACM on Programming Languages. 2 (OOPSLA): 1–29. doi:10.1145/3276516. ISSN 2475-1421.
- "Sound Dynamic Deadlock Prediction in Linear Time" (PDF).
- "Atomicity Checking in Linear Time using Vector Clocks" (PDF).
- Huang, Jeff (2018). "UFO": 609–619. doi:10.1145/3180155.3180225. Cite journal requires
|journal=
(help) - Huang, Jeff; Meredith, Patrick O'Neil; Rosu, Grigore (2013). "Maximal sound predictive race detection with control flow abstraction": 337–348. doi:10.1145/2594291.2594315. Cite journal requires
|journal=
(help) - Genç, Kaan; Roemer, Jake; Xu, Yufan; Bond, Michael D. (2019). "Dependence-aware, unbounded sound predictive race detection". Proceedings of the ACM on Programming Languages. 3 (OOPSLA): 1–30. doi:10.1145/3360605. ISSN 2475-1421.
- Smaragdakis, Yannis; Evans, Jacob; Sadowski, Caitlin; Yi, Jaeheon; Flanagan, Cormac (2012). "Sound predictive race detection in polynomial time". ACM SIGPLAN Notices. 47 (1): 387. doi:10.1145/2103621.2103702. ISSN 0362-1340.
- Kini, Dileep; Mathur, Umang; Viswanathan, Mahesh (2017). "Dynamic race prediction in linear time": 157–170. arXiv:1704.02432. doi:10.1145/3062341.3062374. Cite journal requires
|journal=
(help) - Roemer, Jake; Genç, Kaan; Bond, Michael D. (2018). "High-coverage, unbounded sound predictive race detection": 374–389. doi:10.1145/3192366.3192385. Cite journal requires
|journal=
(help) - Liu, Peng; Tripp, Omer; Zhang, Xiangyu (2016). "IPA: improving predictive analysis with pointer analysis": 59–69. doi:10.1145/2931037.2931046. Cite journal requires
|journal=
(help) - Wang, Chao; Kundu, Sudipta; Ganai, Malay; Gupta, Aarti (2009). "Symbolic Predictive Analysis for Concurrent Programs". 5850: 256–272. doi:10.1007/978-3-642-05089-3_17. ISSN 0302-9743. Cite journal requires
|journal=
(help) - Said, Mahmoud; Wang, Chao; Yang, Zijiang; Sakallah, Karem (2011). "Generating Data Race Witnesses by an SMT-Based Analysis". 6617: 313–327. doi:10.1007/978-3-642-20398-5_23. ISSN 0302-9743. Cite journal requires
|journal=
(help) - Pavlogiannis, Andreas (2020). "Fast, sound, and effectively complete dynamic race prediction". Proceedings of the ACM on Programming Languages. 4 (POPL): 1–29. doi:10.1145/3371085. ISSN 2475-1421.