Research Collection School Of Computing and Information Systems

On reliability of patch correctness assessment

Xuan-Bach D. LE, Carnegie Mellon University
Lingfeng BAO, Zhejiang University
David LO, Singapore Management University
Xin XIA, Monash University
Shanping LI, Zhejiang University
Corina S. PASAREANU, NASA Ames Research Center

Publication Type

Conference Proceeding Article

Version

acceptedVersion

Publication Date

5-2019

Abstract

Current state-of-the-art automatic software repair (ASR) techniques rely heavily on incomplete specifications, or test suites, to generate repairs. This, however, may cause ASR tools to generate repairs that are incorrect and hard to generalize. To assess patch correctness, researchers have been following two methods separately: (1) automated annotation, wherein patches are automatically labeled by an independent test suite (ITS), with a patch that passes the ITS regarded as correct or generalizable and any other patch regarded as incorrect; and (2) author annotation, wherein authors of ASR techniques manually annotate the correctness labels of patches generated by their own and competing tools. While automated annotation cannot ascertain that a patch is actually correct, author annotation is prone to subjectivity. This concern has caused an ongoing debate on the appropriate ways to assess the effectiveness of the numerous ASR techniques proposed recently. In this work, we propose to assess the reliability of author and automated annotations for patch correctness assessment. We do this by first constructing a gold set of correctness labels for 189 randomly selected patches generated by 8 state-of-the-art ASR techniques, through a user study involving 35 professional developers as independent annotators. By measuring inter-rater agreement as a proxy for annotation quality, as is commonly done in the literature, we demonstrate that our constructed gold set is on par with other high-quality gold sets. We then compare labels generated by author and automated annotations with this gold set to assess the reliability of the two patch assessment methodologies. We subsequently report several findings and highlight implications for future studies.
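
The abstract describes comparing one set of correctness labels against another and using agreement as a quality proxy. As a rough illustration only, the sketch below computes Cohen's kappa between two label lists. The abstract does not say which agreement statistic the paper uses, so the choice of Cohen's kappa, the function name `cohen_kappa`, and the four toy labels are all assumptions made for this example and are not data from the study.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two equally long lists of categorical labels.

    Hypothetical helper for illustration; not taken from the paper.
    """
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("label lists must be non-empty and of equal length")

    n = len(labels_a)

    # Observed agreement: fraction of items given the same label by both.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Chance agreement: probability that both pick the same label if each
    # labels independently according to its own label frequencies.
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    categories = set(count_a) | set(count_b)
    expected = sum(count_a[c] * count_b[c] for c in categories) / (n * n)

    if expected == 1.0:
        # Both annotators used a single identical label throughout.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Toy labels (invented for this example, not from the study).
gold = ["correct", "incorrect", "incorrect", "correct"]
automated = ["correct", "correct", "incorrect", "correct"]

print(cohen_kappa(gold, automated))
```

A value of 1 indicates perfect agreement and a value near 0 indicates agreement no better than chance, which is why kappa-style statistics are often preferred over raw percentage agreement when label frequencies are skewed.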

Keywords

Automated program repair, empirical study, test case generation

Discipline

Software Engineering

Research Areas

Software and Cyber-Physical Systems

Publication

2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE): Montreal, Canada, May 25-31: Proceedings

First Page

524

Last Page

535

ISBN

9781728108698

Identifier

10.1109/ICSE.2019.00064

Publisher

IEEE

City or Country

Piscataway, NJ

Citation

LE, Xuan-Bach D.; BAO, Lingfeng; LO, David; XIA, Xin; LI, Shanping; and PASAREANU, Corina S. On reliability of patch correctness assessment. (2019). 2019 IEEE/ACM 41st International Conference on Software Engineering (ICSE): Montreal, Canada, May 25-31: Proceedings. 524-535.
Available at: https://ink.library.smu.edu.sg/sis_research/4481

Copyright Owner and License

Authors

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Additional URL

https://doi.org/10.1109/ICSE.2019.00064

Included in

Software Engineering Commons
