Publication Type
Journal Article
Version
acceptedVersion
Publication Date
4-2025
Abstract
Automatic summarisation has been used efficiently in recent years to condense texts, conversations, audio, code, and various other artefacts. A range of methods, from simple template-based summaries to complex machine learning techniques -- and more recently, large language models -- have been employed to generate these summaries. Summarising software design patterns is important because it helps developers quickly understand and reuse complex design concepts, thereby improving software maintainability and development efficiency. However, the generation of summaries for software design patterns has not yet been explored.Our approach utilises code features and JavaParser to parse the code and create a JSON representation. Using an NLG library on this JSON representation, we convert it into natural language text that acts as a summary of the code, capturing the contextual information of the design pattern. Our empirical results indicate that the summaries generated by our approach capture the context in which patterns are applied in the codebase. Statistical evaluations demonstrate that our summaries closely align with human-written summaries, as evident from high values in the ROUGE-L, BLEU-4, NIST, and FrugalScore metrics. A follow-up survey further shows that DPS summaries were rated as capturing context better than human-generated summaries. Additionally, a time based task activity shows that summaries increase the time of understanding of design pattern for developer better than when the summaries are not present.
Keywords
Design Patterns, Code Features, Summarisation, JSON, JavaParser
Discipline
Software Engineering
Research Areas
Intelligent Systems and Optimization
Areas of Excellence
Digital transformation
Publication
Empirical Software Engineering
First Page
1
Last Page
32
ISSN
1382-3256
Identifier
10.48550/arXiv.2504.11081
Publisher
Springer
Citation
NAZAR, Najam; SIKKA, Sameer; and TREUDE, Christoph.
DPS: Design pattern summarisation using code features. (2025). Empirical Software Engineering. 1-32.
Available at: https://ink.library.smu.edu.sg/sis_research/10626
Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.