Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
4-2024
Abstract
Non-inclusive language in software artefacts has been recognised as a serious problem. We describe a tool to find and fix non-inclusive language in a variety of GitHub repository artefacts. These include various README files, PDFs, code comments, and code. A wide variety of non-inclusive language including racist, ageist, ableist, violent and others are located and issues created, tagging the artefacts for checking. Suggested fixes can be generated using third-party LLM APIs, and approved changes made to documents, including code refactorings, and committed to the repository. The tool and evaluation data are available from: https://github. com/LiamTodd/github-inclusifier
Keywords
Inclusive language, refactoring, biased language, inappropriate language, software documentation, software maintenance tools
Discipline
Software Engineering
Research Areas
Software and Cyber-Physical Systems
Publication
ICSE-Companion '24: Proceedings of the 2024 IEEE/ACM 46th International Conference on Software Engineering, Lisbon, April 14-20
First Page
89
Last Page
93
ISBN
9798400705021
Identifier
10.1145/3639478.3640025
Publisher
ACM
City or Country
New York
Citation
TODD, Liam; GRUNDY, John; and TREUDE, Christoph.
GitHubInclusifier: Finding and fixing non-inclusive language in GitHub repositories. (2024). ICSE-Companion '24: Proceedings of the 2024 IEEE/ACM 46th International Conference on Software Engineering, Lisbon, April 14-20. 89-93.
Available at: https://ink.library.smu.edu.sg/sis_research/8918
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1145/3639478.3640025