Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
4-2023
Abstract
Audio descriptions (AD) make videos accessible to those who cannot see them. But many videos lack AD and remain inaccessible as traditional approaches involve expensive professional production. We aim to lower production costs by involving novices in this process. We present an AD authoring system that supports novices to write scene descriptions (SD)—textual descriptions of video scenes—and convert them into AD via text-to-speech. The system combines video scene recognition and natural language processing to review novice-written SD and feeds back what to mention automatically. To assess the effectiveness of this automatic feedback in supporting novices, we recruited 60 participants to author SD with no feedback, human feedback, and automatic feedback. Our study shows that automatic feedback improves SD’s descriptiveness, objectiveness, and learning quality, without affecting qualities like sufficiency and clarity. Though human feedback remains more effective, automatic feedback can reduce production costs by 45%.
Keywords
accessibility, AI-supported writing, individuals with disabilities, assistive technologies
Discipline
Artificial Intelligence and Robotics | Databases and Information Systems | Graphics and Human Computer Interfaces
Research Areas
Intelligent Systems and Optimization
Publication
CHI '23: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, Hamburg, Germany, April 23-28
First Page
1
Last Page
18
ISBN
9781450394215
Identifier
10.1145/3544548.3581023
Publisher
ACM
City or Country
New York
Citation
NATALIE, Rosiana; TSENG, Joshua Shi-hao; KACORRI, Hernisa; and HARA, Kotaro.
Supporting novices author audio descriptions via automatic feedback. (2023). CHI '23: Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems, Hamburg, Germany, April 23-28. 1-18.
Available at: https://ink.library.smu.edu.sg/sis_research/8318
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1145/3544548.3581023
Included in
Artificial Intelligence and Robotics Commons, Databases and Information Systems Commons, Graphics and Human Computer Interfaces Commons