Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
6-2022
Abstract
A conversation corpus is essential for building interactive AI applications. However, the demographic information of the participants in such corpora is largely underexplored, mainly because many corpora lack individual-level data. In this work, we analyze a Korean nationwide daily conversation corpus constructed by the National Institute of Korean Language (NIKL) to characterize the participation of different demographic (age and sex) groups in the corpus.
Keywords
Text categorization, topic recognition, demographic/gender/age identification, Human computer interaction, social media tools, navigation and visualization, New social media applications, interfaces, interaction techniques, Psychological, personality-based and ethnographic studies of social media
Discipline
Data Storage Systems | Social Media
Research Areas
Data Science and Engineering
Publication
Proceedings of the Sixteenth International AAAI Conference on Web and Social Media, Atlanta, Georgia, 2022 June 6-9
First Page
1409
Last Page
1413
Identifier
10.1609/icwsm.v16i1.19397
Publisher
Association for the Advancement of Artificial Intelligence
City or Country
Menlo Park, California, United States
Citation
KWAK, Haewoon; AN, Jisun; and PARK, Kunwoo.
Who is missing? Characterizing the participation of different demographic groups in a Korean nationwide daily conversation corpus. (2022). Proceedings of the Sixteenth International AAAI Conference on Web and Social Media, Atlanta, Georgia, 2022 June 6-9. 1409-1413.
Available at: https://ink.library.smu.edu.sg/sis_research/7499
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Additional URL
https://doi.org/10.1609/icwsm.v16i1.19397