Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
12-2024
Abstract
Advanced natural language processing (NLP) models are increasingly applied in music composition and performance, particularly for generating vocal melodies and simulating singing voices. While NLP techniques have been effective in analyzing vocal performance data to assess quality and style, the automatic transcription of vocal performances into sheet music remains a significant challenge. Manual transcription tools often fall short due to the intricate dynamics of vocal expression. This study tackles the automation of vocal performance transcription into sheet music using innovative techniques, including large language models (LLMs). We propose a method to translate vocal audio input into display-ready sheet music effectively. Our findings reveal the strengths and limitations of various approaches, especially in transcribing a cappella performances into notes and lyrics. This research advances the field of NLP-driven music analysis and underscores the transformative potential of these models in vocal transcription.
Keywords
natural language processing, vocal transcription, sheet music automation, large language models, a cappella, music analysis, vocal melodies, music composition, singing voice simulation, transcription tools
Discipline
Artificial Intelligence and Robotics
Research Areas
Data Science and Engineering
Publication
Proceedings of the IEEE International Conference on Data Mining
Identifier
ICDMW65004.2024.00063
Publisher
IEEE
City or Country
Piscataway, New Jersey, USA
Citation
JIANG, Jinjing; NICOLE ANNE HUI-YING TEO; PEN, Haibo; HO, Seng-Beng; and WANG, Zhaoxia.
Converting vocal performances into sheet music leveraging large language models. (2024). Proceedings of the IEEE International Conference on Data Mining.
Available at: https://ink.library.smu.edu.sg/sis_research/9704
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/ICDMW65004.2024.00063