Publication Type

Conference Proceeding Article

Version

acceptedVersion

Publication Date

12-2024

Abstract

Advanced natural language processing (NLP) models are increasingly applied in music composition and performance, particularly for generating vocal melodies and simulating singing voices. While NLP techniques have been effective in analyzing vocal performance data to assess quality and style, the automatic transcription of vocal performances into sheet music remains a significant challenge. Manual transcription tools often fall short due to the intricate dynamics of vocal expression. This study tackles the automation of vocal performance transcription into sheet music using innovative techniques, including large language models (LLMs). We propose a method to translate vocal audio input into display-ready sheet music effectively. Our findings reveal the strengths and limitations of various approaches, especially in transcribing a cappella performances into notes and lyrics. This research advances the field of NLP-driven music analysis and underscores the transformative potential of these models in vocal transcription.

Keywords

natural language processing, vocal transcription, sheet music automation, large language models, a cappella, music analysis, vocal melodies, music composition, singing voice simulation, transcription tools

Discipline

Artificial Intelligence and Robotics

Research Areas

Data Science and Engineering

Publication

Proceedings of the IEEE International Conference on Data Mining

Identifier

ICDMW65004.2024.00063

Publisher

IEEE

City or Country

Piscataway, New Jersey, USA

Additional URL

https://doi.org/ICDMW65004.2024.00063

Share

COinS