Publication Type

Conference Proceeding Article

Version

acceptedVersion

Publication Date

12-2024

Abstract

Advanced natural language processing (NLP) models are increasingly applied in music composition and performance, particularly for generating vocal melodies and simulating singing voices. While NLP techniques have been effective in analyzing vocal performance data to assess quality and style, the automatic transcription of vocal performances into sheet music remains a significant challenge. Manual transcription tools often fall short due to the intricate dynamics of vocal expression. This study tackles the automation of vocal performance transcription into sheet music using innovative techniques, including large language models (LLMs). We propose a method to translate vocal audio input into display-ready sheet music effectively. Our findings reveal the strengths and limitations of various approaches, especially in transcribing a cappella performances into notes and lyrics. This research advances the field of NLP-driven music analysis and underscores the transformative potential of these models in vocal transcription.

Keywords

natural language processing, vocal transcription, sheet music automation, large language models, a cappella, music analysis, vocal melodies, music composition, singing voice simulation, transcription tools

Discipline

Artificial Intelligence and Robotics | Music

Research Areas

Data Science and Engineering

Publication

2024 IEEE International Conference on Data Mining Workshops (ICDMW): December 9, Abu Dhabi: Proceedings

First Page

445

Last Page

452

ISBN

9798331530631

Identifier

10.1109/ICDMW65004.2024.00063

Publisher

IEEE

City or Country

Piscataway, NJ

Copyright Owner and License

Authors

Additional URL

https://doi.org/10.1109/ICDMW65004.2024.00063

Share

COinS