Détail du document
Identifiant

oai:arXiv.org:2405.13527

Sujet
Computer Science - Sound Computer Science - Artificial Inte... Electrical Engineering and Systems...
Auteur
Zeng, Wei He, Xian Wang, Ye
Catégorie

Computer Science

Année

2024

Date de référencement

29/05/2024

Mots clés
human hierarchical recordings audio model piano science synthetic performance
Métrique

Résumé

Piano audio-to-score transcription (A2S) is an important yet underexplored task with extensive applications for music composition, practice, and analysis.

However, existing end-to-end piano A2S systems faced difficulties in retrieving bar-level information such as key and time signatures, and have been trained and evaluated with only synthetic data.

To address these limitations, we propose a sequence-to-sequence (Seq2Seq) model with a hierarchical decoder that aligns with the hierarchical structure of musical scores, enabling the transcription of score information at both the bar and note levels by multi-task learning.

To bridge the gap between synthetic data and recordings of human performance, we propose a two-stage training scheme, which involves pre-training the model using an expressive performance rendering (EPR) system on synthetic audio, followed by fine-tuning the model using recordings of human performance.

To preserve the voicing structure for score reconstruction, we propose a pre-processing method for **Kern scores in scenarios with an unconstrained number of voices.

Experimental results support the effectiveness of our proposed approaches, in terms of both transcription performance on synthetic audio data in comparison to the current state-of-the-art, and the first experiment on human recordings.

;Comment: 8 pages, 5 figures, accepted by IJCAI 2024 - AI, Arts & Creativity Track

Zeng, Wei,He, Xian,Wang, Ye, 2024, End-to-End Real-World Polyphonic Piano Audio-to-Score Transcription with Hierarchical Decoding

Document

Ouvrir

Partager

Source

Articles recommandés par ES/IODE IA

Lung cancer risk and exposure to air pollution: a multicenter North China case–control study involving 14604 subjects
lung cancer case–control air pollution never-smokers nomogram model controls lung-related 14604 subjects north polluted consistent smokers quit exposure lung cancer risk air people factor smoking pollution study history