论文标题
Pirhdy:学习音调,节奏和动态感知符号音乐的嵌入
PiRhDy: Learning Pitch-, Rhythm-, and Dynamics-aware Embeddings for Symbolic Music
论文作者
论文摘要
当今深度学习中,确定的嵌入仍然是计算音乐学对符号音乐的基本挑战。音乐类似于自然语言,可以将音乐建模为一系列令牌。这激发了大多数现有解决方案探索单词嵌入模型构建音乐嵌入的利用。但是,音乐在两个关键方面与自然语言不同:(1)音乐令牌是多方面的 - 它包括音调,节奏和动态信息; (2)音乐背景是二维的 - 每个音乐令牌都取决于旋律和谐波背景。在这项工作中,我们通过提出一个名为Pirhdy的新型框架来提供全面的解决方案,该框架无缝地集成了音高,节奏和动态信息。 Pirhdy采用了一个分层策略,可以将其分解为两个步骤:(1)令牌(即注释事件)建模,该建模分别代表俯仰,节奏和动力学,并将它们集成到单个令牌嵌入中; (2)上下文建模,它利用旋律和谐波知识来训练令牌嵌入。对Pirhdy的每个组件和子策略进行了彻底的研究。我们进一步验证了三个下游任务中的嵌入 - 旋律完成,伴奏建议和流派分类。结果表明,象征音乐的神经方法以及Pirhdy作为广泛象征性音乐应用的验证工具的显着进步。
Definitive embeddings remain a fundamental challenge of computational musicology for symbolic music in deep learning today. Analogous to natural language, music can be modeled as a sequence of tokens. This motivates the majority of existing solutions to explore the utilization of word embedding models to build music embeddings. However, music differs from natural languages in two key aspects: (1) musical token is multi-faceted -- it comprises of pitch, rhythm and dynamics information; and (2) musical context is two-dimensional -- each musical token is dependent on both melodic and harmonic contexts. In this work, we provide a comprehensive solution by proposing a novel framework named PiRhDy that integrates pitch, rhythm, and dynamics information seamlessly. PiRhDy adopts a hierarchical strategy which can be decomposed into two steps: (1) token (i.e., note event) modeling, which separately represents pitch, rhythm, and dynamics and integrates them into a single token embedding; and (2) context modeling, which utilizes melodic and harmonic knowledge to train the token embedding. A thorough study was made on each component and sub-strategy of PiRhDy. We further validate our embeddings in three downstream tasks -- melody completion, accompaniment suggestion, and genre classification. Results indicate a significant advancement of the neural approach towards symbolic music as well as PiRhDy's potential as a pretrained tool for a broad range of symbolic music applications.