重新创造创作：抒情循环的新范式

论文标题

重新创造创作：抒情循环的新范式

Re-creation of Creations: A New Paradigm for Lyric-to-Melody Generation

论文作者

Lv, Ang, Tan, Xu, Qin, Tao, Liu, Tie-Yan, Yan, Rui

论文摘要

歌词到融合的生成是歌曲创作的重要任务，并且由于其独特的特征也很具有挑战性：生成的旋律不仅应该遵循良好的音乐模式，而且还应与节奏和结构等歌词中的功能保持一致。由于几个问题，这些特征无法通过以端到端方式学习抒情式映射的神经产生模型可以很好地处理：（1）缺乏对齐的抒情式锻炼数据，以充分学习抒情术特征对齐；（2）产生缺乏可控性，无法更好地和明确地对齐抒情液特征。在本文中，我们建议重新创建Creations（ROC），这是一种新的抒情性到同伴的范式。 ROC根据给定的歌词以及用户指定的和弦进展的条件生成旋律。它通过一代网络管道解决了上述问题。具体而言，我们的范式有两个阶段：（1）创建阶段，其中通过几个关键功能（例如和弦，音调，节奏和结构信息）在数据库中索引了由神经旋律语言模型产生的大量音乐片段；（2）重新创建阶段，根据歌词的关键功能从数据库中检索音乐片段，并根据构图指南和旋律语言模型分数串联最佳音乐片段。 ROC具有多个优点：（1）它只需要未配对的旋律数据来训练旋律语言模型，而不是以前模型中配对的抒情数据。（2）它在抒情循环的生成中实现了良好的抒情式特征对齐。在由英语和中文的歌词测试中，ROC在客观和主观指标上都胜过以前的基于神经的歌词到循环模型。

Lyric-to-melody generation is an important task in songwriting, and is also quite challenging due to its unique characteristics: the generated melodies should not only follow good musical patterns, but also align with features in lyrics such as rhythms and structures. These characteristics cannot be well handled by neural generation models that learn lyric-to-melody mapping in an end-to-end way, due to several issues: (1) lack of aligned lyric-melody training data to sufficiently learn lyric-melody feature alignment; (2) lack of controllability in generation to better and explicitly align the lyric-melody features. In this paper, we propose Re-creation of Creations (ROC), a new paradigm for lyric-to-melody generation. ROC generates melodies according to given lyrics and also conditions on user-designated chord progression. It addresses the above issues through a generation-retrieval pipeline. Specifically, our paradigm has two stages: (1) creation stage, where a huge amount of music fragments generated by a neural melody language model are indexed in a database through several key features (e.g., chords, tonality, rhythm, and structural information); (2) re-creation stage, where melodies are re-created by retrieving music fragments from the database according to the key features from lyrics and concatenating best music fragments based on composition guidelines and melody language model scores. ROC has several advantages: (1) It only needs unpaired melody data to train melody language model, instead of paired lyric-melody data in previous models. (2) It achieves good lyric-melody feature alignment in lyric-to-melody generation. Tested by English and Chinese lyrics, ROC outperforms previous neural based lyric-to-melody generation models on both objective and subjective metrics.

下载PDF全文

下载文献需遵守相关版权规定

论文标题