论文标题

基于摄像机的钢琴乐谱识别

Camera-Based Piano Sheet Music Identification

论文作者

Yang, Daniel, Tsai, TJ

论文摘要

本文介绍了一种大规模检索钢琴表音乐图像的方法。我们的作品与以前关于乐谱音乐检索的研究不同。首先,我们使用整个IMSLP数据集中的所有独奏钢琴乐谱图像作为可搜索的数据库,研究问题比以前的研究大得多。其次,我们使用乐谱音乐的手机图像作为我们的输入查询,该查询将自己适用于实用的,面向用户的应用程序。我们表明,以前提出的用于乐谱检索的指纹方法对于实时应用来说太慢了,我们诊断出其缺点。我们提出了一种新颖的哈希方案,称为动态N-gram指纹识别,该方案可显着降低运行时,同时提高检索精度。在IMSLP数据的实验中,我们提出的方法的平均相互等级为0.85,平均运行时间为0.98秒。

This paper presents a method for large-scale retrieval of piano sheet music images. Our work differs from previous studies on sheet music retrieval in two ways. First, we investigate the problem at a much larger scale than previous studies, using all solo piano sheet music images in the entire IMSLP dataset as a searchable database. Second, we use cell phone images of sheet music as our input queries, which lends itself to a practical, user-facing application. We show that a previously proposed fingerprinting method for sheet music retrieval is far too slow for a real-time application, and we diagnose its shortcomings. We propose a novel hashing scheme called dynamic n-gram fingerprinting that significantly reduces runtime while simultaneously boosting retrieval accuracy. In experiments on IMSLP data, our proposed method achieves a mean reciprocal rank of 0.85 and an average runtime of 0.98 seconds per query.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源