论文标题

文档导航性:需要打印障碍

Document Navigability: A Need for Print-Impaired

论文作者

Kumar, Anukriti, Ganu, Tanuja, Guha, Saikat

论文摘要

对于盲人,低视觉和其他印刷(BLV)个人而言,印刷文档仍然是一个挑战。在本文中,我们重点介绍了对引文,脚注,数字,表和方程式内部引用的(内部参考)的特定问题。虽然视力用户可以翻转参考内容并在几秒钟内翻转,但BLV个人所依赖的线性音频叙事使这些参考文献非常困难。我们提出了一种基于视觉的技术来找到参考内容,并提取(在随后的工作中)将内容摘要列为音频叙事所需的元数据。我们将技术应用于科学文档中的引用,并发现它在出生数字和扫描文件上都很好地效果。

Printed documents continue to be a challenge for blind, low-vision, and other print-disabled (BLV) individuals. In this paper, we focus on the specific problem of (in-)accessibility of internal references to citations, footnotes, figures, tables and equations. While sighted users can flip to the referenced content and flip back in seconds, linear audio narration that BLV individuals rely on makes following these references extremely hard. We propose a vision based technique to locate the referenced content and extract metadata needed to (in subsequent work) inline a content summary into the audio narration. We apply our technique to citations in scientific documents and find it works well both on born-digital as well as scanned documents.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源