Paper Title
Videogenic: Identifying Highlight Moments in Videos with Professional Photographs as a Prior
Authors
Abstract
This paper investigates the challenge of extracting highlight moments from videos. To perform this task, we need to understand what constitutes a highlight for arbitrary video domains while at the same time being able to scale across different domains. Our key insight is that photographs taken by photographers tend to capture the most remarkable or photogenic moments of an activity. Drawing on this insight, we present Videogenic, a technique capable of creating domain-specific highlight videos for a diverse range of domains. In a human evaluation study (N=50), we show that a high-quality photograph collection combined with CLIP-based retrieval (which uses a neural network with semantic knowledge of images) can serve as an excellent prior for finding video highlights. In a within-subjects expert study (N=12), we demonstrate the usefulness of Videogenic in helping video editors create highlight videos with lighter workload, shorter task completion time, and better usability.
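The idea of using a photograph collection as a prior can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that video frames and photographs have already been embedded by an image encoder such as CLIP, and the function names (`frame_scores`, `best_window`), the mean-similarity scoring rule, and the fixed-length window are all hypothetical choices made for illustration. Toy 2-D vectors stand in for real embeddings.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def frame_scores(frame_embs, photo_embs):
    """Score each frame by its mean similarity to the photo collection.

    Assumption: mean similarity is an illustrative scoring rule,
    not necessarily the one used in the paper.
    """
    return [
        sum(cosine(f, p) for p in photo_embs) / len(photo_embs)
        for f in frame_embs
    ]


def best_window(scores, window):
    """Return the start index of the contiguous window with the highest total score."""
    current = sum(scores[:window])
    best_start, best_sum = 0, current
    for i in range(1, len(scores) - window + 1):
        # Slide the window one frame to the right.
        current += scores[i + window - 1] - scores[i - 1]
        if current > best_sum:
            best_start, best_sum = i, current
    return best_start


# Toy embeddings (hypothetical): photos point along the first axis,
# and only frames 2 and 3 resemble them.
photos = [[1.0, 0.0], [0.9, 0.1]]
frames = [[0.0, 1.0], [0.1, 1.0], [1.0, 0.1], [1.0, 0.0], [0.0, 1.0]]

scores = frame_scores(frames, photos)
start = best_window(scores, window=2)
print("highlight starts at frame", start)
```

In a real pipeline the toy vectors would be replaced by encoder outputs for sampled video frames and for a domain-specific set of professional photographs; the selected window would then be cut into the highlight clip.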