论文标题

低质量视频的动作识别模型的绩效评估

Performance Evaluation of Action Recognition Models on Low Quality Videos

论文作者

Otani, Aoi, Hashiguchi, Ryota, Omi, Kazuki, Fukushima, Norishige, Tamaki, Toru

论文摘要

在行动识别模型的设计中,视频的质量是一个重要的问题。但是,质量和性能之间的权衡通常被忽略。一般而言,动作识别模型经过高质量视频的培训,因此,尚不知道模型性能在低质量视频上测试时如何降低,培训视频的质量对性能有多大影响。视频质量问题很重要,但是到目前为止尚未对其进行研究。这项研究的目的是通过定量绩效评估培训和测试视频的质量之间的权衡,该绩效评估的几种动作识别模型的质量不同。首先,我们展示视频质量如何影响预训练模型的性能。我们通过更改JPEG(压缩强度)和H.264/AVC(CRF)的质量控制参数(CRF)来转码Kinetics400的原始验证视频。然后,我们使用转码视频来验证预训练的模型。其次,我们展示了模型在经过编码的视频中训练时的性能。我们通过更改JPEG和H.264/AVC的质量参数来转码Kinetics400的原始培训视频。然后,我们将模型训练在经过跨编码的培训视频上,并使用原始和转码验证视频进行验证。 JPEG转编码的实验结果表明,在没有视觉观察到未观察到质量下降的情况下,压缩强度没有严重的性能降解(高达-1.5%),并且在80个大于80的情况下,相对于质量指数线性降级。 H.264/AVC转编码的实验表明,CRF30没有明显的性能损失(高达-1%),而视频文件的总尺寸减少到30%。

In the design of action recognition models, the quality of videos is an important issue; however, the trade-off between the quality and performance is often ignored. In general, action recognition models are trained on high-quality videos, hence it is not known how the model performance degrades when tested on low-quality videos, and how much the quality of training videos affects the performance. The issue of video quality is important, however, it has not been studied so far. The goal of this study is to show the trade-off between the performance and the quality of training and test videos by quantitative performance evaluation of several action recognition models for transcoded videos in different qualities. First, we show how the video quality affects the performance of pre-trained models. We transcode the original validation videos of Kinetics400 by changing quality control parameters of JPEG (compression strength) and H.264/AVC (CRF). Then we use the transcoded videos to validate the pre-trained models. Second, we show how the models perform when trained on transcoded videos. We transcode the original training videos of Kinetics400 by changing the quality parameters of JPEG and H.264/AVC. Then we train the models on the transcoded training videos and validate them with the original and transcoded validation videos. Experimental results with JPEG transcoding show that there is no severe performance degradation (up to -1.5%) for compression strength smaller than 70 where no quality degradation is visually observed, and for larger than 80 the performance degrades linearly with respect to the quality index. Experiments with H.264/AVC transcoding show that there is no significant performance loss (up to -1%) with CRF30 while the total size of video files is reduced to 30%.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源