Paper Title

Content-adaptive Encoder Preset Prediction for Adaptive Live Streaming

Authors

Vignesh V Menon, Hadi Amirpour, Prajit T Rajendran, Mohammad Ghanbari, Christian Timmerer

Abstract

In live streaming applications, a fixed set of bitrate-resolution pairs (known as a bitrate ladder) is generally used to avoid the additional pre-processing run-time needed to analyze the complexity of every video content and determine an optimized bitrate ladder. Furthermore, live encoders use the fastest available preset for encoding to ensure the minimum possible latency in streaming. For live encoders, it is expected that the encoding speed is equal to the video framerate. An optimized encoding preset may result in (i) increased Quality of Experience (QoE) and (ii) improved CPU utilization while encoding. In this light, this paper introduces a Content-Adaptive encoder Preset prediction Scheme (CAPS) for adaptive live video streaming applications. In this scheme, the encoder preset is determined using Discrete Cosine Transform (DCT)-energy-based low-complexity spatial and temporal features for every video segment, the number of CPU threads allocated for each encoding instance, and the target encoding speed. Experimental results show that CAPS yields an overall quality improvement of 0.83 dB PSNR and 3.81 VMAF at the same bitrate, compared to the fastest preset encoding of the HTTP Live Streaming (HLS) bitrate ladder using the x265 open-source HEVC encoder. This is achieved by maintaining the desired encoding speed and reducing CPU idle time.
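
The abstract does not spell out the exact selection rule, so the following is only a minimal sketch of one plausible reading: given a predicted encoding speed for each x265 preset, pick the slowest preset that still meets the target speed. The function name `select_preset` and the `predicted_fps` dictionary are hypothetical. In CAPS the speed estimates would come from a model driven by the DCT-energy features and the thread count, which is not reproduced here.

```python
# x265 preset names, ordered from fastest to slowest.
X265_PRESETS = [
    "ultrafast", "superfast", "veryfast", "faster", "fast",
    "medium", "slow", "slower", "veryslow", "placebo",
]


def select_preset(predicted_fps, target_fps):
    """Return the slowest preset whose predicted speed meets target_fps.

    predicted_fps: hypothetical dict mapping preset name -> predicted
        encoding speed in frames per second (assumed to come from some
        external prediction model; not part of the abstract).
    target_fps: required encoding speed, e.g. the video framerate.

    Falls back to the fastest preset when no preset meets the target.
    """
    chosen = X265_PRESETS[0]
    for preset in X265_PRESETS:
        fps = predicted_fps.get(preset)
        if fps is not None and fps >= target_fps:
            chosen = preset  # a slower preset that is still fast enough
    return chosen


if __name__ == "__main__":
    # Illustrative numbers only, not measurements from the paper.
    example = {"ultrafast": 120, "superfast": 90, "veryfast": 60,
               "faster": 45, "fast": 30, "medium": 20}
    print(select_preset(example, target_fps=30))
```

Choosing the slowest preset that still reaches the target speed reflects the abstract's stated goal: keep the encoding speed at the framerate while spending otherwise idle CPU time on quality.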
