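Abstract (translated from Chinese):
Compared with traditional video, omnidirectional video supports head-mounted displays and allows viewing in any direction on a sphere, providing an interactive and immersive experience. Unfortunately, to the best of our knowledge, few subjective or objective visual quality assessment (VQA) methods exist for omnidirectional video coding. This paper proposes both subjective and objective methods for assessing the quality loss caused by encoding omnidirectional video. Specifically, we first present a new database containing viewing direction data from several subjects watching omnidirectional video sequences. From this database, we find that viewing directions are highly consistent across subjects. The viewing directions are normally distributed in the center of the front region, but they sometimes fall into other regions, depending on the video content. Based on this finding, we present a subjective VQA method that measures the difference mean opinion score (DMOS) of the whole omnidirectional video and of its regions, as overall DMOS (O-DMOS) and vectorized DMOS (V-DMOS), respectively. In addition, in light of the perceptual characteristics of omnidirectional video, we propose two objective VQA methods for encoded omnidirectional video. One method accounts for human preference in a panorama by weighting the distortion of each pixel according to its distance from the center of the front region. The other method predicts viewing directions from the video content and then uses the predicted viewing directions to weight the distortion of each pixel in the objective VQA method. Finally, our experimental results verify that both the subjective and objective VQA methods proposed in this paper advance the state of the art in VQA for omnidirectional video.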
---
English title:
Assessing Visual Quality of Omnidirectional Videos
---
Authors:
Mai Xu, Chen Li, Zulin Wang, Zhenzhong Chen, Zhenyu Guan
---
Latest submission year:
2019
---
Classification:
Primary category: Electrical Engineering and Systems Science
Secondary category: Image and Video Processing
Category description: Theory, algorithms, and architectures for the formation, capture, processing, communication, analysis, and display of images, video, and multidimensional signals in a wide variety of applications. Topics of interest include: mathematical, statistical, and perceptual image and video modeling and representation; linear and nonlinear filtering, de-blurring, enhancement, restoration, and reconstruction from degraded, low-resolution or tomographic data; lossless and lossy compression and coding; segmentation, alignment, and recognition; image rendering, visualization, and printing; computational imaging, including ultrasound, tomographic and magnetic resonance imaging; and image and video analysis, synthesis, storage, search and retrieval.
---
Abstract (original English):
In contrast with traditional video, omnidirectional video enables spherical viewing direction with support for head-mounted displays, providing an interactive and immersive experience. Unfortunately, to the best of our knowledge, there are few visual quality assessment (VQA) methods, either subjective or objective, for omnidirectional video coding. This paper proposes both subjective and objective methods for assessing quality loss in encoding omnidirectional video. Specifically, we first present a new database, which includes the viewing direction data from several subjects watching omnidirectional video sequences. Then, from our database, we find a high consistency in viewing directions across different subjects. The viewing directions are normally distributed in the center of the front regions, but they sometimes fall into other regions, related to video content. Given this finding, we present a subjective VQA method for measuring difference mean opinion score (DMOS) of the whole and regional omnidirectional video, in terms of overall DMOS (O-DMOS) and vectorized DMOS (V-DMOS), respectively. Moreover, we propose two objective VQA methods for encoded omnidirectional video, in light of human perception characteristics of omnidirectional video. One method weighs the distortion of pixels with regard to their distances to the center of front regions, which considers human preference in a panorama. The other method predicts viewing directions according to video content, and then the predicted viewing directions are leveraged to allocate weights to the distortion of each pixel in our objective VQA method. Finally, our experimental results verify that both the subjective and objective methods proposed in this paper advance state-of-the-art VQA for omnidirectional video.
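The first objective method described above weights each pixel's distortion by its distance to the center of the front region. The following is only a minimal sketch of that idea, not the authors' method: it assumes an equirectangular frame stored as a nested list of luma values, takes the front center to be longitude 0 and latitude 0, and uses a Gaussian falloff with a hypothetical `sigma` in place of whatever weight map the paper derives from its viewing direction data. The function names are illustrative.

```python
import math


def angular_distance_to_front(row, col, height, width):
    """Great-circle angle (radians) between a pixel of an equirectangular
    frame and the assumed front center (longitude 0, latitude 0)."""
    lon = ((col + 0.5) / width - 0.5) * 2.0 * math.pi   # in (-pi, pi)
    lat = (0.5 - (row + 0.5) / height) * math.pi        # in (-pi/2, pi/2)
    # On the unit sphere, the angle d to (0, 0) satisfies
    # cos(d) = cos(lat) * cos(lon); clamp against rounding error.
    cos_d = max(-1.0, min(1.0, math.cos(lat) * math.cos(lon)))
    return math.acos(cos_d)


def center_weighted_psnr(reference, distorted,
                         sigma=math.radians(40.0), peak=255.0):
    """PSNR in which each pixel's squared error is weighted by an ASSUMED
    Gaussian falloff of its angular distance from the front center."""
    height, width = len(reference), len(reference[0])
    weighted_error = 0.0
    weight_sum = 0.0
    for r in range(height):
        for c in range(width):
            d = angular_distance_to_front(r, c, height, width)
            w = math.exp(-(d * d) / (2.0 * sigma * sigma))
            diff = reference[r][c] - distorted[r][c]
            weighted_error += w * diff * diff
            weight_sum += w
    mse = weighted_error / weight_sum
    if mse == 0.0:
        return math.inf  # identical frames
    return 10.0 * math.log10(peak * peak / mse)


def flat_frame(height, width, value):
    """Helper for the usage example: a constant-valued frame."""
    return [[value] * width for _ in range(height)]
```

Under these assumptions, the same error placed near the front center lowers the score more than when it is placed near the back of the sphere, which is the behavior the abstract describes. A different falloff, or a weight map fitted to measured viewing directions, could be swapped in without changing the rest of the computation.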
---
PDF link:
https://arxiv.org/pdf/1709.06342

