论文标题
重新思考曲折的扁平化以进行图像阅读
Rethinking the Zigzag Flattening for Image Reading
论文作者
论文摘要
单词矢量的顺序排序对于文本阅读非常重要,这在自然语言处理(NLP)中得到了证明。但是,计算机视觉(CV)中不同顺序排序的规则尚未得到很好的探索,例如,为什么````Zigzag)''扁平(ZF)通常被用作默认选项来获取视觉网络中的图像补丁订单。值得注意的是,当ZF无法维持ZF的范围时,ZF无法维持该范围的范围。 (HF)作为CV中的另一种序列和ZF的对比。研究图像阅读的扁平策略。
Sequence ordering of word vector matters a lot to text reading, which has been proven in natural language processing (NLP). However, the rule of different sequence ordering in computer vision (CV) was not well explored, e.g., why the ``zigzag" flattening (ZF) is commonly utilized as a default option to get the image patches ordering in vision networks. Notably, when decomposing multi-scale images, the ZF could not maintain the invariance of feature point positions. To this end, we investigate the Hilbert fractal flattening (HF) as another method for sequence ordering in CV and contrast it against ZF. The HF has proven to be superior to other curves in maintaining spatial locality, when performing multi-scale transformations of dimensional space. And it can be easily plugged into most deep neural networks (DNNs). Extensive experiments demonstrate that it can yield consistent and significant performance boosts for a variety of architectures. Finally, we hope that our studies spark further research about the flattening strategy of image reading.