论文标题
神经架构在计算机视觉中寻找密集的预测任务
Neural Architecture Search for Dense Prediction Tasks in Computer Vision
论文作者
论文摘要
近年来,深度学习的成功导致对神经网络建筑工程的需求不断上升。结果,旨在以数据驱动的方式而不是手动设计神经网络体系结构的神经体系结构搜索(NAS)已经发展为流行的研究领域。随着体重共享策略的出现,NAS已适用于更广泛的问题。特别是,现在有许多用于计算机视觉中的密集预测任务的出版物,它们需要像素级预测,例如语义分割或对象检测。这些任务带有新颖的挑战,例如由于高分辨率数据,学习多尺度表示,较长的培训时间以及更复杂和更大的神经体系结构而引起的更高记忆足迹。在本手稿中,我们通过详细阐述这些新颖的挑战和调查方法来解决它们,以减轻现有方法在新型问题上的未来研究和应用,以提供密集预测任务的NAS概述。
The success of deep learning in recent years has lead to a rising demand for neural network architecture engineering. As a consequence, neural architecture search (NAS), which aims at automatically designing neural network architectures in a data-driven manner rather than manually, has evolved as a popular field of research. With the advent of weight sharing strategies across architectures, NAS has become applicable to a much wider range of problems. In particular, there are now many publications for dense prediction tasks in computer vision that require pixel-level predictions, such as semantic segmentation or object detection. These tasks come with novel challenges, such as higher memory footprints due to high-resolution data, learning multi-scale representations, longer training times, and more complex and larger neural architectures. In this manuscript, we provide an overview of NAS for dense prediction tasks by elaborating on these novel challenges and surveying ways to address them to ease future research and application of existing methods to novel problems.