论文标题
BEV-LANEDET:简单有效的3D车道检测基线
BEV-LaneDet: a Simple and Effective 3D Lane Detection Baseline
论文作者
论文摘要
3D车道检测在车辆路由中起着至关重要的作用,最近一直是自主驾驶中迅速发展的主题。以前的作品由于其复杂的空间转换和3D车道的僵化表示而与实用性作斗争。面对这些问题,我们的工作提出了一个有效且健壮的单眼3D车道检测,称为BEV-LANEDET,并具有三个主要贡献。首先,我们介绍了统一安装在不同车辆上的相机的IN/外部参数的虚拟摄像头,以确保摄像机之间空间关系的一致性。由于统一的视觉空间,它可以有效地促进学习过程。其次,我们提出了一个简单但有效的3D车道表示,称为密钥点表示。该模块更适合表示复杂和多样化的3D车道结构。最后,我们提出一个名为“空间转换金字塔”的轻巧和芯片友好的空间变换模块,以将多尺度前视特征转换为BEV功能。实验结果表明,我们的工作在F-SCORE方面优于最先进的方法,在Openlane数据集上高出10.6%,而Apollo 3D合成数据集则高出5.9%,速度为185 fps。源代码将在https://github.com/gigo-team/bev_lane_det上发布。
3D lane detection which plays a crucial role in vehicle routing, has recently been a rapidly developing topic in autonomous driving. Previous works struggle with practicality due to their complicated spatial transformations and inflexible representations of 3D lanes. Faced with the issues, our work proposes an efficient and robust monocular 3D lane detection called BEV-LaneDet with three main contributions. First, we introduce the Virtual Camera that unifies the in/extrinsic parameters of cameras mounted on different vehicles to guarantee the consistency of the spatial relationship among cameras. It can effectively promote the learning procedure due to the unified visual space. We secondly propose a simple but efficient 3D lane representation called Key-Points Representation. This module is more suitable to represent the complicated and diverse 3D lane structures. At last, we present a light-weight and chip-friendly spatial transformation module named Spatial Transformation Pyramid to transform multiscale front-view features into BEV features. Experimental results demonstrate that our work outperforms the state-of-the-art approaches in terms of F-Score, being 10.6% higher on the OpenLane dataset and 5.9% higher on the Apollo 3D synthetic dataset, with a speed of 185 FPS. The source code will released at https://github.com/gigo-team/bev_lane_det.