论文标题
超越地图:更好地评估实例细分
Beyond mAP: Towards better evaluation of instance segmentation
论文作者
论文摘要
实例分割的正确性构成计算对象数量,正确定位所有预测并对每个局部预测进行分类。平均精度是用于测量分割的所有这些成分的事实上的度量。但是,该度量标准不会在高核心范围内惩罚重复的预测,并且无法区分正确局部但错误地分类的实例。这种弱点无意中导致了网络设计,从而在AP中获得了显着增长,但也引入了大量的假阳性。因此,我们不能依靠AP选择一个模型,该模型在误报和高召回率之间提供最佳的权衡。为了解决这一难题,我们回顾了文献中的替代指标,并提出了两种新措施,以明确测量空间和分类重复预测的数量。我们还建议使用语义排序和NMS模块,以根据像素占用匹配方案删除这些重复项。实验表明,现代分割网络在AP中具有显着增长,但也包含相当数量的重复项。我们的语义排序和NMS可以作为插件播放模块添加,以减轻对冲预测并保留AP。
Correctness of instance segmentation constitutes counting the number of objects, correctly localizing all predictions and classifying each localized prediction. Average Precision is the de-facto metric used to measure all these constituents of segmentation. However, this metric does not penalize duplicate predictions in the high-recall range, and cannot distinguish instances that are localized correctly but categorized incorrectly. This weakness has inadvertently led to network designs that achieve significant gains in AP but also introduce a large number of false positives. We therefore cannot rely on AP to choose a model that provides an optimal tradeoff between false positives and high recall. To resolve this dilemma, we review alternative metrics in the literature and propose two new measures to explicitly measure the amount of both spatial and categorical duplicate predictions. We also propose a Semantic Sorting and NMS module to remove these duplicates based on a pixel occupancy matching scheme. Experiments show that modern segmentation networks have significant gains in AP, but also contain a considerable amount of duplicates. Our Semantic Sorting and NMS can be added as a plug-and-play module to mitigate hedged predictions and preserve AP.