论文标题
机器人技术的时尚地标检测和类别分类
Fashion Landmark Detection and Category Classification for Robotics
论文作者
论文摘要
由于对机器人服装操纵,自动化的服装分类和回收以及在线购物等领域的潜在影响,对服装类别和时尚地标的自动化,基于图像的识别的研究最近引起了人们的兴趣。已经创建了几个公开和注释的时尚数据集,以促进这一方向的研究进展。在这项工作中,我们迈出了利用基于视觉机器人服装操纵任务中时尚形象分析的数据和技术迈出的第一步。我们专注于可以从大规模时尚数据集推广到在机器人实验室中收集的结构化的小数据集的技术。具体而言,我们提出了培训数据增强方法,例如弹性翘曲,以及模型调整,例如旋转不变卷积,以使模型更具推广。我们的实验表明,在以前看不见的数据集测试时,我们的方法优于服装类别分类和时尚地标检测的艺术模型的表现。此外,我们在一个新的数据集中介绍了一个新数据集,该数据集由机器人拿着不同服装的图像组成,该图像在我们的实验室中收集了不同的服装。
Research on automated, image based identification of clothing categories and fashion landmarks has recently gained significant interest due to its potential impact on areas such as robotic clothing manipulation, automated clothes sorting and recycling, and online shopping. Several public and annotated fashion datasets have been created to facilitate research advances in this direction. In this work, we make the first step towards leveraging the data and techniques developed for fashion image analysis in vision-based robotic clothing manipulation tasks. We focus on techniques that can generalize from large-scale fashion datasets to less structured, small datasets collected in a robotic lab. Specifically, we propose training data augmentation methods such as elastic warping, and model adjustments such as rotation invariant convolutions to make the model generalize better. Our experiments demonstrate that our approach outperforms stateof-the art models with respect to clothing category classification and fashion landmark detection when tested on previously unseen datasets. Furthermore, we present experimental results on a new dataset composed of images where a robot holds different garments, collected in our lab.