论文标题

ZEROC:一种用于零拍的概念识别和获取的神经符号模型

ZeroC: A Neuro-Symbolic Model for Zero-shot Concept Recognition and Acquisition at Inference Time

论文作者

Wu, Tailin, Tjandrasuwita, Megan, Wu, Zhengxuan, Yang, Xuelin, Liu, Kevin, Sosič, Rok, Leskovec, Jure

论文摘要

人类具有以零拍的方式识别和获取新颖的视觉概念的非凡能力。考虑到以前学到的视觉概念及其关系的高级,象征性的描述,人类可以识别新颖的概念而不看到任何例子。此外,他们可以通过学习的视觉概念和关系来解析和传达符号结构,从而获得新的概念。赋予机器中的这些功能在提高推理时提高其概括能力方面至关重要。在这项工作中,我们介绍了零拍的概念识别和获取(ZEROC),这是一种神经符号结构,可以以零拍的方式识别和获取新颖的概念。零表示概念是组成概念模型(节点)及其关系(作为边缘)的图表。为了允许推理时间组成,我们采用基于能量的模型(EBM)来建模概念和关系。我们设计了ZEROC体系结构,以便它可以在概念的符号图结构及其相应的EBM之间进行一对一的映射,该图是第一次允许在推理时间内获取新概念,传达其图形结构并将其应用于分类和检测任务(甚至跨域)。我们介绍了用于学习和推断Zeroc的算法。我们在一个充满挑战的网格世界数据集上评估了零,该数据集旨在探测零照片的概念识别和获取,并证明其功能。

Humans have the remarkable ability to recognize and acquire novel visual concepts in a zero-shot manner. Given a high-level, symbolic description of a novel concept in terms of previously learned visual concepts and their relations, humans can recognize novel concepts without seeing any examples. Moreover, they can acquire new concepts by parsing and communicating symbolic structures using learned visual concepts and relations. Endowing these capabilities in machines is pivotal in improving their generalization capability at inference time. In this work, we introduce Zero-shot Concept Recognition and Acquisition (ZeroC), a neuro-symbolic architecture that can recognize and acquire novel concepts in a zero-shot way. ZeroC represents concepts as graphs of constituent concept models (as nodes) and their relations (as edges). To allow inference time composition, we employ energy-based models (EBMs) to model concepts and relations. We design ZeroC architecture so that it allows a one-to-one mapping between a symbolic graph structure of a concept and its corresponding EBM, which for the first time, allows acquiring new concepts, communicating its graph structure, and applying it to classification and detection tasks (even across domains) at inference time. We introduce algorithms for learning and inference with ZeroC. We evaluate ZeroC on a challenging grid-world dataset which is designed to probe zero-shot concept recognition and acquisition, and demonstrate its capability.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源