用于信息提取的跨域变化胶囊

论文标题

用于信息提取的跨域变化胶囊

Cross-domain Variational Capsules for Information Extraction

论文作者

Nagaraj, Akash, K, Akhil, Venkatesh, Akshay, HR, Srikanth

论文摘要

在本文中，我们提出了一种特征提取算法和特征标记图像的多域图像特征数据集，以模拟人脑对跨域信息进行分类并生成洞察力的方式。目的是确定数据中的突出特征，并使用这种识别机制自动从其他看不见的域中的数据自动产生洞察力。提出了一种信息提取算法，该算法是变异自动编码器（VAE）和胶囊网络的组合。胶囊网络用于将图像分解为其各个特征，VAE用于探索这些分解功能的变化。因此，使该模型从数据的变化中识别特征来稳健。值得注意的一点是，该算法使用数据的有效分层解码，这有助于更丰富的输出解释。在属于各个域的图像中包含可见特征的数据集数量中，多域图像特征数据集创建并公开可用。它由跨三个领域的数千个图像组成。该数据集的创建是为了在将来引入针对细粒度特征识别任务的新基准。

In this paper, we present a characteristic extraction algorithm and the Multi-domain Image Characteristics Dataset of characteristic-tagged images to simulate the way a human brain classifies cross-domain information and generates insight. The intent was to identify prominent characteristics in data and use this identification mechanism to auto-generate insight from data in other unseen domains. An information extraction algorithm is proposed which is a combination of Variational Autoencoders (VAEs) and Capsule Networks. Capsule Networks are used to decompose images into their individual features and VAEs are used to explore variations on these decomposed features. Thus, making the model robust in recognizing characteristics from variations of the data. A noteworthy point is that the algorithm uses efficient hierarchical decoding of data which helps in richer output interpretation. Noticing a dearth in the number of datasets that contain visible characteristics in images belonging to various domains, the Multi-domain Image Characteristics Dataset was created and made publicly available. It consists of thousands of images across three domains. This dataset was created with the intent of introducing a new benchmark for fine-grained characteristic recognition tasks in the future.

下载PDF全文

下载文献需遵守相关版权规定

论文标题