论文标题
通过连续转换进行分类标准化流
Categorical Normalizing Flows via Continuous Transformations
论文作者
论文摘要
尽管它们很受欢迎,但迄今为止,将流量正常化在分类数据上的应用仍有限制。当前使用取消化将离散数据映射到连续空间的实践是不适用的,因为分类数据没有内在的顺序。取而代之的是,分类数据具有必须推断的复杂和潜在关系,就像单词之间的同义词一样。在本文中,我们研究了\ emph {分类归一化流},即对分类数据的流量进行标准化。通过将连续空间中的分类数据编码作为变异推理问题,我们可以共同优化连续表示和模型的可能性。使用分解的解码器,我们引入了一个电感偏置,以模拟归一化流中的任何相互作用。结果,我们不仅与拥有关节解码器相比简化了优化,而且还可以使扩展到大量类别,而这些类别目前不可能通过离散的归一化流量进行扩展。基于分类归一化流,我们提出了GraphCnf图形上的置换不变的生成模型。 GraphCNF实现了三个步骤的方法,以逐步建模节点,边缘和邻接矩阵,以提高效率。关于分子的生成,GraphCNF的表现均优于单发和自回归基于流动的最先进。
Despite their popularity, to date, the application of normalizing flows on categorical data stays limited. The current practice of using dequantization to map discrete data to a continuous space is inapplicable as categorical data has no intrinsic order. Instead, categorical data have complex and latent relations that must be inferred, like the synonymy between words. In this paper, we investigate \emph{Categorical Normalizing Flows}, that is normalizing flows for categorical data. By casting the encoding of categorical data in continuous space as a variational inference problem, we jointly optimize the continuous representation and the model likelihood. Using a factorized decoder, we introduce an inductive bias to model any interactions in the normalizing flow. As a consequence, we do not only simplify the optimization compared to having a joint decoder, but also make it possible to scale up to a large number of categories that is currently impossible with discrete normalizing flows. Based on Categorical Normalizing Flows, we propose GraphCNF a permutation-invariant generative model on graphs. GraphCNF implements a three step approach modeling the nodes, edges and adjacency matrix stepwise to increase efficiency. On molecule generation, GraphCNF outperforms both one-shot and autoregressive flow-based state-of-the-art.