欧几里得偏好模型中的错误

论文标题

欧几里得偏好模型中的错误

Error in the Euclidean Preference Model

论文作者

Thorburn, Luke, Polukarov, Maria, Ventre, Carmine

论文摘要

许多深度学习和多基因系统（包括推荐系统）以矢量嵌入形式的偏好空间模型学习。通常，假定这些模型近似于欧几里得结构，在该结构中，单个模型更喜欢替代定位于其“理想点”，如欧几里得指标所测量的那样。但是，Bogomolnaia and Laslier（2007）表明，如果欧几里得空间的维度少于个人或替代方案，则存在该结构的顺序偏好曲线。我们扩展了这一结果，表明在某些情况下，几乎所有的首选项都不能用欧几里得模型表示，并在使用欧几里得模型近似于非欧盟的偏好概况时，在预期误差上得出了理论下限。我们的结果对向量嵌入的解释和使用有影响，因为在某些情况下，只有在嵌入的维度占所代表的实体数量的很大一部分时，才能期望任意的真实关系。

Spatial models of preference, in the form of vector embeddings, are learned by many deep learning and multiagent systems, including recommender systems. Often these models are assumed to approximate a Euclidean structure, where an individual prefers alternatives positioned closer to their "ideal point", as measured by the Euclidean metric. However, Bogomolnaia and Laslier (2007) showed that there exist ordinal preference profiles that cannot be represented with this structure if the Euclidean space has two fewer dimensions than there are individuals or alternatives. We extend this result, showing that there are situations in which almost all preference profiles cannot be represented with the Euclidean model, and derive a theoretical lower bound on the expected error when using the Euclidean model to approximate non-Euclidean preference profiles. Our results have implications for the interpretation and use of vector embeddings, because in some cases close approximation of arbitrary, true ordinal relationships can be expected only if the dimensionality of the embeddings is a substantial fraction of the number of entities represented.

下载PDF全文

下载文献需遵守相关版权规定

论文标题