雅各布人和笛卡尔差异类别的梯度

论文标题

雅各布人和笛卡尔差异类别的梯度

Jacobians and Gradients for Cartesian Differential Categories

论文作者

Lemay, Jean-Simon Pacaud

论文摘要

笛卡尔差异类别配备了一个差分组合器，该组合器正式从多变量演算中形式化了定向衍生物。笛卡尔差异类别提供了差分lambda-calculus的分类语义，并在因果计算，增量计算，游戏理论，可区分编程和机器学习中找到了应用。最近，人们渴望在笛卡尔差异类别中提供（无坐标）对雅各布人和梯度的表征。人们的第一个尝试可能是考虑笛卡尔封闭的笛卡尔差异类别，例如差异lambda-calculus的模型，然后采取衍生物的咖喱。不幸的是，这种方法不包括笛卡尔差异类别的许多重要示例，例如真实平滑功能的类别。在本文中，我们介绍了线性封闭的笛卡尔差异类别，这些类别是具有线性图的内部HOM，双线性评估图以及在第二个论点中是线性的咖喱图的能力。因此，地图的雅各比式定义为其衍生物的咖喱。笛卡尔差异类别的许多著名示例是线性封闭的，尤其是实际平滑函数的类别。我们还解释了笛卡尔封闭的差异类别是如何在且仅当内部HOM拆分上的某些线性愿望时线性封闭的。为了定义地图的梯度，必须能够定义雅各布的转置，这可以在笛卡尔反向差分类别中进行。因此，我们将映射的梯度定义为其反向衍生物的咖喱，并表明这等于其雅各布的转置。我们还解释了线性封闭的笛卡尔反向差异类别如何精确地是一个线性封闭的笛卡尔差异类别，并具有适当的转置概念。

Cartesian differential categories come equipped with a differential combinator that formalizes the directional derivative from multivariable calculus. Cartesian differential categories provide a categorical semantics of the differential lambda-calculus and have also found applications in causal computation, incremental computation, game theory, differentiable programming, and machine learning. There has recently been a desire to provide a (coordinate-free) characterization of Jacobians and gradients in Cartesian differential categories. One's first attempt might be to consider Cartesian differential categories which are Cartesian closed, such as models of the differential lambda-calculus, and then take the curry of the derivative. Unfortunately, this approach excludes numerous important examples of Cartesian differential categories such as the category of real smooth functions. In this paper, we introduce linearly closed Cartesian differential categories, which are Cartesian differential categories that have an internal hom of linear maps, a bilinear evaluation map, and the ability to curry maps which are linear in their second argument. As such, the Jacobian of a map is defined as the curry of its derivative. Many well-known examples of Cartesian differential categories are linearly closed, such as, in particular, the category of real smooth functions. We also explain how a Cartesian closed differential category is linearly closed if and only if a certain linear idempotent on the internal hom splits. To define the gradient of a map, one must be able to define the transpose of the Jacobian, which can be done in a Cartesian reverse differential category. Thus, we define the gradient of a map to be the curry of its reverse derivative and show this equals the transpose of its Jacobian. We also explain how a linearly closed Cartesian reverse differential category is precisely a linearly closed Cartesian differential category with an appropriate notion of transpose.

下载PDF全文

下载文献需遵守相关版权规定

论文标题