Paper Title
Tropical Data Science
Authors
Abstract
Phylogenomics is a new field that applies tools from phylogenetics to genome data. With new technologies and an increasing amount of data, we face new challenges in analyzing these data over a space of phylogenetic trees. Because a space of phylogenetic trees with a fixed set of leaf labels is not Euclidean, we cannot simply apply standard tools from data science. In this paper we survey some recent developments in machine learning models that use tropical geometry to analyze a set of phylogenetic trees over a tree space.