论文标题
用于广义并行坐标图的图形框架语法
A grammar of graphics framework for generalized parallel coordinate plots
论文作者
论文摘要
并行坐标图(PCP)是对高维数值数据的探索性数据分析的有用工具。使用分类变量或分类和连续变量的混合时,PCP的使用受到限制。在本文中,我们提出了广义的并行坐标图(GPCP)将PCP的能力从仅数字变量扩展到无缝处理单个图中的分类和数字变量的混合。在此过程中,我们发现仅针对分类值的现有解决方案,例如吊床图或放养品成为新框架中的边缘案例。通过专注于个人观察而不是边际频率,我们获得了额外的灵活性。结果方法是在R软件包GGPCP中实现的。
Parallel coordinate plots (PCP) are a useful tool in exploratory data analysis of high-dimensional numerical data. The use of PCPs is limited when working with categorical variables or a mix of categorical and continuous variables. In this paper, we propose generalized parallel coordinate plots (GPCP) to extend the ability of PCPs from just numeric variables to dealing seamlessly with a mix of categorical and numeric variables in a single plot. In this process we find that existing solutions for categorical values only, such as hammock plots or parsets become edge cases in the new framework. By focusing on individual observation rather a marginal frequency we gain additional flexibility. The resulting approach is implemented in the R package ggpcp.