论文标题
同时选择和融合其水平的刑法回归
Simultaneous Factors Selection and Fusion of Their Levels in Penalized Logistic Regression
论文作者
论文摘要
如今,几个数据分析问题需要降低复杂性,主要意味着它们是针对从模型中删除非影响的协变量并提供稀疏模型的。当存在分类协变量(其级别为虚拟编码)时,模型中包含的参数数量会迅速增长,这强调了减少要估计的参数数量的需求。在这种情况下,除了可变选择之外,还通过融合的协变量来实现稀疏性,这些协变量在对响应变量的影响方面没有显着区分。在这项工作中,引入了一种新的正则化技术,称为$ l_ {0} $ - Fused Group lasso($ l_ {0} $ - FGL)用于二进制逻辑回归。它使用小组套索惩罚进行因子选择,对于融合部分,它应用了$ l_ {0} $惩罚,以对分类预测变量的级别参数之间的差异。使用自适应权重,派生了$ l_ {0} $的自适应版本。在某些条件下建立了理论属性,例如存在,$ \ sqrt {n} $一致性和Oracle属性。此外,也表明,即使在不同的情况下,参数数量$ p_ {n} $随着样本大小$ n $的生长而生长,$ \ sqrt {n} $一致性和变量选择结果中的一致性。开发和实施了两种计算方法,PIRL和一种使用准牛顿的块坐标下降(BCD)方法。一项仿真研究支持$ l_ {0} $ - FGL显示出出色的性能,尤其是在高维情况下。
Nowadays, several data analysis problems require for complexity reduction, mainly meaning that they target at removing the non-influential covariates from the model and at delivering a sparse model. When categorical covariates are present, with their levels being dummy coded, the number of parameters included in the model grows rapidly, fact that emphasizes the need for reducing the number of parameters to be estimated. In this case, beyond variable selection, sparsity is also achieved through fusion of levels of covariates which do not differentiate significantly in terms of their influence on the response variable. In this work a new regularization technique is introduced, called $L_{0}$-Fused Group Lasso ($L_{0}$-FGL) for binary logistic regression. It uses a group lasso penalty for factor selection and for the fusion part it applies an $L_{0}$ penalty on the differences among the levels' parameters of a categorical predictor. Using adaptive weights, the adaptive version of $L_{0}$-FGL method is derived. Theoretical properties, such as the existence, $\sqrt{n}$ consistency and oracle properties under certain conditions, are established. In addition, it is shown that even in the diverging case where the number of parameters $p_{n}$ grows with the sample size $n$, $\sqrt{n}$ consistency and a consistency in variable selection result are achieved. Two computational methods, PIRLS and a block coordinate descent (BCD) approach using quasi Newton, are developed and implemented. A simulation study supports that $L_{0}$-FGL shows an outstanding performance, especially in the high dimensional case.