论文标题
挖掘因子动物园:具有足够代理的潜在因子模型的估计
Mining the Factor Zoo: Estimation of Latent Factor Models with Sufficient Proxies
论文作者
论文摘要
潜在因子模型估计通常依赖于使用域知识来手动选择几个观察到的协变量作为因子代理,或者纯粹进行多变量分析(例如主成分分析)。但是,以前的方法可能会遇到偏见,而后者无法纳入其他信息。我们建议桥接这两种方法,同时允许代理的数量差异,从而使潜在因子模型估计稳健,灵活和统计学上更准确。作为奖励,还允许许多因素增长。我们方法的核心是惩罚的减少等级回归以结合信息。为了进一步处理重尾数据,提出了一种计算有吸引力的牢固降低等级回归方法。与基准相比,我们建立了更快的收敛速度。广泛的模拟和真实示例用于说明优势。
Latent factor model estimation typically relies on either using domain knowledge to manually pick several observed covariates as factor proxies, or purely conducting multivariate analysis such as principal component analysis. However, the former approach may suffer from the bias while the latter can not incorporate additional information. We propose to bridge these two approaches while allowing the number of factor proxies to diverge, and hence make the latent factor model estimation robust, flexible, and statistically more accurate. As a bonus, the number of factors is also allowed to grow. At the heart of our method is a penalized reduced rank regression to combine information. To further deal with heavy-tailed data, a computationally attractive penalized robust reduced rank regression method is proposed. We establish faster rates of convergence compared with the benchmark. Extensive simulations and real examples are used to illustrate the advantages.