MACFE：基于元学习和因果关系的功能工程框架

论文标题

MACFE：基于元学习和因果关系的功能工程框架

MACFE: A Meta-learning and Causality Based Feature Engineering Framework

论文作者

Reyes-Amezcua, Ivan, Flores-Araiza, Daniel, Ochoa-Ruiz, Gilberto, Mendez-Vazquez, Andres, Rodriguez-Tello, Eduardo

论文摘要

功能工程已成为提高模型预测性能并生产优质数据集的最重要步骤之一。但是，此过程需要非平凡的域知识，涉及耗时的过程。因此，自动化这种过程已成为研究的积极领域，并在工业应用中感兴趣。在本文中，提出了一种新的方法，称为基于元学习和因果关系的特征工程（MACFE）。我们的方法基于使用元学习，特征分布编码和因果关系特征选择。在MacFe中，使用元学习来找到最佳的转换，然后通过预选“原始”功能来加速搜索，鉴于其因果关系的相关性。对流行分类数据集的实验评估表明，MACFE可以提高八个分类器的预测性能，表现平均最高的最新方法至少提高6.54％，并且比最佳先前工作的提高了2.71％。

Feature engineering has become one of the most important steps to improve model prediction performance, and to produce quality datasets. However, this process requires non-trivial domain-knowledge which involves a time-consuming process. Thereby, automating such process has become an active area of research and of interest in industrial applications. In this paper, a novel method, called Meta-learning and Causality Based Feature Engineering (MACFE), is proposed; our method is based on the use of meta-learning, feature distribution encoding, and causality feature selection. In MACFE, meta-learning is used to find the best transformations, then the search is accelerated by pre-selecting "original" features given their causal relevance. Experimental evaluations on popular classification datasets show that MACFE can improve the prediction performance across eight classifiers, outperforms the current state-of-the-art methods in average by at least 6.54%, and obtains an improvement of 2.71% over the best previous works.

下载PDF全文

下载文献需遵守相关版权规定

论文标题