论文标题
EasyTransfer- NLP应用程序的简单且可扩展的深度转移学习平台
EasyTransfer -- A Simple and Scalable Deep Transfer Learning Platform for NLP Applications
论文作者
论文摘要
文献见证了利用预训练的语言模型(PLM)和转移学习(TL)算法的成功,以实现广泛的自然语言处理(NLP)应用程序,但是为此目的构建易于使用且可扩展的TL工具包并不容易。为了弥合这一差距,EasyTransfer平台旨在为NLP应用程序开发深层TL算法。 EasyTransfer提供了高性能和可扩展的引擎,可进行有效的培训和推理,还集成了全面的深层TL算法,以使工业规模的TL应用程序的开发更加容易。在EasyTransfer中,内置数据和模型并行策略与AI编译器优化相结合,显示比分布式培训的社区版本快4.0倍。 EasyTransfer支持ModelZoo中的各种NLP模型,包括主流PLM和多模式模型。它还具有各种内部开发的TL算法以及用于NLP应用程序的Appzoo。该工具包方便用户快速开始模型培训,评估和在线部署。 EasyTransfer目前已在阿里巴巴部署,以支持各种业务方案,包括项目建议,个性化搜索,对话性问题答案等。对现实世界数据集和在线应用程序进行了广泛的实验表明,EasyTransfer适用于在线生产,具有各种应用程序的最先进性能。 EasyTransfer的源代码在GitHub(https://github.com/alibaba/easytransfer)上发布。
The literature has witnessed the success of leveraging Pre-trained Language Models (PLMs) and Transfer Learning (TL) algorithms to a wide range of Natural Language Processing (NLP) applications, yet it is not easy to build an easy-to-use and scalable TL toolkit for this purpose. To bridge this gap, the EasyTransfer platform is designed to develop deep TL algorithms for NLP applications. EasyTransfer is backended with a high-performance and scalable engine for efficient training and inference, and also integrates comprehensive deep TL algorithms, to make the development of industrial-scale TL applications easier. In EasyTransfer, the built-in data and model parallelism strategies, combined with AI compiler optimization, show to be 4.0x faster than the community version of distributed training. EasyTransfer supports various NLP models in the ModelZoo, including mainstream PLMs and multi-modality models. It also features various in-house developed TL algorithms, together with the AppZoo for NLP applications. The toolkit is convenient for users to quickly start model training, evaluation, and online deployment. EasyTransfer is currently deployed at Alibaba to support a variety of business scenarios, including item recommendation, personalized search, conversational question answering, etc. Extensive experiments on real-world datasets and online applications show that EasyTransfer is suitable for online production with cutting-edge performance for various applications. The source code of EasyTransfer is released at Github (https://github.com/alibaba/EasyTransfer).