PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit

Paper Title

PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit

Authors

Hui Zhang, Tian Yuan, Junkun Chen, Xintong Li, Renjie Zheng, Yuxin Huang, Xiaojie Chen, Enlei Gong, Zeyu Chen, Xiaoguang Hu, Dianhai Yu, Yanjun Ma, Liang Huang

Abstract

PaddleSpeech is an open-source all-in-one speech toolkit. It aims to facilitate the development and research of speech processing technologies by providing an easy-to-use command-line interface and a simple code structure. This paper describes the design philosophy and core architecture of PaddleSpeech to support several essential speech-to-text and text-to-speech tasks. PaddleSpeech achieves competitive or state-of-the-art performance on various speech datasets and implements the most popular methods. It also provides recipes and pretrained models to quickly reproduce the experimental results in this paper. PaddleSpeech is publicly available at https://github.com/PaddlePaddle/PaddleSpeech.
