Paper Title
Mere account mein kitna balance hai? -- On building voice enabled Banking Services for Multilingual Communities
Paper Authors
Paper Abstract
Tremendous progress in speech and language processing has brought language technologies closer to daily human life. Voice technology has the potential to act as a horizontal enabling layer across all aspects of digitization, which is especially beneficial to rural communities in scenarios such as a pandemic. In this work, we present initial exploratory work in one such direction: building voice-enabled banking services for multilingual societies. Speech interaction for typical banking transactions in multilingual communities involves filled pauses and is characterized by code mixing, a phenomenon in which lexical items from one language are embedded in an utterance of another. Speech systems deployed for banking applications should therefore be able to process such content. We investigate various training strategies for building speech-based intent recognition systems and present results using a Naive Bayes classifier on approximate acoustic phone units obtained with the Allosaurus library.
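To make the final step of the abstract concrete, the following is a minimal sketch of a Naive Bayes intent classifier over phone units. It is not the authors' implementation. The phone strings are hand-written, hypothetical stand-ins for what a phone recognizer such as Allosaurus might emit; the intent labels, the unigram-plus-bigram features, and the add-one smoothing are all assumptions chosen for illustration. The audio-to-phone transcription step is omitted so that the example runs without any audio files or external libraries.

```python
import math
from collections import Counter, defaultdict


def phone_ngrams(phones, n=2):
    """Turn a space-separated phone string into unigram and n-gram features."""
    tokens = phones.split()
    grams = list(tokens)
    grams += ["_".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    return grams


class PhoneNaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing over phone n-grams."""

    def fit(self, phone_strings, intents):
        self.class_counts = Counter(intents)
        self.feature_counts = defaultdict(Counter)
        self.vocab = set()
        for phones, intent in zip(phone_strings, intents):
            feats = phone_ngrams(phones)
            self.feature_counts[intent].update(feats)
            self.vocab.update(feats)
        return self

    def predict(self, phones):
        total = sum(self.class_counts.values())
        best_intent, best_score = None, -math.inf
        for intent, count in self.class_counts.items():
            # Log prior plus smoothed log likelihood of each known feature.
            score = math.log(count / total)
            denom = sum(self.feature_counts[intent].values()) + len(self.vocab)
            for feat in phone_ngrams(phones):
                if feat in self.vocab:  # skip features never seen in training
                    score += math.log((self.feature_counts[intent][feat] + 1) / denom)
            if score > best_score:
                best_intent, best_score = intent, score
        return best_intent


# Hypothetical phone transcriptions of code-mixed banking requests
# (illustrative only; not real recognizer output or data from the paper).
train_phones = [
    "m e r e a k a u n t m e k i t n a b a l a n s h e",
    "b a l a n s b a t a o",
    "p e s e b h e j o",
    "r a m k o p e s e t r a n s f e r k a r o",
]
train_intents = ["check_balance", "check_balance", "transfer_money", "transfer_money"]

model = PhoneNaiveBayes().fit(train_phones, train_intents)
print(model.predict("k i t n a b a l a n s"))
print(model.predict("p e s e t r a n s f e r"))
```

Working on phone units rather than words avoids the need for a word-level recognizer that covers both languages of a code-mixed utterance, which is presumably part of the appeal of this setup; a real system would need far more training data than the four toy utterances used here.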