论文标题
联合方法分析联系中心域中的语音对话
A combined approach to the analysis of speech conversations in a contact center domain
论文作者
论文摘要
如今,在客户数据中更准确地搜索对客户数据的深入分析是一种非常强大的技术趋势,对私人和上市公司都非常有吸引力。在接触中心领域中尤其如此,在该联络中心领域中,语音分析是一种从非结构化数据中获得见解的一种非常有力的方法,这些方法来自客户和人类代理人的对话。在这项工作中,我们描述了意大利联络中心的语音分析过程的实验,该实验涉及从入站或出口流中提取的呼叫记录。首先,我们详细说明了基于Kaldi框架的内部语音到文本解决方案的开发,并评估其性能(并将其与Google Cloud Cloud Speech API进行比较)。然后,我们评估并比较了呼叫转录本的语义标签的不同方法,从经典正则表达式到基于Ngrams和Logistic回归的机器学习模型,并提出了它们的组合,这表明可以提供一致的好处。最后,将诱导者(称为J48S)应用于标记问题。这种算法本质上能够利用顺序数据,例如文本,以进行分类。将解决方案与其他方法进行比较,并显示出提供竞争性的分类性能,同时生成高度可解释的模型并降低数据制备阶段的复杂性。彻底研究了整个过程的潜在操作影响。
The ever more accurate search for deep analysis in customer data is a really strong technological trend nowadays, quite appealing to both private and public companies. This is particularly true in the contact center domain, where speech analytics is an extremely powerful methodology for gaining insights from unstructured data, coming from customer and human agent conversations. In this work, we describe an experimentation with a speech analytics process for an Italian contact center, that deals with call recordings extracted from inbound or outbound flows. First, we illustrate in detail the development of an in-house speech-to-text solution, based on Kaldi framework, and evaluate its performance (and compare it to Google Cloud Speech API). Then, we evaluate and compare different approaches to the semantic tagging of call transcripts, ranging from classic regular expressions to machine learning models based on ngrams and logistic regression, and propose a combination of them, which is shown to provide a consistent benefit. Finally, a decision tree inducer, called J48S, is applied to the problem of tagging. Such an algorithm is natively capable of exploiting sequential data, such as texts, for classification purposes. The solution is compared with the other approaches and is shown to provide competitive classification performances, while generating highly interpretable models and reducing the complexity of the data preparation phase. The potential operational impact of the whole process is thoroughly examined.