论文标题
俄罗斯的临床文本挖掘的自动拼写校正
Automated Spelling Correction for Clinical Text Mining in Russian
论文作者
论文摘要
本文的主要目的是为俄罗斯的临床文本开发一个咒语检查器模块。所描述的方法将弦距离测量算法与机器学习嵌入方法的技术结合在一起。我们的总体精度为0.86,词汇精度-0.975,误差精度为0.74。我们开发了拼写检查器,作为医学文本挖掘工具的一部分,内容涉及拼写,否定,经验者和时间性检测的问题。
The main goal of this paper is to develop a spell checker module for clinical text in Russian. The described approach combines string distance measure algorithms with technics of machine learning embedding methods. Our overall precision is 0.86, lexical precision - 0.975 and error precision is 0.74. We develop spell checker as a part of medical text mining tool regarding the problems of misspelling, negation, experiencer and temporality detection.