论文标题
音调复杂性及其权衡
Phonotactic Complexity and its Trade-offs
论文作者
论文摘要
我们提出了计算语音复杂性度量的方法 - 每个音素的位 - 允许直接的跨语言比较。当给出一个单词时,表示为音素段的序列,例如国际语音字母中的符号,以及在语言中对单词类型样本进行训练的统计模型,我们可以使用该单词在模型下的单词的负面概要性大致测量每个音素的位。这种简单的措施使我们能够比较跨语言的熵,从而深入了解语言的语音学有多复杂。使用106种语言的1016个基本概念词的集合,我们在每个音素和平均单词长度之间表明了-0.74的非常强的负相关。
We present methods for calculating a measure of phonotactic complexity---bits per phoneme---that permits a straightforward cross-linguistic comparison. When given a word, represented as a sequence of phonemic segments such as symbols in the international phonetic alphabet, and a statistical model trained on a sample of word types from the language, we can approximately measure bits per phoneme using the negative log-probability of that word under the model. This simple measure allows us to compare the entropy across languages, giving insight into how complex a language's phonotactics are. Using a collection of 1016 basic concept words across 106 languages, we demonstrate a very strong negative correlation of -0.74 between bits per phoneme and the average length of words.