Paper title
Towards Supervised and Unsupervised Neural Machine Translation Baselines for Nigerian Pidgin
Authors
Abstract
Nigerian Pidgin is arguably the most widely spoken language in Nigeria. Variants of this language are also spoken across West and Central Africa, making it a very important language. This work aims to establish supervised and unsupervised neural machine translation (NMT) baselines between English and Nigerian Pidgin. We implement and compare NMT models with different tokenization methods, creating a solid foundation for future work.