论文标题
Rigoberta:西班牙语的最先进的语言模型
RigoBERTa: A State-of-the-Art Language Model For Spanish
论文作者
论文摘要
本文介绍了Rigoberta,这是西班牙语的最先进的语言模型。 Rigoberta接受了由不同的Subcorpora形成的精心策划的语料库训练,并具有关键特征。它遵循Deberta建筑,该建筑与其他与Bert或Roberta相似的架构具有多个优势。与其他可用的西班牙语模型相比,对13个NLU任务进行了评估Rigoberta的性能,即Maria,Bertin和Beto。里戈伯塔(Rigoberta)在13个任务中的10个任务中的10个模型都优于实现新的“最先进”结果。
This paper presents RigoBERTa, a State-of-the-Art Language Model for Spanish. RigoBERTa is trained over a well-curated corpus formed up from different subcorpora with key features. It follows the DeBERTa architecture, which has several advantages over other architectures of similar size as BERT or RoBERTa. RigoBERTa performance is assessed over 13 NLU tasks in comparison with other available Spanish language models, namely, MarIA, BERTIN and BETO. RigoBERTa outperformed the three models in 10 out of the 13 tasks, achieving new "State-of-the-Art" results.