Paper title
University of Cape Town's WMT22 System: Multilingual Machine Translation for Southern African Languages
Authors
Abstract
This paper describes the University of Cape Town's submission to the constrained track of the WMT22 Shared Task: Large-Scale Machine Translation Evaluation for African Languages. Our system is a single multilingual translation model that translates between English and 8 South / South East African languages, as well as between specific pairs of the African languages. We used several techniques suited to low-resource machine translation (MT), including overlap BPE, back-translation, synthetic training data generation, and adding more translation directions during training. Our results show the value of these techniques, especially for directions where very little or no bilingual training data is available.