论文标题

数十年来NLP的代码转换研究进展:对趋势和挑战的系统调查

The Decades Progress on Code-Switching Research in NLP: A Systematic Survey on Trends and Challenges

论文作者

Winata, Genta Indra, Aji, Alham Fikri, Yong, Zheng-Xin, Solorio, Thamar

论文摘要

数十年来,自然语言处理(NLP)研究社区已经研究了代码转换是书面文本和对话中的一种常见现象。最初,通过利用语言理论以及目前更加面向机器学习的方法来开发模型,可以深入探讨代码转换。我们介绍了一项有关自然语言处理中代码转换研究的全面系统调查,以了解过去几十年的进步,并概念化代码转换主题的挑战和任务。最后,我们总结了趋势和发现,并以讨论未来的方向讨论,并开放问题以进行进一步调查。

Code-Switching, a common phenomenon in written text and conversation, has been studied over decades by the natural language processing (NLP) research community. Initially, code-switching is intensively explored by leveraging linguistic theories and, currently, more machine-learning oriented approaches to develop models. We introduce a comprehensive systematic survey on code-switching research in natural language processing to understand the progress of the past decades and conceptualize the challenges and tasks on the code-switching topic. Finally, we summarize the trends and findings and conclude with a discussion for future direction and open questions for further investigation.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源