论文标题
数十年来NLP的代码转换研究进展:对趋势和挑战的系统调查
The Decades Progress on Code-Switching Research in NLP: A Systematic Survey on Trends and Challenges
论文作者
论文摘要
数十年来,自然语言处理(NLP)研究社区已经研究了代码转换是书面文本和对话中的一种常见现象。最初,通过利用语言理论以及目前更加面向机器学习的方法来开发模型,可以深入探讨代码转换。我们介绍了一项有关自然语言处理中代码转换研究的全面系统调查,以了解过去几十年的进步,并概念化代码转换主题的挑战和任务。最后,我们总结了趋势和发现,并以讨论未来的方向讨论,并开放问题以进行进一步调查。
Code-Switching, a common phenomenon in written text and conversation, has been studied over decades by the natural language processing (NLP) research community. Initially, code-switching is intensively explored by leveraging linguistic theories and, currently, more machine-learning oriented approaches to develop models. We introduce a comprehensive systematic survey on code-switching research in natural language processing to understand the progress of the past decades and conceptualize the challenges and tasks on the code-switching topic. Finally, we summarize the trends and findings and conclude with a discussion for future direction and open questions for further investigation.