论文标题
了解跨语言嵌入映射的线性性
Understanding Linearity of Cross-Lingual Word Embedding Mappings
论文作者
论文摘要
跨语言嵌入(CLWE)的技术在应对低资源语言的自然语言处理挑战方面发挥了基本作用。它的主要方法假定嵌入之间的关系可以用线性映射来表示,但是没有探索该假设所存在的条件。这种研究差距最近变得非常重要,因为已经证明,在某些情况下,放松映射的非线性映射会导致更好的性能。我们首次提出了一个理论分析,该分析确定了单词嵌入中编码的类比的保存,这是在这些嵌入之间的地面clwe映射的必要条件,是线性的。在一个涵盖十二种不同语言的五个代表性类比类别的新型跨语性类比数据集中,我们进行了实验,为我们的理论主张提供直接的经验支持。这些结果提供了对其他研究人员的观察结果的更多见解,并为制定更有效的跨语性代表性学习策略做出了贡献。
The technique of Cross-Lingual Word Embedding (CLWE) plays a fundamental role in tackling Natural Language Processing challenges for low-resource languages. Its dominant approaches assumed that the relationship between embeddings could be represented by a linear mapping, but there has been no exploration of the conditions under which this assumption holds. Such a research gap becomes very critical recently, as it has been evidenced that relaxing mappings to be non-linear can lead to better performance in some cases. We, for the first time, present a theoretical analysis that identifies the preservation of analogies encoded in monolingual word embeddings as a necessary and sufficient condition for the ground-truth CLWE mapping between those embeddings to be linear. On a novel cross-lingual analogy dataset that covers five representative analogy categories for twelve distinct languages, we carry out experiments which provide direct empirical support for our theoretical claim. These results offer additional insight into the observations of other researchers and contribute inspiration for the development of more effective cross-lingual representation learning strategies.