论文标题
Word2Vec猜想和限制结果
Word2vec Conjecture and A Limitative Result
论文作者
论文摘要
受到\ texttt {word2vec} \ citep {mikolov2013distributed}的成功的启发,我们在捕获类比时研究了猜想,即类似关系可以由向量空间表示。与许多以前的作品不同,这些作品着重于\ texttt {word2vec}的分布语义方面,我们研究了纯粹的\ emph {代表性}问题:\ emph {ash all}语义单词关系可以用向量的差异(或指向)表示吗?我们将其称为“ 2VEC猜想”,并指出其一些理想的含义。但是,我们将展示一类无法以这种方式表示的关系,从而伪造了猜想,并确立了矢量空间在特征0(例如真实或复杂数字)上通过向量空间代表语义关系的限制结果。
Being inspired by the success of \texttt{word2vec} \citep{mikolov2013distributed} in capturing analogies, we study the conjecture that analogical relations can be represented by vector spaces. Unlike many previous works that focus on the distributional semantic aspect of \texttt{word2vec}, we study the purely \emph{representational} question: can \emph{all} semantic word-word relations be represented by differences (or directions) of vectors? We call this the word2vec conjecture and point out some of its desirable implications. However, we will exhibit a class of relations that cannot be represented in this way, thus falsifying the conjecture and establishing a limitative result for the representability of semantic relations by vector spaces over fields of characteristic 0, e.g., real or complex numbers.