论文标题
文化制图用单词嵌入
Cultural Cartography with Word Embeddings
论文作者
论文摘要
在文本的形式分析中,使用关键字的频率是一种经典的方法,但在单词含义的关系中掩盖了掩饰。单词嵌入模型通过构建标准化且连续的“含义空间”来克服这个问题,在该单词是根据与其他单词相似性的相似性根据自然语言样本中使用的方式分配的位置。我们展示了单词嵌入如何与社会学中的意义上的普遍理论相称,并可以通过两种导航来完成解释的任务。首先,人们可以保持术语恒定并测量嵌入空间如何在它们周围移动 - 就像天文学家通过季节测量了天体的变化。其次,还可以保持嵌入空间的恒定,并查看文档或作者如何相对于它移动 - 只是在船只在给定夜晚使用星星来确定其位置。使用美国移民话语的经验案例,我们证明了这两种广泛的策略的优点,以推进文化理论中的重要主题,包括社会标记,媒体领域,回声宗教庭,文化扩散以及更广泛的变化。
Using the frequency of keywords is a classic approach in the formal analysis of text, but has the drawback of glossing over the relationality of word meanings. Word embedding models overcome this problem by constructing a standardized and continuous "meaning space" where words are assigned a location based on relations of similarity to other words based on how they are used in natural language samples. We show how word embeddings are commensurate with prevailing theories of meaning in sociology and can be put to the task of interpretation via two kinds of navigation. First, one can hold terms constant and measure how the embedding space moves around them--much like astronomers measured the changing of celestial bodies with the seasons. Second, one can also hold the embedding space constant and see how documents or authors move relative to it--just as ships use the stars on a given night to determine their location. Using the empirical case of immigration discourse in the United States, we demonstrate the merits of these two broad strategies for advancing important topics in cultural theory, including social marking, media fields, echo chambers, and cultural diffusion and change more broadly.