Paper title
Performing live time-traversal queries via SPARQL on RDF datasets
Paper authors
Abstract
This article introduces a methodology for performing live time-traversal SPARQL queries on RDF datasets, together with software based on this methodology that offers a solution for managing the provenance and change tracking of entities described using RDF. Provenance and change tracking are crucial factors in ensuring verifiability and trust. Nevertheless, some of the most prominent knowledge bases, including DBpedia, Wikidata, Yago, and the Dynamic Linked Data Observatory, do not support time-agnostic queries, i.e., queries across different snapshots together with provenance information. The OpenCitations Data Model (OCDM) describes one possible way to track provenance and entities' changes in RDF datasets, and it allows restoring an entity to a specific status in time (i.e., a snapshot) by applying SPARQL update queries. The methodology and library presented in this article are based on the rationale introduced in the OCDM. We also developed benchmarks showing that such a procedure is efficient for specific queries and less efficient for others. To the best of our knowledge, our library is the only one to support all the time-related retrieval functionalities live, i.e., enabling real-time searches and updates. Moreover, since OCDM complies with standard RDF, queries are expressed via standard SPARQL.
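The snapshot-restoration idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' library or its API. It assumes that each snapshot records the triples added and deleted by the update that produced it, and that an earlier state is recovered by undoing, from newest to oldest, every update made after the requested time. The names `Snapshot`, `restore`, and the `ex:` identifiers are hypothetical, and plain Python sets stand in for an RDF store and for SPARQL update queries.

```python
from dataclasses import dataclass
from typing import FrozenSet, List, Set, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


@dataclass(frozen=True)
class Snapshot:
    """Hypothetical provenance record: when a change happened and its delta."""
    generated_at: str  # ISO 8601 timestamp; same format and timezone assumed
    added: FrozenSet[Triple] = frozenset()
    deleted: FrozenSet[Triple] = frozenset()


def restore(current: Set[Triple], snapshots: List[Snapshot], at: str) -> Set[Triple]:
    """Return the entity's triples as they were at time `at`.

    Starts from the current state and inverts every update generated
    after `at`, newest first: added triples are removed and deleted
    triples are put back.
    """
    state = set(current)
    for snap in sorted(snapshots, key=lambda s: s.generated_at, reverse=True):
        if snap.generated_at <= at:
            break  # this update and all older ones were already in effect
        state -= snap.added
        state |= snap.deleted
    return state


# Illustrative data: one entity, created and then modified twice.
OLD = ("ex:br/1", "dcterms:title", "Old title")
NEW = ("ex:br/1", "dcterms:title", "New title")
AUTHOR = ("ex:br/1", "dcterms:creator", "ex:ra/1")

history = [
    Snapshot("2021-01-01T00:00:00", added=frozenset({OLD})),
    Snapshot("2021-06-01T00:00:00", added=frozenset({NEW}), deleted=frozenset({OLD})),
    Snapshot("2022-01-01T00:00:00", added=frozenset({AUTHOR})),
]
current_state = {NEW, AUTHOR}

state_in_march_2021 = restore(current_state, history, "2021-03-01T00:00:00")
```

In this sketch the cost of a restoration grows with the number of updates made after the requested time, which is one plausible reason why some time-traversal queries are cheaper than others. In a real deployment the deltas would be expressed as SPARQL update operations and inverted by swapping their insert and delete parts.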