论文标题

首先看重复和近乎解说的自我吸引的技术债务评论

A First Look at Duplicate and Near-duplicate Self-admitted Technical Debt Comments

论文作者

Yasmin, Jerin, Sheikhaei, Mohammad Sadegh, Tian, Yuan

论文摘要

自我吸附的技术债务(SATD)是指开发人员故意引入的技术债务,并在代码评论或其他软件工件(例如,发行报告)中明确记录,以注释开发人员在软件开发过程中做出的亚最佳决定。 在这项工作中,我们首先研究了五个流行的Apache OSS项目中的重复和近乎副本SATD评论的存在和特征,即JSPWIKI,Helix,Jackrabbit,Archiva和SystemMl。我们设计了一种方法,可以自动识别复制和近乎简化的SATD注释组,并通过挖掘软件项目的提交历史记录来跟踪其在软件系统中的演变。利用拟议的方法,我们从目标项目中确定了3,520个重复和近乎删除的SATD评论,这些评论属于1,141组。我们手动分析了1,505个SATD注释的样本的内容和上下文(通过为每个项目进行100组采样),并确定它们是否注释了相同的根本原因。我们还调查了代码克隆中是否存在重复的SATD评论,它们是否共存在同一文件中,以及是否同时引入和删除它们。我们的初步研究揭示了一些令人惊讶的发现,这些发现将揭示未来的研究,旨在改善重复的SATD评论的管理。例如,在常规代码克隆中只有48.5%的重复的SATD注释组存在,并且仅在同一提交中引入了33.9%的重复SATD评论对。

Self-admitted technical debt (SATD) refers to technical debt that is intentionally introduced by developers and explicitly documented in code comments or other software artifacts (e.g., issue reports) to annotate sub-optimal decisions made by developers in the software development process. In this work, we take the first look at the existence and characteristics of duplicate and near-duplicate SATD comments in five popular Apache OSS projects, i.e., JSPWiki, Helix, Jackrabbit, Archiva, and SystemML. We design a method to automatically identify groups of duplicate and near-duplicate SATD comments and track their evolution in the software system by mining the commit history of a software project. Leveraging the proposed method, we identified 3,520 duplicate and near-duplicate SATD comments from the target projects, which belong to 1,141 groups. We manually analyze the content and context of a sample of 1,505 SATD comments (by sampling 100 groups for each project) and identify if they annotate the same root cause. We also investigate whether duplicate SATD comments exist in code clones, whether they co-exist in the same file, and whether they are introduced and removed simultaneously. Our preliminary study reveals several surprising findings that would shed light on future studies aiming to improve the management of duplicate SATD comments. For instance, only 48.5% duplicate SATD comment groups with the same root cause exist in regular code clones, and only 33.9% of the duplicate SATD comment pairs are introduced in the same commit.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源