论文标题

科学工作流管理系统的中间数据驱动方法,以支持可重用性

An Intermediate Data-driven Methodology for Scientific Workflow Management System to Support Reusability

论文作者

Chakroborti, Debasish

论文摘要

首先,在本文中,我们为SWFM提出了一个中间数据管理方案。在我们的第二次尝试中,我们探索了可能性,并引入了一种自动推荐技术,用于从现实世界中工作流数据(即银河[1]工作流程)中的SWFM,我们的研究表明,该提议的技术可以通过将SWFM促进51%的工作流程构建,从而通过降低了74%的工作流程中的工作流程中的工作流程和工作流量为swfms a swfms a swfms a swfms a swfms a swfms a a swfms a perflow apefts a plosefings a posseffter的apefts a的工作流程的。后来,我们通过考虑SWFM中的工具状态来提出一种自适应版本,该工具显示了工作流程的40%可重复使用。因此,在我们的第四项研究中,我们进行了几项实验,以分析性能并探索SWFMS在各种环境中的有效性。介绍了该技术是为了强调降低成本,提高数据可重复性和更快的工作流程执行,这是我们的最佳知识,这是同类产品中的第一个。本文介绍了该技术的详细体系结构和评估。我们认为,我们的发现和开发系统将对SWFMS的研究领域产生重大贡献。

In this thesis first we propose an intermediate data management scheme for a SWfMS. In our second attempt, we explored the possibilities and introduced an automatic recommendation technique for a SWfMS from real-world workflow data (i.e Galaxy [1] workflows) where our investigations show that the proposed technique can facilitate 51% of workflow building in a SWfMS by reusing intermediate data of previous workflows and can reduce 74% execution time of workflow buildings in a SWfMS. Later we propose an adaptive version of our technique by considering the states of tools in a SWfMS, which shows around 40% reusability for workflows. Consequently, in our fourth study, We have done several experiments for analyzing the performance and exploring the effectiveness of the technique in a SWfMS for various environments. The technique is introduced to emphasize on storing cost reduction, increase data reusability, and faster workflow execution, to the best of our knowledge, which is the first of its kind. Detail architecture and evaluation of the technique are presented in this thesis. We believe our findings and developed system will contribute significantly to the research domain of SWfMSs.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源