论文标题
Globus自动化服务:整个时空连续性的研究过程自动化
Globus Automation Services: Research process automation across the space-time continuum
论文作者
论文摘要
研究过程自动化 - 对科学工具,计算机,数据存储和其他资源的可靠,高效且可重复执行的可靠,高效和可重复执行 - 已成为现代科学的基本要素。我们在此处报告了Globus研究数据管理平台内的新服务,该服务可以将各种研究过程的规范作为可重复使用的动作集,\ emph {flow},以及在异质研究环境中执行此类流量的情况。为了支持广泛的空间范围(例如,从科学仪器到远程数据中心)和时间范围(从几秒钟到几周),这些Globus自动化服务具有:1)尽管零星失败,但可靠地执行云托管,以可靠地执行长期的流量; 2)简单的规范和可扩展的异步动作提供商API,用于定义和执行各种涉及异构资源的动作和流动; 3)事件驱动的执行模型,用于自动执行对任意事件的流量执行; 4)一个丰富的安全模型,实现了授权授权机制,以安全地执行跨分布式资源的长期行动。这些服务允许研究人员将广泛的研究任务的管理外包和自动化为可靠,可扩展和安全的云平台。我们介绍了Globus自动化服务的用例,描述其设计和实施,目前的微观计算研究以及在一系列应用程序中应用服务的经验。
Research process automation -- the reliable, efficient, and reproducible execution of linked sets of actions on scientific instruments, computers, data stores, and other resources -- has emerged as an essential element of modern science. We report here on new services within the Globus research data management platform that enable the specification of diverse research processes as reusable sets of actions, \emph{flows}, and the execution of such flows in heterogeneous research environments. To support flows with broad spatial extent (e.g., from scientific instrument to remote data center) and temporal extent (from seconds to weeks), these Globus automation services feature: 1) cloud hosting for reliable execution of even long-lived flows despite sporadic failures; 2) a simple specification and extensible asynchronous action provider API, for defining and executing a wide variety of actions and flows involving heterogeneous resources; 3) an event-driven execution model for automating execution of flows in response to arbitrary events; and 4) a rich security model enabling authorization delegation mechanisms for secure execution of long-running actions across distributed resources. These services permit researchers to outsource and automate the management of a broad range of research tasks to a reliable, scalable, and secure cloud platform. We present use cases for Globus automation services, describe their design and implementation, present microbenchmark studies, and review experiences applying the services in a range of applications.