论文标题
与Kumquat的平行UNIX命令和管道的自动合成
Automatic Synthesis of Parallel Unix Commands and Pipelines with KumQuat
论文作者
论文摘要
我们提出Kumquat,这是一种用于自动生成UNIX Shell命令和管道的数据并行实现的系统。生成的并行版本拆分输入流,执行原始管道命令的多个实例化以并行处理拆分,然后将所得的并行输出组合以产生最终的输出流。 Kumquat会自动合成组合运算符,而特定于域的组合语言是一种强大的常规化器,可促进对正确组合者的有效推断。 我们在70个基准脚本上评估了Kumquat,共同拥有427个阶段。 Kumquat合成了这些基准脚本中出现的121个唯一命令中的113个正确组合程序。合成时间在39秒至331秒之间,中位数为60秒。我们提出了实验结果,这些结果表明这些组合能够有效地平行我们的基准脚本。
We present KumQuat, a system for automatically generating data parallel implementations of Unix shell commands and pipelines. The generated parallel versions split input streams, execute multiple instantiations of the original pipeline commands to process the splits in parallel, then combine the resulting parallel outputs to produce the final output stream. KumQuat automatically synthesizes the combine operators, with a domain-specific combiner language acting as a strong regularizer that promotes efficient inference of correct combiners. We evaluate KumQuat on 70 benchmark scripts that together have a total of 427 stages. KumQuat synthesizes a correct combiner for 113 of the 121 unique commands that appear in these benchmark scripts. The synthesis times vary between 39 seconds and 331 seconds with a median of 60 seconds. We present experimental results that show that these combiners enable the effective parallelization of our benchmark scripts.