TY - JOUR
T1 - Scalable Adaptive Optimizations for Stream-based Workflows in Multi-HPC-Clusters and Cloud Infrastructures
AU - Liang, Liang
AU - Filgueira, Rosa
AU - Yan, Yan
AU - Heinis, Thomas
N1 - Funding Information:
This work is partially supported by the EU H2020 project DARE , No. 777413 ; and by Google Cloud Platform research credits program .
Publisher Copyright:
© 2021 Elsevier B.V.
PY - 2022/3
Y1 - 2022/3
N2 - This work presents three new adaptive optimization techniques to maximize the performance of dispel4py workflows. dispel4py is a parallel Python-based stream-oriented dataflow framework that acts as a bridge to existing parallel programming frameworks like MPI or Python multiprocessing. When a user runs a dispel4py workflow, the original framework performs a fixed workload distribution among the processes available for the run. This allocation does not take into account the features of the workflows, which can cause scalability issues, especially for data-intensive scientific workflows. Our aim, therefore, is to improve the performance of dispel4py workflows by testing different workload strategies that automatically adapt to workflows at runtime. For achieving this objective, we have implemented three new techniques, called Naive Assignment, Staging and Dynamic Scheduling. We have evaluated our proposal with several workflows from different domains and across different computing resources. The results show that our proposed techniques have significantly (up to 10X) improved the performance of the original dispel4py framework.
AB - This work presents three new adaptive optimization techniques to maximize the performance of dispel4py workflows. dispel4py is a parallel Python-based stream-oriented dataflow framework that acts as a bridge to existing parallel programming frameworks like MPI or Python multiprocessing. When a user runs a dispel4py workflow, the original framework performs a fixed workload distribution among the processes available for the run. This allocation does not take into account the features of the workflows, which can cause scalability issues, especially for data-intensive scientific workflows. Our aim, therefore, is to improve the performance of dispel4py workflows by testing different workload strategies that automatically adapt to workflows at runtime. For achieving this objective, we have implemented three new techniques, called Naive Assignment, Staging and Dynamic Scheduling. We have evaluated our proposal with several workflows from different domains and across different computing resources. The results show that our proposed techniques have significantly (up to 10X) improved the performance of the original dispel4py framework.
KW - Scientific workflow
KW - Stream-based workflow
KW - Workflow optimization
KW - dispel4py
KW - distributed systems
UR - http://www.scopus.com/inward/record.url?scp=85118733552&partnerID=8YFLogxK
U2 - 10.1016/j.future.2021.09.036
DO - 10.1016/j.future.2021.09.036
M3 - Article
SN - 0167-739X
VL - 128
SP - 102
EP - 116
JO - Future Generation Computer Systems
JF - Future Generation Computer Systems
ER -