Abstract
We present Asterism, an open source data-intensive framework, which combines the strengths of traditional workflow management systems with new parallel stream-based dataflow systems to run data-intensive applications across multiple heterogeneous resources, without users having to: re-formulate their methods according to different enactment engines; manage the data distribution across systems; parallelize their methods; co-place and schedule their methods with computing resources; and store and transfer large/small volumes of data. We also present the Data-Intensive workflows as a Service (DIaaS) model, which enables easy dataintensive workow composition and deployment on clouds using containers. The feasibility of Asterism and DIaaS model have been evaluated using a real domain application on the NSF-Chameleon cloud. Experimental results shows how Asterism successfully and efficiently exploits combinations of diverse computational platforms, whereas DIaaS delivers specialized software to execute data-intensive applications in a scalable, efficient, and robust way reducing the engineering time and computational cost.
| Original language | English |
|---|---|
| Title of host publication | 2016 Seventh International Workshop on Data-Intensive Computing in the Clouds (DataCloud) |
| Publisher | IEEE |
| ISBN (Electronic) | 9781509061587 |
| DOIs | |
| Publication status | Published - 9 Feb 2017 |
| Event | 7th International Workshop on Data-Intensive Computing in the Clouds 2016 - Salt Lake City, United States Duration: 14 Nov 2016 → 14 Nov 2016 |
Conference
| Conference | 7th International Workshop on Data-Intensive Computing in the Clouds 2016 |
|---|---|
| Abbreviated title | DataCloud 2016 |
| Country/Territory | United States |
| City | Salt Lake City |
| Period | 14/11/16 → 14/11/16 |
Keywords
- Data-Intensive science
- Deployment and reusability of execution environments
- scientific workows
- stream-based system
ASJC Scopus subject areas
- Software
- Computer Science Applications
- Computer Networks and Communications