Logo image
Risk-aware checkpoint selection in cloud-based scientific workflow
Conference paper   Peer reviewed

Risk-aware checkpoint selection in cloud-based scientific workflow

Mingzhong Wang, L Zhu and J Chen
Proceedings of the 2nd International Conference on Cloud and Green Computing and the 2nd International Conference on Social Computing and Its Applications, pp.137-144
2nd International Conference on Cloud and Green Computing and the 2nd International Conference on Social Computing and Its Applications (CGC/SCA 2012), 2012 (Xiangtan, China, 01-Nov-2012–03-Nov-2012)
IEEE Computer Society
2012
url
https://doi.org/10.1109/CGC.2012.46View
Published Version

Abstract

checkpoint exception handling risk robustness scientific workflow
Scientific workflows are generally computing- and data-intensive with large volume of data generated during their execution. Therefore, some of the data should be saved to avoid the expensive re-execution of tasks in case of exceptions. However, cloud-based data storage services come at some expense. In this paper, we extend the risk evaluation model, which assigns different weights to tasks based on their ordering relationship, to decide the occasion to perform backup or checkpoint service after the completion of a task. The proposed method computes and compares the potential loss with and without data backup to achieve the tradeoff between overhead of check pointing and re-execution after exceptions. We also design the utility function with the model and apply a genetic algorithm to find the optimized schedule. The results show that the robustness of the schedule is increased while the possible risk of failure is minimized, especially when the generated data is not large. © 2012 IEEE.

Details

Metrics

603 Record Views
Logo image