CQUniversity
Browse

File(s) not publicly available

Checkpointing schemes for grid workflow systems

journal contribution
posted on 2017-12-06, 00:00 authored by Z Li, Yang Xiang
One of the major challenges in wide use of Grid workflow systems is fault tolerance and avoidance. Checkpointing schemes provide a way of fault detection and recovery. In our research, we focus on performance optimization of checkpointing schemes and DVS (Dynamic Voltage Scaling) for Grid workflow systems. We propose offline checkpointing schemes with DVS and online adaptive checkpointing schemes that dynamically adjust the checkpointing intervals by using store-checkpoints (SCPs) and compare-checkpoints (CCPs). When combined with DVS, offline adaptive checkpointing schemes not only are fault tolerant but also lead to reduce average execution time of tasks. These schemes can efficiently utilize comparison and storage operations and significantly improve the performance. Further, these schemes can calculate the optimal numbers of checkpoints by which minimize the mean execution time. We also expand the online adaptive checkpointing schemes from single-task execution scenarios to multi-task execution scenarios. Simulation results show these online schemes outstandingly increase the likelihood of timely task completion when faults occur.

Funding

Category 1 - Australian Competitive Grants (this includes ARC, NHMRC)

History

Volume

20

Issue

15

Start Page

1773

End Page

1790

Number of Pages

18

ISSN

1532-0626

Location

Chichester, UK

Publisher

John Wiley and Sons

Language

en-aus

Peer Reviewed

  • Yes

Open Access

  • No

External Author Affiliations

Faculty of Business and Informatics; Not affiliated to a Research Institute; Xiamen da xue;

Era Eligible

  • Yes

Journal

Concurrency and computation : practice and experience.

Usage metrics

    CQUniversity

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC