Overhead of Coordinated Checkpointing Protocols for Message Passing Parallel Systems

Bernd Bieker, Erik Maehle

Abstract

This paper presents a simulation study of coordinated checkpointing protocols for message-passing parallel systems. The aim is to estimate the overhead that checkpointing produces. The study also compares different protocols and their execution on different parallel computing systems. To enable this analysis, a simple application model is derived that represents a class of number-crunching programs. The simulations yield general statements about the runtime overhead generated by coordinated checkpointing protocols.

Original language: English
Title of host publication: Dependable Network Computing
Editors: Dimiter R. Avresky
Number of pages: 20
Volume: 538
Place of publication: Boston, MA
Publisher: Springer US
Publication date: 2000
Pages: 359-378
ISBN (Print): 978-1-4613-7053-6
ISBN (Electronic): 978-1-4615-4549-1
Publication status: Published - 2000
