A Checkpoint-on-Failure Protocol for Algorithm-Based Recovery in Standard MPI

TitleA Checkpoint-on-Failure Protocol for Algorithm-Based Recovery in Standard MPI
Publication TypeConference Paper
Year of Publication2012
AuthorsBland, W., P. Du, A. Bouteiller, T. Herault, G. Bosilca, and J. Dongarra
Conference Name18th International European Conference on Parallel and Distributed Computing (Euro-Par 2012), Christos Kaklamanis, Theodore Papatheodorou and Paul Spirakis eds.
Conference LocationSpringer-Verlag, Rhodes, Greece, August 27-31