Gesellschaft für Informatik e.V.

Lecture Notes in Informatics


Informatik 2004, Informatik verbindet, Band 2, Beiträge der 34. Jahrestagung der Gesellschaft für Informatik e.V. (GI), Ulm, 20.-24. September 2004 P-51, 656-660 (2004).

GI, Gesellschaft für Informatik, Bonn
2004


Editors

Peter Dadam, Manfred Reichert (eds.)


Copyright © GI, Gesellschaft für Informatik, Bonn

Contents

Crash management for distributed parallel systems

Jan Haase and Frank Eschmann

Abstract


With the growing complexity of parallel architectures, the probability of system failures grows, too. One approach to cope with this problem is the self-healing, one of the organic computing's self-x features. Self-healing in this context means that computer clusters should detect and handle failures automatically. This paper presents a self-healing mechanism based on checkpointing, so that a cluster remains operative even if some sites or the connections between them fail. The proposed method has been implemented and tested on the Self Distributing Virtual Machine (SDVM).


Full Text: PDF

GI, Gesellschaft für Informatik, Bonn
ISBN 3-88579-3080-6


Last changed 24.01.2012 21:47:37