Application and middleware transparent checkpointing with TCKPT on ClusterGrids

Kovács, József and Kacsuk, Péter and Januszewski, R. and Jankowski, G. (2010) Application and middleware transparent checkpointing with TCKPT on ClusterGrids. Future Generation Computer Systems, 26 (3). pp. 498-503.

S0167739X09000958.pdf - Published Version
Restricted to Registered users only


Abstract

This paper introduces a combination of existing parallel checkpointing techniques for software-heterogeneous ClusterGrid infrastructures. Most existing solutions aim to support application transparency (no checkpoint-related code development in the application), while some others provide middleware transparency (no service modification). The main contribution of this paper is a solution that provides both application and middleware transparency at the same time. Compatibility and integrity requirements are identified, and corresponding conditions are established using Abstract State Machines. The most relevant checkpointing systems are checked against these conditions to examine their conformity. Based on the conditions, a novel checkpointing method is defined, and a proof-of-concept checkpointing tool called TotalCheckpoint (TCKPT) is introduced.

Item Type: ISI Article
Uncontrolled Keywords: cluster, grid, clustergrid, checkpoint, recovery, parallel, pvm, migration
Subjects: Q Science > QA Mathematics and Computer Science > QA75 Electronic computers. Computer science / számítástechnika, számítógéptudomány
Divisions: Laboratory of Parallel and Distributed Systems
Depositing User: Eszter Nagy
Date Deposited: 12 Dec 2012 08:39
Last Modified: 12 Dec 2012 08:39
URI: https://eprints.sztaki.hu/id/eprint/6471
