Abstract
In grid computing, achieving strong fault tolerance depends on the proper utilization of resources and scheduling of jobs. The literature offers two solutions to these challenging tasks, viz. checkpointing and replication. A checkpointing strategy is proposed that uses the median of the failure intervals of the resources to decide the checkpoint intervals for the given jobs. The strategy shows improved system throughput, fewer job losses, and shorter job execution times while eliminating unnecessary checkpoints.
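The abstract does not state the exact rule that maps failure intervals to checkpoint intervals, so the following is only a minimal sketch under an assumed rule: the checkpoint interval is set equal to the median of a resource's observed failure intervals. The function names `checkpoint_interval` and `checkpoint_times` and the sample numbers are hypothetical and are not taken from the paper. The sketch illustrates why a median is attractive here: a single unusually long failure interval does not inflate the result, and a job shorter than the interval takes no checkpoint at all.

```python
from statistics import median


def checkpoint_interval(failure_intervals):
    """Return the median of the observed failure intervals.

    Assumption (not from the paper): the checkpoint interval is taken
    to be exactly this median, in the same time unit as the input.
    """
    if not failure_intervals:
        raise ValueError("need at least one observed failure interval")
    return median(failure_intervals)


def checkpoint_times(job_length, interval):
    """List the elapsed times at which a job of the given length checkpoints.

    A checkpoint is taken every `interval` time units, strictly before
    the job finishes, so short jobs may take no checkpoint at all.
    """
    if interval <= 0:
        raise ValueError("interval must be positive")
    times = []
    t = interval
    while t < job_length:
        times.append(t)
        t += interval
    return times


# Hypothetical failure history of one resource (time units are arbitrary).
history = [4, 10, 6, 8, 30]
interval = checkpoint_interval(history)   # median ignores the outlier 30
schedule = checkpoint_times(20, interval)
```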
Original language | English |
---|---|
Title of host publication | ECMS 2012 Proceedings |
Publisher | European Council for Modeling and Simulation |
Number of pages | 7 |
ISBN (Print) | 9780956494443 |
Publication status | Published - 29 May 2012 |
Externally published | Yes |
Event | 26th European Conference on Modelling and Simulation: Shaping reality through simulation, Koblenz, Germany. Duration: 29 May 2012 → 1 Jun 2012. http://www.scs-europe.net/conf/ecms2012/index.html |
Conference
Conference | 26th European Conference on Modelling and Simulation |
---|---|
Country/Territory | Germany |
City | Koblenz |
Period | 29/05/12 → 1/06/12 |