Binary delta optimization of parallel program checkpoints
Distributed computer systems (CS) are the major high-performance data processing tools.They are multiarchitectural, hierarchically structured and large-scaled means. According to statisticaldata frequency of partial faults in such systems is about 1 per several days. Check pointing isone of the most common approaches to CS fault tolerance problem.Analysis of the existing check pointing tools shows that a synchronous approach to checkpointcreation is widely used. This approach is known to create a heavy input/output load atcheckpoint time. Therefore, it is important to decrease overhead during distributed checkpointcreation.This paper considers a binary delta compression as a solution of the above-stated problem.Hash-based algorithms FHash (Fixed Hash size algorithm) and SHash (Sliding window Hashbasedalgorithm) are analyzed. Theoretical and experimental estimation of computational complexityis obtained for both of them. It is shown that the SHash algorithm is not suitable for thebinary delta compression of the checkpoints because of its high overhead.New adaptive delta compression approach, combining advantages of the well-known deltacompression types, is proposed in the work. Its effectiveness is demonstrated on the collection ofparallel programs. Proposed algorithms form the basis of the software package called HBICT.HBICT is a part of the system software of the geographically distributed multicluster computersystem that is being developed by the Center of parallel computer technologies of the SiberianState University of Telecommunications and Information Sciences and A.V. Rzhanov Institute ofSemiconductor Physics (Siberian branch of the RAS). To provide automated checkpoint compressionat creation time HBICT is integrated with the check pointing package called DMTCP.
Keywords
distributed computer systems,
fault tolerance,
coordinated check pointing,
распределённые вычислительные системы,
контрольные точки восстановления программ,
отказоустойчивостьAuthors
Polyakov Artem Y. | Siberian State University of Telecommunications and Information Sciences. Novosibirsk | artpol84@gmail.com |
Moldovanova Olga V. | Siberian State University of Telecommunications and Information Sciences. Novosibirsk | ovmold@yandex.ru |
Всего: 2
References
NAS Parallel Benchmarks. URL: http://www.nas.nasa.gov/Resources/Software/npb.html (да- та обращения: 18.11.2010).
HBICT (Hash Based Incremental Checkpointing Tool). URL: http://sourceforge.net/projects /hbict/ (дата обращения: 18.11.2010).
Центр параллельных вычислительных технологий ГОУ ВПО «СибГУТИ» и ИФП СО РАН. URL: http://cpct.sibsutis.ru/ (дата обращения: 18.11.2010).
Рычков А.Д., Шокина Н.Ю., Милошевич Х. Моделирование процесса зажигания гранулированного унитарного твердого топлива в камере сгорания айрбэга // Материалы междунар. конф. «Вычислительные и информационные технологии в науке, технике и образовании». Павлодар, 2006. Т. 2. С. 165−175.
Sangho Yi S. et al. Adaptive page-level incremental checkpointing based on expected recovery time // Proc. of the 2006 ACM symposium on applied computing. NewYork: ACM, 2006. P. 1472-1476.
Tridgell A. Efficient algorithms for sorting and synchronization // PhD thesis. The Austrailian National University, 1999.
Agarwal S. et al. Adaptive incremental checkpointing for massively parallel systems // ICS'04: Proc. of the 18th Annual International Conference on Supercomputing. N.Y.: ACM Press, 2004. P. 277-286.
Kiswany S.A. et al. stdchk: A checkpoint storage system for desktop grid computing // Proc. of ICDCS. 2008. P. 613-624.
Ansel J. et. al. DMTCP: Transparent сheckpointing for сluster сomputations and the desktop // Proc. of IEEE International Parallel and Distributed Processing Symposium (IPDPS'09). IEEE Press, 2009. P. 1-12.
Bosilca G. et. al. MPICH-V: Toward a scalable fault tolerant MPI for volatile nodes // ACM/IEEE 2002 Conference on Supercomputing. IEEE Press, 2002. P. 29.
Hursey J. et al. The design and implementation of checkpoint/restart process fault tolerance for Open MPI // Proc. of the 21st IEEE International Parallel and Distributed Processing Symposium (IPDPS). IEEE Computer Society, 2007. P. 1-8.
Elnozahy E.N. et al. A survey of rollback-recovery protocols in message-passing systems // ACM Computing Surveys. 2002. V. 34. Nо. 3. P. 375-408.
Team T.B. An overview of the BlueGene/L supercomputer // Proc. of SC2002: High Performance Networking and Computing. - Baltimore, MD, Nov. 2002.
Budnik T. et al. High throughput computing on IBM's Blue Gene® // IBM Rochester Blue Gene Development. URL: http://www-03.ibm.com/systems/resources/HTC_WhitePaper_V2 _050508.pdf (дата обращения: 18.11.2010).
Philp I. Software failures and the road to a petaflop machine // Proc. of the Workshop on High Performance Computing Reliability. IEEE Computer Society. 2005.
TOP500 Supercomputer sites. URL: http://www.top500.org/ (дата обращения: 18.11.2010).
Хорошевский В.Г. Архитектура вычислительных систем. М.: МГТУ им. Н.Э. Баумана, 2008. 520 с.