selected publications
-
academic article
- A failure detector for HPC platforms. The International Journal of High Performance Computing Applications. 32:139-158. 2017
- Unified Model for Assessing Checkpointing Protocols at Extreme-Scale. . 2012