selected publications
academic article
- A failure detector for HPC platforms. The International Journal of High Performance Computing Applications. 32:139-158. 2017
- Unified Model for Assessing Checkpointing Protocols at Extreme-Scale. 2012
conference paper
- CLUSTER 2019 Committees. i-v. 2019