Resilience within Ultrascale Computing System: Challenges and Opportunities from NESUS Project

Pascal Bouvry, Rudolph Mayer, Jakub Muszynski, Dana Petcu, Andreas Rauber, Gianluca Tempesti, Tuan Trinh, Sebastien Varrette

Research output: Contribution to journal, Article, peer-review

Abstract

Although resilience is already an established field in system science, with many methodologies and approaches available to deal with it, the unprecedented scale of computing, the massive volumes of data to be managed, new network technologies, and drastically new forms of massive-scale applications bring new challenges that need to be addressed. This paper reviews the challenges and approaches of resilience in ultrascale computing systems from multiple perspectives: the resilience aspects of hardware-software co-design for ultrascale systems, resilience against (security) attacks, new approaches and methodologies for resilience in ultrascale systems, and applications and case studies.
Original language: English
Pages (from-to): 1-18
Number of pages: 18
Journal: Supercomputing Frontiers and Innovations
Volume: 2
Issue number: 2
Publication status: Published - 2015
