Christian Engelmann Senior Scientist and Group Leader, Intelligent Systems and Facilities Research Contact 865.574.3132 | ENGELMANNC@ORNL.GOV All Publications Pattern-based Modeling of Multiresilience Solutions for High-Performance Computing Shrink or Substitute: Handling Process Failures in HPC Systems Using In-Situ Recovery Pattern-based Modeling of High-Performance Computing Resilience Characterizing Temperature, Power, and Soft-Error Behaviors in Data Center Systems: Insights, Challenges, and Opportunities Failures in Large Scale Systems: Long-term Measurement, Analysis, and Implications Resilience Design Patterns: A Structured Approach to Resilience at Extreme Scale Big Data Meets HPC Log Analytics: Scalable Approach to Understanding Systems at Extreme Scale A Pattern Language for High-Performance Computing Resilience... A Cooperative Approach to Virtual Machine Based Fault Injection... Towards New Metrics for High-Performance Computing Resilience Epidemic Failure Detection and Consensus... Language Support for Reliable Memory Regions Havens: Explicit Reliable Memory Regions for HPC Applications Benchmark Generation and Simulation at Extreme Scale... A New Deadlock Resolution Protocol and Message Matching Algorithm for the Extreme-scale Simulator Lightweight, Actionable Analytical Tools Based on Statistical Learning for Efficient System Operations... Power-capping Aware Checkpointing: On the Interplay among Power-capping, Temperature, Reliability, Performance, and Energy Mini-Ckpts: Surviving OS Failures in Persistent Memory... Adding Fault Tolerance to NPB Benchmarks Using ULFM... Reducing Waste in Extreme Scale Systems through Introspective Analysis Supporting the Development of Soft-Error Resilient Message Passing Applications using Simulation... Scalable and Fault Tolerant Failure Detection and Consensus... A Network Contention Model for the Extreme-scale Simulator... Analyzing the Interplay of Failures and Workload on a Leadership-Class Supercomputer Improving the Performance of the Extreme-scale Simulator Pagination First page « First Previous page ‹â¶Ä¹ Page 1 Current page 2 Page 3 … Next page ›â¶Äº Last page Last » Key Links Curriculum Vitae INTERSECT Initiative Organizations Computing and Computational Sciences Directorate Computer Science and Mathematics Division Advanced Computing Systems Research Section Intelligent Systems and Facilities Group