Thomas Naughton III Computer Scientist, Intelligent Systems and Facilities Research Group Contact 865.576.4184 | NAUGHTONT@ORNL.GOV All Publications A comparison of Amazon Web Services and Microsoft Azure cloud platforms for high performance computing A Cooperative Approach to Virtual Machine Based Fault Injection... Preemptive Resource Management for Dynamically Arriving Tasks in an Oversubscribed Heterogeneous Computing System Epidemic Failure Detection and Consensus... A New Deadlock Resolution Protocol and Message Matching Algorithm for the Extreme-scale Simulator Adding Fault Tolerance to NPB Benchmarks Using ULFM... Supporting the Development of Soft-Error Resilient Message Passing Applications using Simulation... Scalable and Fault Tolerant Failure Detection and Consensus... A Network Contention Model for the Extreme-scale Simulator... Towards a Resilience Investigation Framework for High Performance Computing... Improving the Performance of the Extreme-scale Simulator What is the right balance for performance and isolation with virtualization in HPC?... Efficient Checkpointing of Virtual Machines using Virtual Machine Introspection Toward Improved Support for Loosely Coupled Large Scale Simulation Workflows Using Performance Tools to Support Experiments in HPC Resilience... Supporting the Development of Resilient Message Passing Applications using Simulation... A Runtime Environment for Supporting Research in Resilient HPC System Software & Tools Toward a Performance/Resilience Tool for Hardware/Software Co-Design of High-Performance Computing Systems... The Impact of a Fault Tolerant MPI on Scalable Systems Services and Applications... Hyperspectral Aquatic Radiative Transfer Modeling Using a High-Performance Cluster Computing–Based Approach... Hyperspectral Aquatic Radiative Transfer Modeling Using a High-Performance Cluster Computing-Based Approach A Log-Scaling Fault Tolerant Agreement Algorithm for a Fault Tolerant MPI A case for Virtual Machine based Fault Injection in a High-Performance Computing Environment... Architecture for the Next Generation System Management Tools... Realization of User Level Fault Tolerant Policy Management through a Holistic Approach for Fault Correlation... Pagination First page « First Previous page ‹â¶Ä¹ Page 1 Current page 2 Page 3 Next page ›â¶Äº Last page Last » Key Links Organizations Computing and Computational Sciences Directorate Computer Science and Mathematics Division Advanced Computing Systems Research Section Intelligent Systems and Facilities Group