Jeffrey Vetter
Section Head - Advanced Computing Systems Research

Contact
865.576.7115 | VETTER@ORNL.GOV

All Publications
ChatBLAS: The First AI-Generated and Portable BLAS Library
IRIS-GNN: Leveraging Graph Neural Networks for Scheduling on Truly Heterogeneous Runtime Systems
Large language model evaluation for high‐performance computing software development
A Performance-Portable MultiGPU Implementation of 3D Euler Equations using ProtoX and IRIS
JACC: Leveraging HPC Meta-Programming and Performance Portability with the Just-in-Time and LLVM-based Julia Language...
Integrating ORNL’s HPC and Neutron Facilities with a Performance-Portable CPU/GPU Ecosystem
CHARM-SYCL & IRIS: A Tool Chain for Performance Portability on Extremely Heterogeneous Systems
IRIS: A Performance-Portable Framework for Cross-Platform Heterogeneous Computing
Clacc: OpenACC for C/C++ in Clang
MatRIS: Addressing the Challenges for Portability and Heterogeneity Using Tasking for Matrix Decomposition (Cholesky)
IRIS Reimagined: Advancements in Intelligent Runtime System for Task-Based Programming
IRIS: Exploring Performance Scaling of the Intelligent Runtime System and its Dynamic Scheduling Policies
eCC++ : A Compiler Construction Framework for Embedded Domain-Specific Languages
sKokkos: Enabling Kokkos with Transparent Device Selection on Heterogeneous Systems using OpenACC
Errant Beam Detection Using the AMD Versal ACAP and Vitis AI
Arithmetic Primitives for Efficient Neuromorphic Computing
Moment Representation of Regularized Lattice Boltzmann Methods on NVIDIA and AMD GPUs
Mixed-Precision S/DGEMM Using the TF32 and TF64 Frameworks on Low-Precision AI Tensor Cores
Performance Evaluation of Heterogeneous GPU Programming Frameworks for Hemodynamic Simulations
FFTX-IRIS: Towards Performance Portability and Heterogeneity for SPIRAL Generated Code...
Julia as a unifying end-to-end workflow language on the Frontier exascale system
MatRIS: Multi-level Math Library Abstraction for Heterogeneity and Performance Portability using IRIS Runtime...
CHARM-SYCL: New Unified Programming Environment for Multiple Accelerator Types
IRIS-DMEM: Efficient Memory Management for Heterogeneous Computing
Comparing Llama-2 and GPT-3 LLMs for HPC kernels generation

Key Links