Mixed-Precision S/DGEMM Using the TF32 and TF64 Frameworks on Low-Precision AI Tensor Cores Conference Paper November, 2023
Julia as a unifying end-to-end workflow language on the Frontier exascale system Conference Paper November, 2023
Moment Representation of Regularized Lattice Boltzmann Methods on NVIDIA and AMD GPUs Conference Paper November, 2023
Performance Evaluation of Heterogeneous GPU Programming Frameworks for Hemodynamic Simulations Conference Paper November, 2023
Experience Migrating OpenCL to SYCL: A Case Study on Searches for Potential Off-Target Sites of Cas9 RNA-Guided Endonucleases on AMD GPUs Conference Paper September, 2023
Experimental Characterization of OpenMP Offloading Memory Operations and Unified Shared Memory Support Conference Paper September, 2023
Optimizing Data Movement for GPU-Based In-Situ Workflow Using GPUDirect RDMA Conference Paper September, 2023