Pedro Valero Lara
Senior Computer Scientist

Contact
865.341.2035 | VALEROLARAP@ORNL.GOV

All Publications
JACC: Leveraging HPC Meta-Programming and Performance Portability with the Just-in-Time and LLVM-based Julia Language...
Large language model evaluation for high-performance computing software development
ChatBLAS: The First AI-Generated and Portable BLAS Library
Integrating ORNL's HPC and Neutron Facilities with a Performance-Portable CPU/GPU Ecosystem
Clacc: OpenACC for C/C++ in Clang
IRIS Reimagined: Advancements in Intelligent Runtime System for Task-Based Programming
eCC++: A Compiler Construction Framework for Embedded Domain-Specific Languages
MatRIS: Addressing the Challenges for Portability and Heterogeneity Using Tasking for Matrix Decomposition (Cholesky)
sKokkos: Enabling Kokkos with Transparent Device Selection on Heterogeneous Systems using OpenACC
MatRIS: Multi-level Math Library Abstraction for Heterogeneity and Performance Portability using IRIS Runtime...
Julia as a unifying end-to-end workflow language on the Frontier exascale system
Mixed-Precision S/DGEMM Using the TF32 and TF64 Frameworks on Low-Precision AI Tensor Cores
Moment Representation of Regularized Lattice Boltzmann Methods on NVIDIA and AMD GPUs
Comparing Llama-2 and GPT-3 LLMs for HPC kernels generation
IRIS-DMEM: Efficient Memory Management for Heterogeneous Computing
S4PST: Sustainability for Programming Systems and Tools Workshop Report
Evaluation of OpenAI Codex for HPC Parallel Programming Models Kernel Generation
A MultiGPU Performance-Portable Solution for Array Programming Based on Kokkos
Evaluating performance and portability of high-level programming models: Julia, Python/Numba, and Kokkos on exascale nodes
Tiling Framework for Heterogeneous Computing of Matrix based Tiled Algorithms
IRIS-BLAS: Towards a Performance Portable and Heterogeneous BLAS Library...
KokkACC: Enhancing Kokkos with OpenACC
SparseLU, A Novel Algorithm and Math Library for Sparse LU Factorization
LaRIS: Targeting Portability and Productivity for LAPACK Codes on Extreme Heterogeneous Systems by Using IRIS
OpenMP Target Task: Tasking and Target Offloading on Heterogeneous Systems