Publications (Selected)
Progressing Non-blocking Two-Sided Communication Using BlueField DPUs, ARCS'25
AutoBench: A Holistic Platform for Automated and Reproducible Benchmarking in HPC Testbeds, ICPE'25
Performance Comparison of GPU Programming Models Using HeCBench Benchmarks, CF'25
Phase-aware System-Side Sampling for HPC, CF'23
Modeling and Simulation of MPI Communication Dynamics in InfiniBand Networks Using SimGrid, PARS'25
Exploiting reduced precision for GPU-based time series mining, IPDPS'22
Benchmarking Machine Learning Inference in FPGA-based Accelerated Space Applications, MLBench'21
Time Series Mining at Petascale Performance, ISC'20 (Hans Meuer Best Paper Award)
Enabling Malleability for Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics using LAIK, PARS'19