Publications

KokkosComm: Communication Layer for Distributed Kokkos Applications
Gabriel Dos Santos, Nicole Avans, Cedric Chevalier, Hugo Taboada, Carl Pearson, Jan Ciesko, Stephen L. Olivier, Marc Perache
in
EuroMPI
09/24
Latency and Bandwidth Microbenchmarks of US Department of Energy Systems in the June 2023 Top 500 List
Christopher M. Siefert, Carl Pearson, Stephen L. Olivier, Andrey Prokopenko, Jonathan J. Hu, Timothy J. Fuller
in
14th IEEE International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems
11/23
Latency and Bandwidth Microbenchmarks of Six US Department of Energy Systems in the Top500
Carl Pearson, Christopher M. Siefert, Stephen L. Olivier, Andrey Prokopenko, Timothy J. Fuller, Jonathan J. Hu
in
IEEE Cluster 2023
11/23
Interconnect Bandwidth Heterogeneity on AMD MI250x and Infinity Fabric
Carl Pearson
arXiv
02/23
Machine Learning for CUDA+MPI Design Rules
Carl Pearson, Aurya Javeed, Karen Devine
in
23rd IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC)
03/22
TEMPI: An Interposed MPI Library with a Canonical Representation of CUDA-aware Datatypes
Carl Pearson, Kun Wu, I-Hsin Chung, Jinjun Xiong, Wen-Mei Hwu
in
2021 ACM Symposium on High-Performance Parallel and Distributed Computing
06/21
Movement and Placement of Non-Contiguous Data In Distributed GPU Computing
Carl Pearson
Ph.D. Dissertation
04/21
At-Scale Sparse Deep Neural Network Inference With Efficient GPU Implementation
Mert Hidayetoglu, Carl Pearson, Vikram Sharma Mailthody, Eiman Ebrahimi, Jinjun Xiong, Rakesh Nagi, Wen-Mei Hwu
in
2020 IEEE High Performance Extreme Computing Conference
09/20
Node-Aware Stencil Communication on Heterogeneous Supercomputers
Carl Pearson, Mert Hidayetoglu, Mohammad Almasri, Omer Anjum, I-Hsin Chung, Jinjun Xiong, Wen-Mei Hwu
in
2020 IEEE International Workshop on Automatic Performance Tuning (iWAPT)
03/20
Accelerating Sparse Deep Neural Networks on FPGAs
Sitao Huang, Carl Pearson, Rakesh Nagi, Jinjun Xiong, Deming Chen, Wen-Mei Hwu
in
2019 IEEE High Performance Extreme Computing Conference
09/19
Update on k-truss Decomposition on GPU
Mohammad Almasri, Omer Anjum, Carl Pearson, Vikram S. Mailthody, Zaid Qureshi, Rakesh Nagi, Jinjun Xiong, Wen-Mei Hwu
in
2019 IEEE High Performance Extreme Computing Conference
08/19
Update on Triangle Counting on GPU
Carl Pearson, Mohammad Almasri, Omer Anjum, Vikram S. Mailthody, Zaid Qureshi, Rakesh Nagi, Jinjun Xiong, Wen-Mei Hwu
in
2019 IEEE High Performance Extreme Computing Conference
08/19
Evaluating Characteristics of CUDA Communication Primitives on High-Bandwidth Interconnects
Carl Pearson, Abdul Dakkak, Sarah Hashash, Cheng Li, I-Hsin Chung, Jinjun Xiong, Wen-Mei Hwu
in
2019 ACM/SPEC International Conference on Performance Engineering
04/19
Collaborative (CPU + GPU) Algorithms for Triangle Counting and Truss Decomposition
Vikram S. Mailthody, Ketan Date, Zaid Qureshi, Carl Pearson, Rakesh Nagi, Jinjun Xiong, Wen-Mei Hwu
in
2018 IEEE High Performance Extreme Computing Conference
09/18
SCOPE: C3SR Systems Characterization and Benchmarking Framework
Carl Pearson, Abdul Dakkak, Cheng Li, Sarah Hashash, Jinjun Xiong, Wen-Mei Hwu
tech report
09/18
NUMA-Aware Data-Transfer Measurements for Power/NVLink Multi-GPU Systems
Carl Pearson, I-Hsin Chung, Zehra Sura, Jinjun Xiong, Wen-Mei Hwu
in
International Workshop on OpenPower in HPC (IWOPH) 2018
06/18
Heterogeneous Application and System Modeling
Carl Pearson
M.S. Thesis, May 2018
06/18
A Fast and Massively-Parallel Solver for Multiple-Scattering Tomographic Image Reconstruction
Mert Hidayetoglu, Carl Pearson, Izzat El Hajj, Levent Gurel, Weng Cho Chew, Wen-Mei Hwu
in
2018 IEEE International Parallel and Distributed Processing Symposium
05/18
Rebooting the Data Access Hierarchy of Computing Systems
Wen-mei Hwu, Izzat El Hajj, Simon Garcia de Gonzalo, Carl Pearson, Nam Sung Kim, Deming Chen, Jinjun Xiong, Zehra Sura
in
IEEE International Conference on Rebooting Computing 2017
11/17
Scalable Parallel DBIM Solutions of Inverse-Scattering Problems
Mert Hidayetoglu, Carl Pearson, Levent Gurel, Wen-mei Hwu, Weng Cho Chew
in
Computing and Electromagnetics International Workshop (CEM), 2017
06/17
Thoughts on Massively-Parallel Heterogeneous Computing for Solving Large Problems
Wen-mei Hwu, Mert Hidayetoglu, Weng Cho Chew, Carl Pearson, Simon Garcia de Gonzalo, Sitao Huang, Abdul Dakkak
in
Computing and Electromagnetics International Workshop 2017
06/17
Comparative Performance Evaluation of Multi-GPU MLFMM Implementation for 2-D VIE Problems
Carl Pearson, Mert Hidayetoglu, Wei Ren, Weng Cho Chew, Wen-Mei Hwu
in
Computing and Electromagnetics International Workshop, IEEE 2017
06/17
RAI: A Scalable Project Submission System for Parallel Programming Courses
Abdul Dakkak, Carl Pearson, Cheng Li
in
Parallel and Distributed Processing Symposium Workshops, 2017 IEEE International
05/17
Large Inverse-Scattering Solutions with DBIM on GPU-Enabled Supercomputers
Mert Hidayetoglu, Carl Pearson, Weng Cho Chew, Levent Gurel, Wen-mei Hwu
in
Applied and Computational Electromagnetics Symposium, 2017
03/17
WebGPU: A Scalable Online Development Platform for GPU Programming Courses
Abdul Dakkak, Carl Pearson, Cheng Li
in
Parallel and Distributed Processing Symposium Workshops, 2016 IEEE International
05/16
Adaptive Cache Bypass and Insertion for Many-Core Accelerators
Xuhao Chen, Shengzhao Wu, Li-Wen Chang, Wei-Sheng Huang, Carl Pearson, Wen-mei Hwu
in
Proceedings of International Workshop on Manycore Embedded Systems, 2014
06/14