#mpi

(all tags)

Publications

Movement and Placement of Non-Contiguous Data In Distributed GPU Computing
Carl Pearson
Ph.D. Dissertation
04/21
TEMPI: An Interposed MPI Library with a Canonical Representation of CUDA-aware Datatypes
Carl Pearson, Kun Wu, I-Hsin Chung, Jinjun Xiong, Wen-Mei Hwu
in
2021 ACM Symposium on High-Performance Parallel and Distributed Computing
06/21
Machine Learning for CUDA+MPI Design Rules
Carl Pearson, Aurya Javeed, Karen Devine
in
23rd IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC)
03/22

Posts

Improving MPI_Pack performance in CUDA-aware MPI

Talks

Adding Fast GPU Derived Datatype Handing to Existing MPIs
at
University of New Mexico PSAAP Colloquium
02/15/21
Adding Fast GPU Derived Datatype Handing to Existing MPIs
at
UNM Computer Science Department Colloquium
05/05/21
Automatic Discovery of Implementation Rules for Fast GPU + MPI Operations
at
SIAM Parallel Processing
02/25/22
Latency and Bandwidth Microbenchmarks of Six US Department of Energy Systems in the Top500
at
Cluster 2023
11/02/23
Latency and Bandwidth Microbenchmarks of US Department of Energy Systems in the June 2023 Top500 List
at
Supercomputing 2023
11/13/23