Home
Publications
Posts
Talks
Projects
Recognition
Experience
Links
#mpi
(all tags)
Publications
Movement and Placement of Non-Contiguous Data In Distributed GPU Computing
Carl Pearson
Ph.D. Dissertation
04/21
TEMPI: An Interposed MPI Library with a Canonical Representation of CUDA-aware Datatypes
Carl Pearson
, Kun Wu, I-Hsin Chung, Jinjun Xiong, Wen-Mei Hwu
in
2021 ACM Symposium on High-Performance Parallel and Distributed Computing
06/21
Machine Learning for CUDA+MPI Design Rules
Carl Pearson
, Aurya Javeed, Karen Devine
in
23rd IEEE International Workshop on Parallel and Distributed Scientific and Engineering Computing (PDSEC)
03/22
KokkosComm: Communication Layer for Distributed Kokkos Applications
Gabriel Dos Santos, Nicole Avans, Cedric Chevalier, Hugo Taboada,
Carl Pearson
, Jan Ciesko, Stephen L. Olivier, Marc Perache
in
EuroMPI
09/24
Posts
Improving MPI_Pack performance in CUDA-aware MPI
10/06/20
Talks
Adding Fast GPU Derived Datatype Handing to Existing MPIs
at
University of New Mexico PSAAP Colloquium
02/15/21
Adding Fast GPU Derived Datatype Handing to Existing MPIs
at
UNM Computer Science Department Colloquium
05/05/21
Automatic Discovery of Implementation Rules for Fast GPU + MPI Operations
at
SIAM Parallel Processing
02/25/22
Latency and Bandwidth Microbenchmarks of Six US Department of Energy Systems in the Top500
at
Cluster 2023
11/02/23
Latency and Bandwidth Microbenchmarks of US Department of Energy Systems in the June 2023 Top500 List
at
Supercomputing 2023
11/13/23