Massively Parallel OpenSHMEM

Colloq: Speaker: 
Max Grossman
Colloq: Speaker Institution: 
Rice University
Colloq: Date and Time: 
Wed, 2017-07-26 14:00
Colloq: Location: 
Building 5700, Room L204
Colloq: Host: 
Jeff Vetter
Colloq: Host Email:
Colloq: Abstract: 
While OpenSHMEM is used to scalably execute some of the most massive and irregular distributed workloads in the world, the OpenSHMEM community has been relatively slow to adapt to the drastic changes in shared-memory parallelism of the last decade. Indeed as of the latest version of OpenSHMEM (v1.3), the standard continues to say nothing about the thread-safety of any OpenSHMEM APIs (i.e. there is no equivalent to MPI_Init_thread). While the simplicity of programming OpenSHMEM in a flat model is attractive, the increased dimensionality of intra-node parallelism has led the OpenSHMEM community to start considering several unique proposals and research directions for supporting well-performing OpenSHMEM in a hybrid, massively multi-threaded environment. This talk will provide an overview of community proposals and research in multi-threaded programming with OpenSHMEM, and discuss how some of this work is setting up OpenSHMEM to be more future-proof against hardware and software changes than other threading models used in libraries like MPI or UPC++. This talk will focus on the AsyncSHMEM project at Rice University, which investigates the combination of asynchronous tasking runtimes and OpenSHMEM for improved scalability, programmability, and tooling. We will also discuss the OpenSHMEM thread registration proposal, contexts proposal, and the nvshmem project for supporting OpenSHMEM on NVIDIA GPUs.
Colloq: Speaker Bio: 
Max Grossman is a postdoctoral researcher in the Rice University Department of Computer Science, as well as CEO and Co-Founder of 7pod Technology. His work focuses on schedulers, programming models, tools, and applications for heterogeneous and distributed systems.