|Title||IMPACC: A Tightly Integrated MPI+OpenACC Framework Exploiting Shared Memory Parallelism|
|Publication Type||Conference Paper|
|Year of Publication||2016|
|Authors||Kim, Jungwon, Lee Seyong, and Vetter Jeffrey S.|
|Conference Name||ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC)|
We propose IMPACC, an MPI+OpenACC framework for heterogeneous accelerator clusters. IMPACC tightly integrates MPI and OpenACC, while exploiting the shared memory parallelism in the target system. IMPACC dynamically adapts the input MPI+OpenACC applications on the target heterogeneous accelerator clusters to fully exploit target system-specific features. IMPACC provides the programmers with the unified virtual address space, automatic NUMA-friendly task-device mapping, efficient integrated communication routines, seamless streamlining of asynchronous executions, and transparent memory sharing. We have implemented IMPACC and evaluated its performance using three heterogeneous accelerator systems, including Titan supercomputer. Results show that IMPACC can achieve easier programming, higher performance, and better scalability than the current MPI+OpenACC model.