Publications
Design and Analysis of Soft-Error Resilience Mechanisms for GPU Register File,
, IEEE International Conference on VLSI Design (VLSID), Hyderabad, India, (2017)
Design and Implementation of Papyrus: Parallel Aggregate Persistent Storage,
, IEEE International Parallel and Distributed Processing Symposium (IPDPS), (2017)
DESTINY: A Comprehensive Tool with 3D and Multi-level Cell Memory Modeling Capability,
, Journal of Low Power Electronics and Applications, Volume 7, p.23, (2017)
Durango: Scalable Synthetic Workload Generation for Extreme-Scale Application Performance Modeling and Simulation,
, Proceedings of the 2017 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation, New York, NY, USA, (2017)
A Distributed OpenCL Framework using Redundant Computation and Data Replication,
, ACM SIGPLAN conference on Programming Language Design and Implementation (PLDI), (2016)
DESTINY: A Tool for Modeling Emerging 3D NVM and eDRAM caches,
, Design Automation and Test in Europe (DATE), (2015)
Diagnosis and Optimization of Application Prefetching Performance,
, ACM International Conference on Supercomputing (ICS), Euguene, OR, (2013)
A Distributed Data-Parallel Framework for Analysis and Visualization Algorithm Development,
, Proceedings of the 5th Annual Workshop on General Purpose Processing with Graphics Processing Units, New York, NY, USA, p.11–19, (2012)
DOSAS: Mitigating the Resource Contention in Active Storage Systems,
, IEEE Cluster 2012, 09/2012, (2012)
Design, Implementation, and Evaluation of Transparent pNFS on Lustre,
, IEEE International Parallel and Distributed Processing Symposium (IPDPS 09), 2009, Rome, Italy, (2009)
DARPA's HPCS Program: History, Models, Tools, Languages,
, Advances in Computers, Volume Volume 72, p.1-100, (2008)
Deep Start: A Hybrid Strategy for Automated Performance Problem Searches,
, Concurrency and Computation: Practice and Experience, September, Volume 15, Issue 11-12, p.1027-1046, (2003)
Dynamic Statistical Profiling of Communication Activity in Distributed Applications,
, ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems, Marina del Rey, California, (2002)
A Dynamic Tracing Mechanism for Performance Analysis of OpenMP Applications,
, Workshop on OpenMP Applications and Tools (WOMPAT), West Lafayette, Indiana, USA, (2001)
Dynamic Software Testing of MPI Applications with Umpire,
, SC2000: High Performance Networking and Computing Conf. (electronic publication), Dallas, TX USA, (2000)
Data Exchange: High Performance Communications In Distributed Laboratories,
, Ninth Int'l Conf. Parallel and Distributed Computing Systems (PDCS 97), (1997)
Design and Performance of a Scalable Parallel Community Climate Model,
, Parallel Computing, Volume 21, Issue 10, p.1571–1591, (1995)
Design of a Parallel Computational Fluid Dynamics Code on a Shared Memory Architecture,
, American Society of Mechanical Engineers, Bioengineering Division (BED), (1995)
Development of a Parallel Spectral Element Code Using SPMD Constructs,
, Parallel CFD: Implementations and Results Using Parallel Computers, p.121–128, (1995)
Development of a Parallel Spectral Element Code Using SPMD Constructs,
, Parallel CFD: Implementations and Results Using Parallel Computers, p.121-128, (1995)