Publications

Export 3 results:
Author Title Type [ Year(Asc)]
2011
Performance Implications of Nonuniform Device Topologies in Scalable Heterogeneous Architectures, Meredith, Jeremy S., Roth Philip C., Spafford Kyle, and Vetter Jeffrey S. , IEEE Micro, 09/2011, Volume 31, Issue 5, p.66–75, (2011)
Performance of the Community Earth System Model, Worley, P. H., Craig A. P., Dennis J. M., Mirin A. A., Taylor M. A., and Vertenstein M. , Proceedings of the ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC11), November, Seattle, WA, (2011)
Probabilistic Communication and I/O Tracing with Deterministic Replay at Scale, Wu, X., Vijayakumar K., Mueller F., Ma X., and Roth Philip C. , 2011 International Conference on Parallel Processing (ICPP 2011), 9/2011, Taipei, Taiwan, (2011)
A Prototype Two-Decade Fully-Coupled Fine-Resolution CCSM Simulation, McClean, J. L., Bader D. C., Bryan F. O., Maltrud M. E., Dennis J. M., Mirin A. A., Jones P. W., Kim Y. Y., Ivanova D. P., Vertenstein M., et al. , Ocean Modelling, Volume 39, p.10–30, (2011)
Quantifying NUMA and Contention Effects in Multi-GPU Systems, Spafford, Kyle, Meredith Jeremy S., and Vetter Jeffrey S. , Fourth Workshop on General Purpose Processing on Graphics Processing Units, (2011)
Scalable Memory Registration for High-Performance Networks Using Helper Threads, Li, Dong, Nikolopoulos Dimitrios S., Cameron Kirk W., de Supinski Bronis R., and Schulz Martin , ACM International Conference on Computer Frontier (CF), Feb, 2011, Ischia, Italy, (2011)
A Scalable Two-Phase Parallel I/O Library With Application To a Large Scale Subsurface Simulator (poster), Sreepathi, Sarat, Sripathi Vamsi, Hammond Glenn, Mills Richard, and Mahinthakumar Kumar , SC Companion, (2011)
Scientific Discovery at the Exascale, a Report from the DOE ASCR 2011 Workshop on Exascale Data Management, Analysis, and Visualization, Ahern, Sean, Shoshani Arie, Ma Kwan-Liu, Choudhary Alok, Critchlow Terence, Klasky Scott, Pascucci Valerio, Ahrens Jim, Bethel Wes E., Childs Hank, et al. , (2011)
2010
Collective Prefetching for Parallel I/O Systems , Chen, Y., and Roth Philip C. , 5th Petascale Data Storage Workshop, 11/2010, New Orleans, LA, (2010)
The Community Climate System Model, Worley, P. H. , Performance Tuning of Scientific Applications, Boca Raton, FL, p.315–338, (2010)
Hybrid MPI/OpenMP Power-Aware Computing, Li, Dong, de Supinski Bronis R., Schulz Martin, Nikolopoulos Dimitrios S., and Cameron Kirk W. , International Parallel and Distributed Processing Symposium (IPDPS), April, 2010, Atlanta, GA, (2010)
Maestro: Data Orchestration and Tuning for OpenCL Devices, Spafford, Kyle, Meredith Jeremy S., and Vetter Jeffrey S. , European Conference on Parallel and Distributed Computing (EuroPar), (2010)
Maestro: Data Orchestration and Tuning for OpenCL Devices, Spafford, Kyle, Meredith Jeremy S., and Vetter Jeffrey S. , European Conference on Parallel and Distributed Computing (EuroPar), (2010)
Memphis: Finding and Fixing NUMA-related Performance Problems on Multi-core Platforms , McCurdy, Collin, and Vetter Jeffrey S. , IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), White Plains, NY, (2010)
An OpenCL Framework for Heterogeneous Multicores with Local Memory, Lee, Jaejin, Kim Jungwon, Seo Sangmin, Kim Seungkyun, Park Jungho, Kim Honggyu, Dao Thanh Tuan, Cho Yongjin, Seo Sung Jong, Lee Seung Hak, et al. , Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, New York, NY, USA, p.193–204, (2010)
OpenMPC: Extended OpenMP Programming and Tuning for GPUs, Lee, Seyong, and Eigenmann Rudolf , The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), Won the Best Student Paper award, 11/2010, New Orleans, LA, USA, (2010)  (355.54 KB)
On the Path to Exascale, Alvin, Kenneth F., Barrett Brian W., Brightwell Ronald B., Dosanjh Sudip S., Geist Al, Hemmert Scott K., Heroux Michael A., Kothe Doug, Murphy Richard C., Nichols Jeff, et al. , International Journal of Distributed Systems and Technologies (IJDST), Volume 1, Issue 2, p.1-22, (2010)
Petascale Direct Numerical Simulation of Blood Flow on 200K Cores and Heterogeneous Architectures (Gordon Bell Award Winner), Rahimian, Abtin, Lashuk Ilya, Veerapaneni Shravan, Chandramowlishwaran Aparna, Malhotra Dhairya, Moon Logan, Sampath Rahul, Shringarpure Aashay, Vetter Jeffrey S., Vuduc Richard, et al. , 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC10), New Orleans, p.1-11, (2010)
Power Saving Experiments for Large-Scale Global Optimisation, Cao, Zhenwei, Easterling David, Watson Layne T., Li Dong, Cameron Kirk W., and Feng Wu-Chun , International Journal of Parallel, Emergent and Distributed Systems, (2010)
Power-Aware MPI Task Aggregation Prediction for High-End Computing Systems, Li, Dong, Nikolopoulos Dimitrios S., Cameron Kirk W., de Supinski Bronis R., and Schulz Martin , International Parallel and Distributed Processing Symposium (IPDPS), April, 2010, Atlanta, GA, (2010)
PowerPack: Energy Profiling and Analysis of High-Performance Systems and Applications, Ge, Rong, Feng Xizhou, Song Shuaiwen, Chang Hung-Ching, Li Dong, and Cameron Kirk W. , IEEE Transactions on Parallel and Distributed Systems, Volume 21, (2010)
The Scalable HeterOgeneous Computing (SHOC) Benchmark Suite, Danalis, Anthony, Marin Gabriel, McCurdy Collin, Meredith Jeremy S., Roth Philip C., Spafford Kyle, Tipparaju Vinod, and Vetter Jeffrey S. , ACM Workshop on General-Purpose Processing on Graphics Processing Units (GPGPU), Pittsburgh, Pennsylvania, p.63-74, (2010)
Subcycled Dynamics in the Spectral Community Atmosphere Model Version 4, M. A. Taylor, K. J. Evans, Hack J. J., and Worley P. H. , Proceedings of SciDAC 2010 Conference, July, Chattanooga, TN, (2010)
System-level, Unified In-band and Out-of-band Dynamic Thermal Control, Li, Dong, Ge Rong, and Cameron Kirk W. , International Conference on Parallel Processing, Sept, 2010, San Diego, CA, (2010)
Toward Performance Prediction of Tree-Based Overlay Networks on the Cray XT, Roth, Philip C. , Dagstuhl Seminar on Program Development for Extreme-Scale Computing, Wadern, Germany, (2010)

Pages