Publications

2010
Collective Prefetching for Parallel I/O Systems, Chen, Y., and Roth Philip C. , 5th Petascale Data Storage Workshop, 11/2010, New Orleans, LA, (2010)
The Community Climate System Model, Worley, P. H. , Performance Tuning of Scientific Applications, Boca Raton, FL, p.315–338, (2010)
Hybrid MPI/OpenMP Power-Aware Computing, Li, Dong, de Supinski Bronis R., Schulz Martin, Nikolopoulos Dimitrios S., and Cameron Kirk W. , International Parallel and Distributed Processing Symposium (IPDPS), April, 2010, Atlanta, GA, (2010)
Maestro: Data Orchestration and Tuning for OpenCL Devices, Spafford, Kyle, Meredith Jeremy S., and Vetter Jeffrey S. , European Conference on Parallel and Distributed Computing (EuroPar), (2010)
Memphis: Finding and Fixing NUMA-related Performance Problems on Multi-core Platforms, McCurdy, Collin, and Vetter Jeffrey S. , IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), White Plains, NY, (2010)
An OpenCL Framework for Heterogeneous Multicores with Local Memory, Lee, Jaejin, Kim Jungwon, Seo Sangmin, Kim Seungkyun, Park Jungho, Kim Honggyu, Dao Thanh Tuan, Cho Yongjin, Seo Sung Jong, Lee Seung Hak, et al. , Proceedings of the 19th International Conference on Parallel Architectures and Compilation Techniques, New York, NY, USA, p.193–204, (2010)
OpenMPC: Extended OpenMP Programming and Tuning for GPUs, Lee, Seyong, and Eigenmann Rudolf , The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC), Won the Best Student Paper award, 11/2010, New Orleans, LA, USA, (2010)
On the Path to Exascale, Alvin, Kenneth F., Barrett Brian W., Brightwell Ronald B., Dosanjh Sudip S., Geist Al, Hemmert Scott K., Heroux Michael A., Kothe Doug, Murphy Richard C., Nichols Jeff, et al. , International Journal of Distributed Systems and Technologies (IJDST), Volume 1, Issue 2, p.1-22, (2010)
Petascale Direct Numerical Simulation of Blood Flow on 200K Cores and Heterogeneous Architectures (Gordon Bell Award Winner), Rahimian, Abtin, Lashuk Ilya, Veerapaneni Shravan, Chandramowlishwaran Aparna, Malhotra Dhairya, Moon Logan, Sampath Rahul, Shringarpure Aashay, Vetter Jeffrey S., Vuduc Richard, et al. , 2010 ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC10), New Orleans, p.1-11, (2010)
Power Saving Experiments for Large-Scale Global Optimisation, Cao, Zhenwei, Easterling David, Watson Layne T., Li Dong, Cameron Kirk W., and Feng Wu-Chun , International Journal of Parallel, Emergent and Distributed Systems, (2010)
Power-Aware MPI Task Aggregation Prediction for High-End Computing Systems, Li, Dong, Nikolopoulos Dimitrios S., Cameron Kirk W., de Supinski Bronis R., and Schulz Martin , International Parallel and Distributed Processing Symposium (IPDPS), April, 2010, Atlanta, GA, (2010)
PowerPack: Energy Profiling and Analysis of High-Performance Systems and Applications, Ge, Rong, Feng Xizhou, Song Shuaiwen, Chang Hung-Ching, Li Dong, and Cameron Kirk W. , IEEE Transactions on Parallel and Distributed Systems, Volume 21, (2010)
The Scalable HeterOgeneous Computing (SHOC) Benchmark Suite, Danalis, Anthony, Marin Gabriel, McCurdy Collin, Meredith Jeremy S., Roth Philip C., Spafford Kyle, Tipparaju Vinod, and Vetter Jeffrey S. , ACM Workshop on General-Purpose Processing on Graphics Processing Units (GPGPU), Pittsburgh, Pennsylvania, p.63-74, (2010)
Subcycled Dynamics in the Spectral Community Atmosphere Model Version 4, Taylor, M. A., Evans K. J., Hack J. J., and Worley P. H. , Proceedings of SciDAC 2010 Conference, July, Chattanooga, TN, (2010)
System-level, Unified In-band and Out-of-band Dynamic Thermal Control, Li, Dong, Ge Rong, and Cameron Kirk W. , International Conference on Parallel Processing, Sept, 2010, San Diego, CA, (2010)
Toward Performance Prediction of Tree-Based Overlay Networks on the Cray XT, Roth, Philip C. , Dagstuhl Seminar on Program Development for Extreme-Scale Computing, Wadern, Germany, (2010)
Using MRNet for Scalable Tool Development on Cray XT, Brim, M. J., Olichandran R., Miller B. P., Roth Philip C., and DeRose L. , Cray User Group 2010, (2010)
Visualization and Analysis-Oriented Reconstruction of Material Interfaces, Meredith, Jeremy S., and Childs Hank , Computer Graphics Forum, Volume 29, Number 3, p.1241–1250, (2010)
XGC1: Performance on the 8-Core and 12-Core Cray XT5 Systems at Oak Ridge National Laboratory, Worley, P. H., Adams M. F., D'Azevedo E. F., Chang C.-S., Ku S.-H., and McCurdy C. , Proceedings of the 52nd Cray User Group Conference, May, Edinburgh, United Kingdom, (2010)
2009
Accelerating S3D: A GPGPU Case Study, Spafford, Kyle, Meredith Jeremy S., Vetter Jeffrey S., Chen J., Grout R., and Sankaran R. , Seventh International Workshop on Algorithms, Models, and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar 2009), 25 August 2009, Delft, The Netherlands, (2009)
Accuracy and Performance of Graphics Processors: A Quantum Monte Carlo Application Case Study, Meredith, Jeremy S., Alvarez Gonzalo, Maier Thomas A., Schulthess Thomas C., and Vetter Jeffrey S. , Parallel Comput., Volume 35, Issue 3, p.151-163, (2009)
Cetus: A Source-to-Source Compiler Infrastructure for Multicores, Bae, Hansang, Bachega Leonardo, Dave Chirag, Lee Sang-Ik, Lee Seyong, Min Seung-Jai, Eigenmann Rudolf, and Midkiff Samuel , Proc. of the 14th Int'l Workshop on Compilers for Parallel Computing (CPC'09), (2009)
