Publications

Export 3 results:
Author Title Type [ Year(Asc)]
2009
Accelerating S3D: A GPGPU Case Study, Spafford, Kyle, Meredith Jeremy S., Vetter Jeffrey S., Chen J., Grout R., and Sankaran R. , Seventh International Workshop on Algorithms, Models, and Tools for Parallel Computing on Heterogeneous Platforms (HeteroPar 2009), 25 August 2009, Delft, The Netherlands, (2009)
Accuracy and Performance of Graphics Processors: A Quantum Monte Carlo Application Case Study, Meredith, Jeremy S., Alvarez Gonzalo, Maier Thomas A., Schulthess Thomas C., and Vetter Jeffrey S. , Parallel Comput., Volume 35, Issue 3, p.151-163, (2009)
Cetus: A Source-to-Source Compiler Infrastructure for Multicores, Bae, Hansang, Bachega Leonardo, Dave Chirag, Lee Sang-Ik, Lee Seyong, Min Seung-Jai, Eigenmann Rudolf, and Midkiff Samuel , Proc. of the 14th Int'l Workshop on Compilers for Parallel Computing (CPC'09), (2009)  (249.74 KB)
Cetus: A Source-to-Source Compiler Infrastructure for Multicores, Dave, Chirag, Bae Hansang, Min Seung-Jai, Lee Seyong, Eigenmann Rudolf, and Midkiff Samuel , IEEE Computer, Volume 42, p.36–42, (2009)  (3.16 MB)
Coping at the User-Level with Resource Limitations in the Cray Message Passing Toolkit MPI at Scale: How Not to Spend Your Summer Vacation, Mills, R. T., Hoffman F. M., Worley P. H., Perumalla K., Mirin A. A., Hammond G., and Smith B. , Proceedings of the 51st Cray User Group Conference, May, Atlanta, GA, (2009)
Design, Implementation, and Evaluation of Transparent pNFS on Lustre, Yu, W., Drokin O., and Vetter Jeffrey S. , IEEE International Parallel and Distributed Processing Symposium (IPDPS 09), 2009, Rome, Italy, (2009)
Early Evaluation of the Cray XT5, Worley, P. H., Barrett R. F., and Kuehn J. A. , Proceedings of the 51st Cray User Group Conference, May, Atlanta, GA, (2009)
A Holistic Approach for Performance Measurement and Analysis for Petascale Applications, Jagode, Heike, Dongarra Jack, Alam S. R., Vetter Jeffrey S., Spear Wyatt, and Malony Allen D. , International Conference on Computational Science, Baton Rouge, LA, (2009)
HPC Interconnection Networks: The Key to Exascale Computing, Vetter, Jeffrey S., Tipparaju Vinod, Yu Weikuan, and Roth Philip C. , Advances in Parallel Computing: High Speed and Large Scale Scientific Computing, Volume 18, p.95-106, (2009)
Model-based Hybrid MPI/OpenMP Power-Aware Computing (poster), Li, Dong, de Supinski Bronis R., Schulz Martin, Cameron Kirk W., and Nikolopoulos Dimitrios S. , International Conference for High Performance Computing, Networking, Storage and Analysis (SC), Portland, OR, (2009)
Modeling the Office of Science Ten Year Facilities Plan: The PERI Architecture Tiger Team, de Supinski, B., Alam Sadaf R., Bailey D., Carrington L., Daley C., Dubey A., Gamblin T., Gunter D., Hovland P., Jagode H., et al. , Journal of Physics: Conference Series (Proceedings of SciDAC 2009, San Diego, CA, July 14-18, 2009), Volume 180, Issue 012039, (2009)
OpenMP to GPGPU: A Compiler Framework for Automatic Translation and Optimization, Lee, Seyong, Min Seung-Jai, and Eigenmann Rudolf , ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), New York, NY, USA, (2009)  (481.02 KB)
Scalable I/O Tracing and Analysis, Vijayakumar, K., Mueller F., Ma X., and Roth Philip C. , 2009 Petascale Data Storage Workshop, 11/2009, Portland, Oregon, (2009)
Scalable Tool Infrastructure for the Cray XT Using Tree-Based Overlay Networks, Roth, Philip C., and Vetter Jeffrey S. , 2009 CScADS Workshop on Performance Tools for Petascale Computing, July, Tahoe City, CA, (2009)  (3.49 MB)
Scalable Tool Infrastructure for the Cray XT Using Tree-Based Overlay Networks, Roth, Philip C., and Vetter Jeffrey S. , Cray User Group Meeting (CUG 2009), Atlanta, (2009)
Scaling to 150K Cores: Recent Algorithm and Performance Engineering Developments Enabling XGC1 to Run at Scale, Adams, M. F., Ku S. - H., Worley P. H., D'azevedo E. F., Cummings J. C., and Chang C. - S. , Journal of Physics: Conference Series, Volume 180, (2009)
Whole-Volume Integrated Gyrokinetic Simulation of Plasma Turbulence in Realistic Diverted-Tokamak Geometry, Chang, C. - S., Ku S. - H., Diamond P. H., Adams M. F., Barreto R. D., Chen Y., Cummings J. C., D'azevedo E. F., Dif-Pradalier G., Ethier S., et al. , Journal of Physics: Conference Series, Volume 180, (2009)
2008
Acceleration of Time Integration, White III, J. B., Evans K. J., Archibald R., Drake J. B., Worley P. H., and Kothe D. , Proceedings of the 50th Cray User Group Conference, May, Helsinki, Finland, (2008)
Adaptive Runtime Tuning of Parallel Sparse Matrix-Vector Multiplication on Distributed Memory Systems, Lee, Seyong, and Eigenmann Rudolf , ACM International Conference on Supercomputing (ICS08), 06/2008, (2008)  (667.15 KB)
Adaptive Tuning in a Dynamically Changing Resource Environment, Lee, Seyong, and Eigenmann Rudolf , Workshop on Next-Generation Software Systems, Int'l Parallel and Distributed Processing Symposium (IPDPS'08), (2008)  (143.01 KB)
Algorithm 888: Spherical Harmonic Transform Algorithms, Drake, J. B., Worley P. H., and D'azevedo E. F. , ACM Transactions on Mathematical Software, October, Volume 35, p.1–23, (2008)
CG-Cell: An NPB Benchmark Implementation on Cell Broadband Engine, Li, Dong, Huang Song, and Cameron Kirk W. , Interntional Conference on Distributed Computing and Networking (ICDCN), Jan, 2008, Kolkata, India, (2008)
COMIC: A Coherent Shared Memory Interface for Cell Be, Lee, Jaejin, Seo Sangmin, Kim Chihun, Kim Junghyun, Chun Posung, Sura Zehra, Kim Jungwon, and Han SangYong , Proceedings of the 17th International Conference on Parallel Architectures and Compilation Techniques, New York, NY, USA, p.303–314, (2008)
The Cray XT4 Quad-core: A First Look, Alam, Sadaf R., Barrett R. F., Eisenbach M., Fahey M. R., Hartman-Baker R., Kuehn J. A., Poole S., Sankaran R., and Worley P. H. , Proceedings of the 50th Cray User Group Conference, May, Helsinki, Finland, (2008)

Pages