Publications

2011
Achieving a Single Compute Device Image in OpenCL for Multiple GPUs, Kim, Jungwon, Kim Honggyu, Lee Joo Hwan, and Lee Jaejin , Proceedings of the 16th ACM Symposium on Principles and Practice of Parallel Programming, New York, NY, USA, p.277–288, (2011)
Community Climate System Model, Worley, P. H., Vertenstein M., and Craig A. P. , Encyclopedia of Parallel Computing, New York, NY, p.342–351, (2011)
The Community Climate System Model Version 4, Gent, P. R., Danabasoglu G., Donner L. J., Holland M. M., Hunke E. C., Jayne S. R., Lawrence D. M., Neale R. B., Rasch P. J., Vertenstein M., et al. , Journal of Climate, Volume 24, p.4973–4991, (2011)
Critical Path-Based Thread Placement for NUMA Systems, Su, Chun-Yi, Li Dong, Nikolopoulos Dimitrios S., Grove Mat, Cameron Kirk W., and de Supinski Bronis R. , International Workshop on Modeling, Benchmarking and Simulation of High Performance Computing Systems (PMBS). In conjunction with SC'11, November, 2011, Seattle, WA, (2011)
Energy-Aware Workload Consolidation on GPU, Li, Dong, Byna Surendra, and Chakradhar Srimat , International Workshop on Scheduling and Resource Management for Parallel and Distributed Systems. In conjunction with ICPP'11, Sept, 2011, Taipei, Taiwan, (2011)
Global-Aware and Multi-Order Context-Based Prefetching for High-Performance Processors, Chen, Y., Zhu H., Roth Philip C., Jin H., and Sun X. - H. , International Journal of High Performance Computing Applications, 11/2011, Volume 25, Issue 4, p.355–370, (2011)
An Instruction-scheduling-aware Data Partitioning Technique for Coarse-grained Reconfigurable Architectures, Jang, Choonki, Kim Jungwon, Lee Jaejin, Kim Hee-Seok, Yoo Dong-Hoon, Kim Sukjin, Kim Hong-Seok, and Ryu Soojung , Proceedings of the 2011 SIGPLAN/SIGBED Conference on Languages, Compilers and Tools for Embedded Systems, New York, NY, USA, p.151–160, (2011)
Keeneland: Bringing Heterogeneous GPU Computing to the Computational Science Community, Vetter, Jeffrey S., Glassbrook R., Dongarra J., Schwan K., Loftis B., McNally S., Meredith Jeremy S., Rogers J., Roth Philip C., Spafford Kyle, et al. , IEEE Computing in Science and Engineering, Volume 13, Issue 5, p.90–95, (2011)
LACIO: A New Collective I/O Strategy for Parallel I/O Systems, Chen, Y., Sun X. - H., Thakur R., Roth Philip C., and Gropp W. D. , 25th IEEE International Parallel & Distributed Processing Symposium, 5/2011, Anchorage, Alaska, (2011)
Memphis on a Cray XT: Pinpointing Memory Performance Problems on Cray Platforms, McCurdy, C., Vetter Jeffrey S., Worley P. H., and Maxwell D. , Proceedings of the 53rd Cray User Group Conference, May, Fairbanks, AK, (2011)
An OpenCL Framework for Homogeneous Manycores with No Hardware Cache Coherence, Lee, Jun, Kim Jungwon, Kim Junghyun, Seo Sangmin, and Lee Jaejin , Parallel Architectures and Compilation Techniques (PACT), 2011 International Conference on, Oct, p.56–67, (2011)
Parallel In Situ Coupling of Simulation with a Fully Featured Visualization System, Whitlock, Brad J., Favre Jean M., and Meredith Jeremy S. , Proceedings of the Eurographics Symposium on Parallel Graphics and Visualization, p.101–109, (2011)
Performance Implications of Nonuniform Device Topologies in Scalable Heterogeneous Architectures, Meredith, Jeremy S., Roth Philip C., Spafford Kyle, and Vetter Jeffrey S. , IEEE Micro, 09/2011, Volume 31, Issue 5, p.66–75, (2011)
Performance of the Community Earth System Model, Worley, P. H., Craig A. P., Dennis J. M., Mirin A. A., Taylor M. A., and Vertenstein M. , Proceedings of the ACM/IEEE International Conference for High Performance Computing, Networking, Storage and Analysis (SC11), November, Seattle, WA, (2011)
Probabilistic Communication and I/O Tracing with Deterministic Replay at Scale, Wu, X., Vijayakumar K., Mueller F., Ma X., and Roth Philip C. , 2011 International Conference on Parallel Processing (ICPP 2011), 9/2011, Taipei, Taiwan, (2011)
A Prototype Two-Decade Fully-Coupled Fine-Resolution CCSM Simulation, McClean, J. L., Bader D. C., Bryan F. O., Maltrud M. E., Dennis J. M., Mirin A. A., Jones P. W., Kim Y. Y., Ivanova D. P., Vertenstein M., et al. , Ocean Modelling, Volume 39, p.10–30, (2011)
Quantifying NUMA and Contention Effects in Multi-GPU Systems, Spafford, Kyle, Meredith Jeremy S., and Vetter Jeffrey S. , Fourth Workshop on General Purpose Processing on Graphics Processing Units, (2011)
Scalable Memory Registration for High-Performance Networks Using Helper Threads, Li, Dong, Nikolopoulos Dimitrios S., Cameron Kirk W., de Supinski Bronis R., and Schulz Martin , ACM International Conference on Computing Frontiers (CF), Feb, 2011, Ischia, Italy, (2011)
A Scalable Two-Phase Parallel I/O Library With Application To a Large Scale Subsurface Simulator (poster), Sreepathi, Sarat, Sripathi Vamsi, Hammond Glenn, Mills Richard, and Mahinthakumar Kumar , SC Companion, (2011)
Scientific Discovery at the Exascale, a Report from the DOE ASCR 2011 Workshop on Exascale Data Management, Analysis, and Visualization, Ahern, Sean, Shoshani Arie, Ma Kwan-Liu, Choudhary Alok, Critchlow Terence, Klasky Scott, Pascucci Valerio, Ahrens Jim, Bethel Wes E., Childs Hank, et al. , (2011)
2010
Collective Prefetching for Parallel I/O Systems, Chen, Y., and Roth Philip C. , 5th Petascale Data Storage Workshop, 11/2010, New Orleans, LA, (2010)
The Community Climate System Model, Worley, P. H. , Performance Tuning of Scientific Applications, Boca Raton, FL, p.315–338, (2010)
