Norris, B., W. Spear, and A. Malony, "Performance Analysis of Applications in the Context of Architectural Rooflines", Proceedings of the 8th ACM/SPEC on International Conference on Performance Engineering, New York, NY, USA, ACM, 2017.
Shan, H., K. McElvain, C. Johnson, S. Williams, and E. W. Ormand, "Parallel implementation and performance optimization of the configuration-interaction method", SC'15, Austin, TX, 11/2015.
Sarje, A., S. Song, D. Jacobsen, K. Huck, J. Hollingsworth, A. Malony, S. Williams, and L. Oliker, "Parallel performance optimizations on unstructured mesh-based simulations", Procedia Computer Science, vol. 51, pp. 2016–2025, 2015.
Ozog, D., A. D. Malony, and A. R. Siegel, "A Performance Analysis of SIMD Algorithms for Monte Carlo Simulations of Nuclear Reactor Cores", Parallel and Distributed Processing Symposium (IPDPS), 2015 IEEE International: IEEE, 2015.
Ellsworth, D. A., A. D. Malony, B. Rountree, and M. Schulz, "POW: System-wide Dynamic Reallocation of Limited Power in HPC", Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing, HPDC 2015, Portland, OR, USA, June 15-19, 2015.
Childs, H., S. Biersdorff, D. Poliakoff, D. Camp, and A. D. Malony, "Particle advection performance over varied architectures and workloads", High Performance Computing (HiPC), 2014 21st International Conference on: IEEE, 2014.
McCraw, H., J. Ralph, A. Danalis, and J. Dongarra, "Power Monitoring with PAPI for Extreme Scale Architectures and Dataflow-based Programming Models", Workshop on Monitoring and Analysis for High Performance Computing Systems Plus Applications (HPCMASPA 2014), IEEE Cluster 2014, Madrid, Spain, IEEE, September, 2014.
Jia, Y., G. Bosilca, P. Luszczek, and J. Dongarra, "Parallel Reduction to Hessenberg Form with Algorithm-Based Fault Tolerance", SC'13, Denver, Colorado, November, 2013.
Shan, H., B. Austin, W. De Jong, L. Oliker, N. Wright, and E. Apra, "Performance Tuning of Fock Matrix and Two-Electron Integral Calculations for NWChem on Leading HPC Platforms", Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS13) held as part of SC13, 11/2013.
Shan, H., W. Jong, L. Oliker, N. Wright, and B. Austin, "Performance Tuning of Fock Matrix and Two-Electron Integral Calculations for NWChem on Leading HPC Platforms", 4th International Workshop on Performance Modeling, Benchmarking, and Simulation of HPC Systems (PMBS13), Denver, Colorado, November, 2013.
Jain, N., A. Bhatele, M. Robson, T. Gamblin, and L. Kale, "Predicting Application Performance using Supervised Learning on Communication Features", SC'13, Denver, Colorado, November, 2013.
Hammond, J. R., S. Krishnamoorthy, S. Shende, N. A. Romero, and A. D. Malony, "Performance characterization of global address space applications: a case study with NWChem", Concurrency and Computation: Practice and Experience, vol. 24, issue 2, pp. 135-154, 2012.
Jagode, H., S. Moore, and D. Terpstra, "Performance counter monitoring for the Blue Gene/Q architecture", ScicomP 2012, Toronto, Ontario, Canada, May, 2012.
Malony, A. D., and S. Shende, "Performance Technology for Complex Parallel and Distributed Systems", Distributed and Parallel Systems: From Instruction Parallelism to Cluster Computing, vol. 567, pp. 37, 2012.