Publications

M
Tiwari, A., A. Gamst, M. A. Laurenzano, M. Schulz, and L. Carrington, "Modeling the Impact of Reduced Memory Bandwidth on HPC Applications", Euro-Par 2014 Parallel Processing - 20th International Conference, Porto, Portugal, August 25-29, 2014. Proceedings, 2014.
Balaprakash, P., A. Tiwari, and S. M. Wild, "Multi-Objective Optimization of HPC Kernels for Performance, Power, and Energy", 4th International Workshop on Performance Modeling, Benchmarking, and Simulation of HPC Systems (PMBS13), Denver, Colorado, November, 2013.
Porterfield, A., N. Nassar, and R. Fowler, "Multi-Threaded Library for Many-Core Systems", Workshop on Multithreaded Architectures and Applications, Rome, Italy, IEEE, 2009.
N
Venkat, A., M. Shantharam, M. Hall, and M. Strout, "Non-affine Extensions to Polyhedral Code Generation", IEEE/ACM International Symposium on Code Generation and Optimization (CGO): ACM, pp. 185, 02/2014.
O
Porterfield, A., R. Fowler, S. Balachandra, and W. Wang, "OpenMP and MPI Application Energy Measurement Variation", 1st International Workshop on Energy Efficient SuperComputing (E2SC), Denver, CO, 11/2013.
Madduri, K., J. Su, S. Williams, L. Oliker, S. Ethier, and K. A. Yelick, "Optimization of Parallel Particle-to-Grid Interpolation on Leading Multicore Platforms", IEEE Transactions on Parallel and Distributed Systems, vol. 23, issue 10, pp. 1915–1922, October 2012.
Aktulga, H. M., A. Buluc, S. Williams, and C. Yang, "Optimizing Sparse Matrix-Multiple Vector Multiplication for Nuclear Configuration Interaction Calculations", International Parallel and Distributed Processing Symposium (IPDPS 2014), 05/2014.
P
Shan, H., K. McElvain, C. Johnson, S. Williams, and E. W. Ormand, "Parallel implementation and performance optimization of the configuration-interaction method", SC'15, Austin, TX, 11/2015.
Malony, A., S. Biersdorff, S. Shende, H. Jagode, S. Tomov, G. Juckeland, R. Dietrich, D. Poole, and C. Lamb, "Parallel Performance Measurement of Heterogeneous Parallel Systems with GPUs", International Conference on Parallel Processing (ICPP 2011), Taipei, Taiwan, September, 2011.
Sarje, A., S. Song, D. Jacobsen, K. Huck, J. Hollingsworth, A. Malony, S. Williams, and L. Oliker, "Parallel performance optimizations on unstructured mesh-based simulations", Procedia Computer Science, vol. 51, pp. 2016–2025, 2015.
Jia, Y., G. Bosilca, P. Luszczek, and J. Dongarra, "Parallel Reduction to Hessenberg Form with Algorithm-Based Fault Tolerance", SC'13, Denver, Colorado, November, 2013.
Childs, H., S. Biersdorff, D. Poliakoff, D. Camp, and A. D. Malony, "Particle advection performance over varied architectures and workloads", High Performance Computing (HiPC), 2014 21st International Conference on: IEEE, 2014.
Laurenzano, M., M. Tikir, L. Carrington, and A. Snavely, "PEBIL: Efficient Static Binary Instrumentation for Linux", IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), March, 2010.
Bhatele, A., H. Langer, A. D. Malony, M. Schulz, S. Shende, and N. R. Tallent, Performance Analysis, Modeling and Scaling of HPC Applications and Tools, 2016.
Norris, B., W. Spear, and A. Malony, "Performance Analysis of Applications in the Context of Architectural Rooflines", Proceedings of the 8th ACM/SPEC on International Conference on Performance Engineering, New York, NY, USA, ACM, 2017.