Publications
"General Hybrid Parallel Profiling",
Parallel, Distributed, and Network-Based Processing (PDP 2014), Turin, Italy, 02/2014.
"Graphical processing units and scientific applications",
International Journal of High Performance Computing Applications (IJHPCA) , August 2012, vol. 26, pp. 189 - 191, 2012.
"Green Queue: Customized Large-Scale Clock Frequency Scaling",
Cloud and Green Computing (CGC) , 1-3 November , 2012, Xiangtan, Hunan, China, IEEE, 2012.
"Gyrokinetic Particle-in-Cell Optimization on Emerging Multi- and Manycore Platforms",
Parallel Computing, vol. 37, no. 9, pp. 501-520, sept, 2011.
"Gyrokinetic Toroidal Simulations on Leading Mult- and Manycore HPC Systems",
(submitted to) Supercomputing, April, 2011.
"Generating Performance Bounds from Source Code",
Proceedings of the First International Workshop on Parallel Software Tools and Tool Infrastructures (PSTI 2010), sep, 2010.
"A Generalized Framework for Auto-tuning Stencil Computations",
In Proceedings of the Cray User Group Conference, 2009, Atlanta, GA, May, 2009.
"Genetic Algorithm Approach to Modeling the Performance of Memory-bound Codes",
Supercomputing, 2007. The Proceeding of the ACM/IEEE Conference on High Performance Networking and Computing, Reno, NV, november, 2007.