Publications
"Thread-level parallelization and optimization of NWChem for the Intel MIC architecture",
Proceedings of the Sixth International Workshop on Programming Models and Applications for Multicores and Manycores, PMAM@PPoPP 2015, San Francisco, CA, USA, February 7-8, 2015, 2015.
"Through the Looking-Glass: From Performance Observation to Dynamic Adaptation",
Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing, {HPDC} 2015, Portland, OR, USA, June 15-19, 2015, 2015.
"Toward Multi-target Autotuning for Accelerators",
The 20th IEEE International Conference on Parallel and Distributed Systems (ICPADS 2014), Hsinchu, Taiwan, IEEE Computer Society Press, 12/2014.
"Tools for machine-learning-based empirical autotuning and specialization",
International Journal of High Performance Computing Applications, vol. 27, issue 4, pp. 403-411, November, 2013.
"Towards fully automatic auto-tuning: Leveraging language features of Chapel",
International Journal of High Performance Computing Applications, vol. 27, issue 4, pp. 394-402, November, 2013.
"Towards making auto tuning mainstream",
International Journal of High Performance Computing Applications, vol. 27, issue 4, pp. 379-393, November, 2013.
"Tracking a Value's Influence on Later Computation",
6th Workshop on Productivity and Performance (PROPER 2013), Aachen, Germany, 08/2013.
"Transformations exploiting array syntax in Fortran for accelerator programming",
International Journal of Computational Science and Engineering (IJCSE), vol. 8, pp. 47-57, February,2013.
"A Type-Based Approach to Separating Protocol from Application Logic - A Case Study in Hybrid Computer Programming",
Euro -Par International Conference on parallel processing ,27-31 August 2012, Rhodes Island, Greece, Springer Berlin Heidelberg, 2012.
"A type-based approach to separating protocol from application logic: a case study in hybrid computer programming",
Euro -Par International Conference on parallel processing ,27-31 August 2012, Rhodes Island, Greece, Springer Berlin Heidelberg, 2012.
"Tuning parallel applications in parallel",
Parallel Comput., vol. 35, no. 8-9, pp. 475–492, August, 2009.
"The TAU Parallel Performance System",
International Journal of High Performance Computing Applications, vol. 20, no. 2, Summer, pp. 287–311, 2006.