
|
Albert Hartono |
|
· "Iterative Optimization in the Polyhedral Model: Part I, One-Dimensional Time", Louis-Noel Pouchet, Cedric Bastoul, Albert Cohen, Nicolas Vasilache, CGO 2007 · "Semi-automatic Composition of Loop Transformations for Deep Parallelism and Memory Hierarchies", Sylvain Girbal, Nicolas Vasilache, Cédric Bastoul, Albert Cohen, David Parello, Marc Sigler, and Olivier Temam, IJPP 2006
· "Automated Empirical Tuning of a Multiresolution Analysis Kernel", Haihang You, Keith Seymour, Jack Dongarra, and Shirley Moore, ICL Technical Report, ICL-UT-07-01 · "Automatic Analysis of Inefficiency Patterns in Parallel Applications", Felix Wolf, Bernd Mohr, Jack Dongarra, and Shirley Moore, Concurrency and Computation: Practice and Experience, 2007
· "High-performance implementation of the Level-3 BLAS", Kazushige Goto and Robert van de Geijn, Technical Report TR-2006-23, The University of Texas at Austin, Department of Computer Sciences, 2006
· "Combining models and guided empirical search to optimize for multiple levels of the memory hierarchy", Chun Chen, Jacqueline Chame, and Mary W. Hall, CGO 2005 · "Compiler-assisted performance tuning", Chun Chen, Jacqueline Chame, Yoonju Lee Nelson, Pedro Diniz, Mary Hall and Robert Lucas, Journal of Physics: Conference Series (SciDAC 2007) · "A systematic approach to model-guided empirical search for memory hierarchy optimization", Chun Chen, Jacqueline Chame, Mary W. Hall, and Kristina Lerman, LCPC 2005 · "Model-guided empirical optimization for multimedia extension architectures: A case study", Chun Chen, Jaewook Shin, Shiva Kintali, Jacqueline Chame, and Mary Hall, POHLL 2007
· "Automatic Tuning Matrix Multiplication Performance on Graphics Hardware", Changhao Jiang and Marc Snir, PACT 2005
· "Automated Empirical Optimizations of Software and the ATLAS Project", R. Whaley, A. Petitet, and J. Dongarra, Parallel Computing 27(1–2):3–25, 2001 · "Self Adapting Linear Algebra Algorithms and Software", Jim Demmel, Jack Dongarra, Victor Eijkhout, Erika Fuentes, Antoine Petitet, Rich Vuduc, R. Clint Whaley, Katherine Yelick, Proceedings of the IEEE, 2004 · "Tuning High Performance Kernels through Empirical Compilation", R. Clint Whaley and David B Whalley, ICPP 2005 · "An Effective Empirical Search Method for Automatic Software Tuning", H. You, K. Seymour, and J. Dongarra, University of Tennessee, Computer Science Department Tech Report, ICL-UT-05-02
· "Automated Transformation for Performance-Critical Kernels", Qing Yi and Clint Whaley, LCSD 2007 · "POET: Parameterized Optimizations for Empirical Tuning", Qing Yi, Keith Seymour, Haihang You, Richard Vuduc and Dan Quinlan, POHLL 2007
· "The design and implementation of FFTW3", M. Frigo, and S. G. Johnson, IJPP 2006
· "The Effect of Cache Models on Iterative Compilation for Combined Tiling and Unrolling", T. Kisuki, P.M.W. Knijnenburg, K. Gallivan, M.F.P. O'Boyle, Concurrency and Computation: Practice & Experience, 2004
· "Combining Analytical and Empirical Approaches in Tuning Matrix Transposition", Qingda Lu, Sriram Krishnamoorthy, and P. Sadayappan, PACT 2006 · "Empirical Performance-Model Driven Data Layout Optimization", Q. Lu, X. Gao, S. Krishnamoorthy, G. Baumgartner, J. Ramanujam, and P. Sadayappan, LCPC 2004
· "Affine transformations for communication minimal parallelization and locality optimization of arbitrarily nested loop sequences", Uday Bondhugula, M. Baskaran, S. Krishnamoorthy, J. Ramanujam, A. Rountev, and P. Sadayappan, OSU Research Report OSU-CISRC-5/07-TR43.
|

|
Albert Hartono |