[1]袁娥, 张云泉, 刘芳芳, 等. SpMV的自动性能优化实现技术及其应用研究[J].计算机研究与发展, 2009, 46 (7):11171126.YUAN E, ZHANG Yunquan, LIU Fangfang, et al. Automatic performance tuning of sparse matrixvector multiplication: Implementation techniques and its application research [J]. Journal of Computcr Research and Development, 2009, 46 (7): 11171126.[2]Chen Y, Huang Y J, Eeckhout L, et al. Evaluating iterative optimization across 1000 data sets [C]∥Programming Language Design and Implementation. New York: ACM, 2010: 448459.[3]Lee Y, Hall M. A code isolator isolating code fragments from large programs [C]∥Languages and Compilers for Parallel Computing. Germany: Springer LNCS, 2004: 164178.[4]Liao C H, Quinlan D J, Vuduc R, et al. Effective sourcetosource outlining to support whole program empirical optimization [C]∥Languages and Compilers for Parallel Computing. Germany: Springer LNCS 5898, 2009: 308322.[5]江毛进, 陆鑫达, 陈杰. 编译中的循环优化[J]. 上海交通大学学报, 1996, 30 (6): 2028.JIANG Maojin, LU Xinda, CHEN Jie. Loop optimization in compilation [J]. Journal of Shanghai Jiaotong University, 1996, 30 (6): 2028.[6]陈杰, 陆鑫达. 调整数组大小——一种减少Cache失效率的有效方法[J]. 上海交通大学学报, 1997, 31 (8): 4448.CHEN Jie, LU Xinda. Tuning dizes of arrays: An effective method to reduce cache miss ratio [J]. Journal of Shanghai Jiaotong University, 1997, 31 (8): 4448.[7]Cavazos J, Fursin G, Agakov F, et al. Rapidly selecting good compiler optimizations using performance counters [C]∥Code Generation and Optimization. San Jose, USA: IEEE Computer Society, 2007: 185197.[8]Lu P J, Che Y G, Wang Z H. UMDA/S: An effective iterative compilation algorithm for parameter search [J]. Computing and Informatics, 2010, 29 (6): 11591179. |