Journal of Shanghai Jiaotong University (Science) ›› 2016, Vol. 21 ›› Issue (3): 280-288. doi: 10.1007/s12204-016-1723-2
ZHAO Xia* (赵 夏), MA Sheng (马 胜), CHEN Wei (陈 微), WANG Zhiying (王志英)
Online: 2016-06-30
Published: 2016-06-30
Contact: ZHAO Xia (赵 夏)
E-mail: xiazhao@nudt.edu.cn
ZHAO Xia (赵 夏), MA Sheng (马 胜), CHEN Wei (陈 微), WANG Zhiying (王志英). Exploiting Parallelism in the Simulation of General Purpose Graphics Processing Unit Program [J]. Journal of Shanghai Jiaotong University (Science), 2016, 21(3): 280-288.