Journal of shanghai Jiaotong University (Science) ›› 2016, Vol. 21 ›› Issue (3): 280-288.doi: 10.1007/s12204-016-1723-2
Previous Articles Next Articles
ZHAO Xia* (赵 夏), MA Sheng (马 胜), CHEN Wei (陈 微), WANG Zhiying (王志英)
Online:
2016-06-30
Published:
2016-06-30
Contact:
ZHAO Xia (赵 夏)
E-mail: xiazhao@nudt.edu.cn
CLC Number:
ZHAO Xia* (赵 夏), MA Sheng (马 胜), CHEN Wei (陈 微), WANG Zhiying (王志英). Exploiting Parallelism in the Simulation of General Purpose Graphics Processing Unit Program[J]. Journal of shanghai Jiaotong University (Science), 2016, 21(3): 280-288.
[1] | NVIDIA. TeslaKeplerTM GPU accelerator [EB/OL].(2014-09-01). http://www.nvidia.com/content/tesla/pdf/ Tesla-KSeries-Overview-LR.pdf. |
[2] | AYANI R. Parallel simulation [C]//Performance Evaluationof Computer and Communication Systems.Berlin Heidelberg: Springer, 1993: 1-20. |
[3] | NICOL D, FUJIMOTO R. Parallel simulation today[J]. Annals of Operations Research, 1994, 53(1): 249-285. |
[4] | REINHARDT S K, HILL M D, LARUS J R, et al.The Wisconsin wind tunnel: Virtual prototyping ofparallel computers [C]// Proceedings of the 1993 ACMSIGMETRICS Conference. New York: ACM, 1993: 1-3. |
[5] | MUKHERJEE S S, REINHARDT S K, FALSAFI B,et al. Wisconsin wind tunnel II: A fast, portable parallelarchitecture simulator [J]. IEEE Concurrency, 2000,8(4): 12-20. |
[6] | CHEN J W, ANNAVARAM M, DUBOIS M. Slack-Sim: A platform for parallel simulations of CMPson CMPs [J]. ACM SIGARCH Computer ArchitectureNews, 2009, 37(2): 20-29. |
[7] | MILLER J E, KASTURE H, KURIAN G, et al.Graphite: A distributed parallel simulator for multicores[C]//Proceedings of 16th International Symposiumon High Performance Computer Architecture.Washington: IEEE, 2010: 1-12. |
[8] | LEE S, RO W W. Parallel GPU architecture simulationframework exploiting work allocation unit parallelism[C]//2013 IEEE International Symposium onPerformance Analysis of Systems and Software. Washington:IEEE, 2013: 107-117. |
[9] | DEL BARRIO V M, GONZ′ALEZ C, ROCA J, et al.ATTILA: A cycle-level execution-driven simulator formodern GPU architectures [C]//2006 IEEE InternationalSymposium on Performance Analysis of Systemsand Software. Washington: IEEE, 2006: 231-241. |
[10] | BAKHODA A, YUAN G L, FUNG W W L, et al. AnalyzingCUDA workloads using a detailed GPU simulator[C]// 2009 IEEE International Symposium onPerformance Analysis of Systems and Software. Washington:IEEE, 2009: 163-174. |
[11] | UBAL R, JANG B, MISTRY P, et al. Multi2Sim:A simulation framework for CPU-GPU computing[C]//Proceedings of the 21st International Conferenceon Parallel Architectures and Compilation Techniques.New York: ACM, 2012: 335-344. |
[12] | YU Z B, EECKHOUT L, GOSWAMI N, et al. AcceleratingGPGPU architecture simulation [C]// Proceedingsof the ACM SIGMETRICS/International Conferenceon Measurement and Modeling of Computer Systems.New York: ACM, 2013: 331-332. |
[13] | MAUER C J, HILL M D, WOOD D A. Full-systemtiming-first simulation [C]// Proceedings of the 2002ACM Sigmetrics Conference on Measurement andModeling of Computer Systems. New York: ACM,2002: 108-116. |
[14] | Illinois Microarchitecture Project utilizing AdvancedCompiler Technology Research Group.Parboil benchmark suite [EB/OL]. (2014-09-01).http://impact.crhc.illinois.edu/Parboil/parboil.aspx. |
[15] | NVIDIA Corporation. NVIDIA CUDASDK code samples [EB/OL]. (2014-09-01).http://docs.nvidia.com/cuda/cuda-samples. |
[16] | MIKE GILES. Libor [EB/OL]. (2014-09-01).http://people.maths.ox.ac.uk/gilesm/cuda.html. |
[17] | EECKHOUT L. Computer architecture performanceevaluation methods [J]. Synthesis Lectures on ComputerArchitecture, 2010, 5(1): 1-145. |
[18] | LUO Y, JOHN L K, EECKHOUT L. Self-monitoredadaptive cache warm-up for microprocessor simulation[C]// Proceedings of the 16th Symposium on ComputerArchitecture and High Performance Computing(SBAC-PAD’04). [s.l.]: IEEE, 2004: 10-17. |
[19] | HASKINS JR J W, SKADRON K. Acceleratedwarmup for sampled microarchitecture simulation [J].ACM Transactions on Architecture and Code Optimization,2005, 2(1): 78-108. |
[1] | GAO Xiang, NIU Junchuan, HE Lei, QIN Zhen, WANG Zhonglong. Vibration Robust Optimal Semi-Active Control of Multi-Dimensional Vibration Isolator Based on Parallel Mechanism [J]. Journal of Shanghai Jiao Tong University, 2025, 59(5): 648-656. |
[2] | Duan Jizhong, Su Yan. Improved Sensitivity Encoding Parallel Magnetic Resonance Imaging Reconstruction Algorithm Based on Efficient Sum of Outer Products Dictionary Learning [J]. J Shanghai Jiaotong Univ Sci, 2025, 30(3): 555-565. |
[3] | Duan Jizhong, Xu Yuhán, Huang Huan. Fast Parallel Magnetic Resonance Imaging Reconstruction Based on Sparsifying Transform Learning and Structured Low-Rank Model [J]. J Shanghai Jiaotong Univ Sci, 2025, 30(3): 499-509. |
[4] | MA Xiaolong, XU Xinpeng, REN Shulei, LI Chen, CUI Shan. Architecture Design of Guidance Head Signal Processing Module Based on GP-GPU Technology Application [J]. Air & Space Defense, 2025, 8(2): 84-92. |
[5] | GUAN Yanmin, YANG Caihong, KANG Zhuang, ZHOU Li. Application of an Improved GPU Acceleration Strategy for the Smoothed Particle Hydrodynamics Method [J]. Journal of Shanghai Jiao Tong University, 2023, 57(8): 981-987. |
[6] | ZHANG Zelong, ZHANG Yingchao, WU Bo, DONG Wei, FAN Youben. Real-Time Laser Speckle Imaging of Blood Flow with High Gray Level and High Resolution [J]. Journal of Shanghai Jiao Tong University, 2023, 57(5): 552-559. |
[7] | DUAN Jizhong, QIAN Qingqing. Fast Parallel Imaging Reconstruction Method Based on SIDWT and Iterative Self-Consistency [J]. Journal of Shanghai Jiao Tong University, 2023, 57(5): 582-592. |
[8] | LI Dingjia1,2,3,4(黎定佳),WANG Chongang1,2,3(王重阳),GUO Wei5(郭伟),WANG Zhidong6(王志东),ZHANG Zhongtao5(张忠涛),LIU Hao1,2,3*(刘浩). Shape Sensing for Single-Port Continuum Surgical Robot Using Few Multicore Fiber Bragg Grating Sensors [J]. J Shanghai Jiaotong Univ Sci, 2023, 28(3): 312-322. |
[9] | YING Hongwei, YAO Yan, WANG Kuihua, ZHANG Changju. Observed Environment Response Caused by Construction of Double-Line Parallel Pipe Jacking Crossing over Metro Shield Tunnels [J]. Journal of Shanghai Jiao Tong University, 2023, 57(12): 1639-1647. |
[10] | ZHANG Sujun, YANG Wenqiang, GU Xingsheng. An Improved Multi-Swarm Migrating Birds Optimization Algorithm for Hybrid Flow Shop Scheduling [J]. Journal of Shanghai Jiao Tong University, 2023, 57(10): 1378-1388. |
[11] | ZHAO Yilin, YAN Li, LAI Xiaoyi, MA Luchuang, HOU Linxiao, YE Zhexiao. Numerical Study on Thermal Environment of a New Generation of Four-Parallel Rocket Engine Jet [J]. Air & Space Defense, 2023, 6(1): 109-116. |
[12] | WU Guanlun, SHI Guanglin. Design and Realization of Continuum Manipulator Based on Coupling of Double Parallel Mechanism [J]. Journal of Shanghai Jiao Tong University, 2022, 56(6): 809-817. |
[13] | FENG Xin, FU Zhuang, WANG Kejin, HAO Gaofeng. Design of Slip Ring Based on SSP Compensation and Variable Frequency Control [J]. Journal of Shanghai Jiao Tong University, 2021, 55(7): 814-825. |
[14] | XU Xianyang,CHEN Lu. Parallel Machine Scheduling Problem Considering Machine Reliability and Energy Consumption [J]. Journal of Shanghai Jiaotong University, 2020, 54(3): 247-255. |
[15] | FU Chao, FAN Jiacheng, WANG Shigang, LIANG Qinghua. Modelling of Spatial Pose of Ortho-SUV Frame and Mathematical Solution [J]. Journal of Shanghai Jiaotong University, 2020, 54(10): 1007-1014. |
Viewed | ||||||||||||||||||||||||||||||||||||||||||||||||||
Full text 202
|
|
|||||||||||||||||||||||||||||||||||||||||||||||||
Abstract 670
|
|
|||||||||||||||||||||||||||||||||||||||||||||||||