收稿日期: 2024-03-21
修回日期: 2024-06-07
录用日期: 2024-06-28
网络出版日期: 2024-11-01
Optimization of Data Storage Performance in Medical PACS Imaging System
Received date: 2024-03-21
Revised date: 2024-06-07
Accepted date: 2024-06-28
Online published: 2024-11-01
尤丽珏 , 焦圣品 , 李小勇 . 医疗PACS影像系统的数据存储性能优化[J]. 上海交通大学学报, 2024 , 58(10) : 1629 -1636 . DOI: 10.16183/j.cnki.jsjtu.2024.099
Medical picture archiving and communication system (PACS) is a typical application scenario with massive small files, which faces two challenges in data storage, i.e., efficient metadata management and effective performance reduction caused by fragmentization. By analyzing various key components of the full IO (input/output) path in the medical PACS imaging system, this paper optimizes the design to achieve a significant improvement in the retrieval performance of the PACS from four dimensions, PACS software retrieval algorithm, storage protocol gateway high-concurrency design, small file aggregation, and data storage service concurrency model. The results of test show that the retrieval performance after optimizion can reach 300 images per second, which is more than three times that of traditional storage, and resolves effectively the performance problem of PACS image data retrieval.
[1] | ALDOSARI H, SADDIK B, KADI A K. Impact of picture archiving and communication system (PACS) on radiology staff[J]. Informatics in Medicine Unlocked, 2018, 10: 1-16. |
[2] | HAAK D, PAGE C E, REINARTZ S, et al. DICOM for clinical research: PACS-integrated electronic data capture in multi-center trials[J]. Journal of Digital Imaging, 2015, 28(5): 558-566. |
[3] | SHVACHKO K, KUANG H R, RADIA S, et al. The hadoop distributed file system[C]// IEEE 26th Symposium on Mass Storage Systems and Technologies. Incline Village, USA: IEEE, 2010: 1-10. |
[4] | 曾梦, 邹北骥, 张文生, 等. 多模态医疗数据中海量小文件存储优化方法[J]. 软件学报, 2023, 34(3): 1451-1469. |
ZENG Meng, ZOU Beiji, ZHANG Wensheng, et al. Optimization method for storing massive small files in multi-modal medical data[J]. Journal of Software, 2023, 34(3): 1451-1469. | |
[5] | VORA M N. Hadoop-HBase for large-scale data[C]// Proceedings of 2011 International Conference on Computer Science and Network Technology. Harbin, China: IEEE, 2011: 601-605. |
[6] | 郑通, 郭卫斌, 范贵生. HDFS中海量小文件合并与预取优化方法的研究[J]. 计算机科学, 2017, 44(11A): 516-519. |
ZHENG Tong, GUO Weibin, FAN Guisheng. Research on optimization method of merging and prefetching for massive small files in HDFS[J]. Computer Science, 2017, 44(11A): 516-519. | |
[7] | BENDE S, SHEDGE R. Dealing with small files problem in Hadoop distributed file system[J]. Procedia Computer Science, 2016, 79: 1001-1012. |
[8] | PATEL A, MEHTA M A. A novel approach for efficient handling of small files in HDFS[C]// IEEE International Advance Computing Conference. Banglore, India: IEEE, 2015: 1258-1262. |
[9] | AGGARWAL R, VERMA J, SIWACH M. Small files’ problem in Hadoop: A systematic literature review[J]. Journal of King Saud University-Computer and Information Sciences, 2022, 34(10): 8658-8674. |
[10] | TAO W J, ZHAI Y L, TCHAYE-KONDI J. LHF: A new archive based approach to accelerate massive small files access performance in HDFS[C]// IEEE Fifth International Conference on Big Data Computing Service and Applications. Newark, USA: IEEE, 2019: 40-48. |
[11] | GAO Z P, QIN Y H, NIU K. An effective merge strategy based hierarchy for improving small file problem on HDFS[C]// 4th International Conference on Cloud Computing and Intelligence Systems. Beijing, China: IEEE, 2016: 327-331. |
[12] | 陈法河, 柴小丽. 基于Ceph存储系统的小文件存储优化方案[J]. 计算机系统应用, 2022, 31(2): 108-113. |
CHEN Fahe, CHAI Xiaoli. Optimization of small file storage algorithm based on Ceph storage system[J]. Computer Systems & Applications, 2022, 31(2): 108-113. | |
[13] | SHI A W, TIAN Z C, CHEN G, et al. Research and optimization of massive small file processing performance based on Ceph[C]// International Conference on Electronic Information Engineering and Data Processing (EIEDP 2023). Nanchang, China: SPIE, 2023: 311-315. |
[14] | BEAVER D, KUMAR S, LI H C, et al. Finding a needle in Haystack: Facebook’s photo storage[C]// Proceedings of the 9th USENIX Symposium on Operating Systems Design and Implementation. Vancouver, BC, Canada: USENIX Association, 2010: 47-60. |
[15] | FU S L, HE L G, HUANG C L, et al. Performance optimization for managing massive numbers of small files in distributed file systems[J]. IEEE Transactions on Parallel and Distributed Systems, 2015, 26(12): 3433-3448. |
/
〈 |
|
〉 |