电子信息与电气工程

医疗PACS影像系统的数据存储性能优化

  • 尤丽珏 ,
  • 焦圣品 ,
  • 李小勇
展开
  • 1.复旦大学附属华东医院 计算机中心,上海 200040
    2.上海霄云信息科技有限公司,上海 200240
    3.上海交通大学 电子信息与电气工程学院,上海 200240
尤丽珏(1975—),高级工程师,从事医院数字化研究.
李小勇,副教授;E-mail:xiaoyongli@sjtu.edu.cn.

收稿日期: 2024-03-21

  修回日期: 2024-06-07

  录用日期: 2024-06-28

  网络出版日期: 2024-11-01

Optimization of Data Storage Performance in Medical PACS Imaging System

  • YOU Lijue ,
  • JIAO Shengpin ,
  • LI Xiaoyong
Expand
  • 1. Huadong Hospital Affiliated to Fudan University, Shanghai 200040, China
    2. Shanghai Xiaoyun Info Tech Co., Ltd., Shanghai 200240, China
    3. School of Electronic andElectrical Engineering, Shanghai Jiao Tong University, Shanghai 200240, China

Received date: 2024-03-21

  Revised date: 2024-06-07

  Accepted date: 2024-06-28

  Online published: 2024-11-01

摘要

医疗影像归档与通信系统(PACS)影像数据是典型的海量小文件场景,其面临的挑战主要为高效的海量元数据管理和有效解决碎片化导致的性能下降.通过分析医疗PACS影像存储系统路径的全IO(input/output)各个关键环节,从PACS软件调阅算法、存储协议网关高并发设计、小文件聚合以及数据存储服务并发模型4个维度进行优化设计,实现对PACS调阅性能的大幅提升.实际测试表明,优化设计后PACS影像系统的调阅性能可达到每秒300幅图像,为传统存储调阅速度的3倍以上,有效解决了PACS影像数据调阅卡顿的问题.

本文引用格式

尤丽珏 , 焦圣品 , 李小勇 . 医疗PACS影像系统的数据存储性能优化[J]. 上海交通大学学报, 2024 , 58(10) : 1629 -1636 . DOI: 10.16183/j.cnki.jsjtu.2024.099

Abstract

Medical picture archiving and communication system (PACS) is a typical application scenario with massive small files, which faces two challenges in data storage, i.e., efficient metadata management and effective performance reduction caused by fragmentization. By analyzing various key components of the full IO (input/output) path in the medical PACS imaging system, this paper optimizes the design to achieve a significant improvement in the retrieval performance of the PACS from four dimensions, PACS software retrieval algorithm, storage protocol gateway high-concurrency design, small file aggregation, and data storage service concurrency model. The results of test show that the retrieval performance after optimizion can reach 300 images per second, which is more than three times that of traditional storage, and resolves effectively the performance problem of PACS image data retrieval.

参考文献

[1] ALDOSARI H, SADDIK B, KADI A K. Impact of picture archiving and communication system (PACS) on radiology staff[J]. Informatics in Medicine Unlocked, 2018, 10: 1-16.
[2] HAAK D, PAGE C E, REINARTZ S, et al. DICOM for clinical research: PACS-integrated electronic data capture in multi-center trials[J]. Journal of Digital Imaging, 2015, 28(5): 558-566.
[3] SHVACHKO K, KUANG H R, RADIA S, et al. The hadoop distributed file system[C]// IEEE 26th Symposium on Mass Storage Systems and Technologies. Incline Village, USA: IEEE, 2010: 1-10.
[4] 曾梦, 邹北骥, 张文生, 等. 多模态医疗数据中海量小文件存储优化方法[J]. 软件学报, 2023, 34(3): 1451-1469.
  ZENG Meng, ZOU Beiji, ZHANG Wensheng, et al. Optimization method for storing massive small files in multi-modal medical data[J]. Journal of Software, 2023, 34(3): 1451-1469.
[5] VORA M N. Hadoop-HBase for large-scale data[C]// Proceedings of 2011 International Conference on Computer Science and Network Technology. Harbin, China: IEEE, 2011: 601-605.
[6] 郑通, 郭卫斌, 范贵生. HDFS中海量小文件合并与预取优化方法的研究[J]. 计算机科学, 2017, 44(11A): 516-519.
  ZHENG Tong, GUO Weibin, FAN Guisheng. Research on optimization method of merging and prefetching for massive small files in HDFS[J]. Computer Science, 2017, 44(11A): 516-519.
[7] BENDE S, SHEDGE R. Dealing with small files problem in Hadoop distributed file system[J]. Procedia Computer Science, 2016, 79: 1001-1012.
[8] PATEL A, MEHTA M A. A novel approach for efficient handling of small files in HDFS[C]// IEEE International Advance Computing Conference. Banglore, India: IEEE, 2015: 1258-1262.
[9] AGGARWAL R, VERMA J, SIWACH M. Small files’ problem in Hadoop: A systematic literature review[J]. Journal of King Saud University-Computer and Information Sciences, 2022, 34(10): 8658-8674.
[10] TAO W J, ZHAI Y L, TCHAYE-KONDI J. LHF: A new archive based approach to accelerate massive small files access performance in HDFS[C]// IEEE Fifth International Conference on Big Data Computing Service and Applications. Newark, USA: IEEE, 2019: 40-48.
[11] GAO Z P, QIN Y H, NIU K. An effective merge strategy based hierarchy for improving small file problem on HDFS[C]// 4th International Conference on Cloud Computing and Intelligence Systems. Beijing, China: IEEE, 2016: 327-331.
[12] 陈法河, 柴小丽. 基于Ceph存储系统的小文件存储优化方案[J]. 计算机系统应用, 2022, 31(2): 108-113.
  CHEN Fahe, CHAI Xiaoli. Optimization of small file storage algorithm based on Ceph storage system[J]. Computer Systems & Applications, 2022, 31(2): 108-113.
[13] SHI A W, TIAN Z C, CHEN G, et al. Research and optimization of massive small file processing performance based on Ceph[C]// International Conference on Electronic Information Engineering and Data Processing (EIEDP 2023). Nanchang, China: SPIE, 2023: 311-315.
[14] BEAVER D, KUMAR S, LI H C, et al. Finding a needle in Haystack: Facebook’s photo storage[C]// Proceedings of the 9th USENIX Symposium on Operating Systems Design and Implementation. Vancouver, BC, Canada: USENIX Association, 2010: 47-60.
[15] FU S L, HE L G, HUANG C L, et al. Performance optimization for managing massive numbers of small files in distributed file systems[J]. IEEE Transactions on Parallel and Distributed Systems, 2015, 26(12): 3433-3448.
文章导航

/