上海交通大学学报 ›› 2022, Vol. 56 ›› Issue (1): 89-100.doi: 10.16183/j.cnki.jsjtu.2020.186
所属专题: 《上海交通大学学报》2022年“生物医学工程”专题
郑德重1,2, 杨媛媛1, 黄浩哲3, 谢哲1,2, 李文涛3(
)
收稿日期:2020-06-18
出版日期:2022-01-28
发布日期:2022-01-21
通讯作者:
李文涛
E-mail:liwentao98@126.com
作者简介:郑德重(1990-),男,湖北省武汉市人,博士生,主要研究方向为机器学习、深度学习在医学影像方面的应用.
基金资助:
ZHENG Dezhong1,2, YANG Yuanyuan1, HUANG Haozhe3, XIE Zhe1,2, LI Wentao3(
)
Received:2020-06-18
Online:2022-01-28
Published:2022-01-21
Contact:
LI Wentao
E-mail:liwentao98@126.com
摘要:
使用多模态数据建模可以有效地克服单一模态信息量不足的问题,大大提高模型的性能.但在量化神经网络模型置信度,尤其是对于多模态融合模型方面并没有很多进展.基于此,提出一种基于嵌入的方法,在嵌入空间中通过计算样本间的距离进行局部密度估计,进而计算模型的置信度分数.该方法具备可扩展性,不仅可以用于单一模态模型,还可以用于多模态融合模型置信度的度量.此外,所提方法还可以用来评估和量化不同模态数据对于多模态融合模型的影响程度.
中图分类号:
郑德重, 杨媛媛, 黄浩哲, 谢哲, 李文涛. 基于距离置信度分数的多模态融合分类网络[J]. 上海交通大学学报, 2022, 56(1): 89-100.
ZHENG Dezhong, YANG Yuanyuan, HUANG Haozhe, XIE Zhe, LI Wentao. Multimodal Fusion Classification Network Based on Distance Confidence Score[J]. Journal of Shanghai Jiao Tong University, 2022, 56(1): 89-100.
| [1] | MANDELBAUM A, WEINSHALL D. Distance-based confidence score for neural network classifiers[EB/OL]. (2017-09-02) [2020-04-16]. https://www.researchgate.net/publication/320097900_Distance-based_Confidence_Score_for_Neural_Network_Classifiers. |
| [2] | SHAKED A, WOLF L. Improved stereo matching with constant highway networks and reflective confidence learning[C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Honolulu, HI, USA, 2017: 6901-6910. |
| [3] |
ZHANG Y B, ZHANG Z F, MIAO D Q, et al. Three-way enhanced convolutional neural networks for sentence-level sentiment classification[J]. Information Sciences, 2019, 477:55-64.
doi: 10.1016/j.ins.2018.10.030 URL |
| [4] |
DEL-AGUA M Á, GIMÉNEZ A, SANCHIS A, et al. Speaker-adapted confidence measures for ASR using deep bidirectional recurrent neural networks[J]. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2018, 26(7):1198-1206.
doi: 10.1109/TASLP.2018.2819900 URL |
| [5] | NADEEM U, BENNAMOUN M, SOHEL F, et al. Learning-based confidence estimation for multi-modal classifier fusion[C]// International Conference on Neural Information Processing. Sydney, NSW, Australia: ICONIP, 2019: 299-312. |
| [6] | BERTHON A, HAN B, NIU G, et al. Confidence scores make instance-dependent label-noise learning possible[EB/OL]. (2020-01-12)[2020-04-16]. https://www.researchgate.net/publication/338570353_Confidence_Scores_Make_Instance-dependent_Label-noise_Learning_Possible. |
| [7] | SZEGEDY C, ZAREMBA W, SUTSKEVER I, et al. Intriguing properties of neural networks[EB/OL].(2013-12-12)[2020-04-16]. https://www.researchgate.net/publication/259440613_Intriguing_properties_of_neural_networks. |
| [8] | GAL Y, GHAHRAMANI Z. Dropout as a Bayesian approximation: Representing model uncertainty in deep learning[J]. Proceedings of Machine Learning Research, 2016, 48:1050-1059. |
| [9] | GUO C, PLEISS G, SUN Y, et al. On calibration of modern neural networks[J]. Proceedings of Machine Learning Research, 2017, 70:1321-1330. |
| [10] |
DENG B, JIA S, SHI D M. Deep metric learning-based feature embedding for hyperspectral image classification[J]. IEEE Transactions on Geoscience and Remote Sensing, 2020, 58(2):1422-1435.
doi: 10.1109/TGRS.36 URL |
| [11] | CAO S C, WANG X F, KITANI K M. Learnable embedding space for efficient neural architecture compression[EB/OL].(2019-02-12) [2020-04-16]. https://www.researchgate.net/publication/330845079_Learnable_Embedding_Space_for_Efficient_Neural_Architecture_Compression. |
| [12] | HU H X, ZHOU G T, DENG Z W, et al. Learning structured inference neural networks with label relations[C]// 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Las Vegas, NV, USA: IEEE, 2016: 2960-2968. |
| [13] | OZDEMIR O, WOODWARD B, BERLIN A A. Propagating uncertainty in multi-stage Bayesian convolutional neural networks with application to pulmonary nodule detection[EB/OL].(2019-02-12)[2020-04-16]. https://www.researchgate.net/publication/321511200_Propagating_Uncertainty_in_Multi-Stage_Bayesian_Convolutional_Neural_Networks_with_Application_to_Pulmonary_Nodule_Detection. |
| [14] | NEAL R M. Bayesian learning for neural networks[M]. New York, NY, USA: Springer, 1996. |
| [15] | 徐磊, 李向阳, 黄湘岳. 基于贝叶斯网络的非常规突发事件灾情评估[J]. 上海交通大学学报, 2013, 47(5):846-850. |
| XU Lei, LI Xiangyang, HUANG Xiangyue. Unconventional emergency assessment based on Bayesian network[J]. Journal of Shanghai Jiao Tong University, 2013, 47(5):846-850. | |
| [16] | GAL Y, GHAHRAMANI Z. Dropout as a Bayesian approximation: Representing model uncertainty in deep learning[C]// Proceedings of the 33rd International Conference on International Conference on Machine Learning. New York, NY, USA:SIGCHI, 2016,: 1050-1059. |
| [17] | LAKSHMINARAYANAN B, PRITZEL A, BLUNDELL C. Simple andscalable predictive uncertainty estimation using deep ensembles[EB/OL].(2016-12-12)[2020-04-16]. https://www.researchgate.net/publication/311430201_Simple_and_Scalable_Predictive_Uncertainty_Estimation_using_Deep_Ensembles. |
| [18] | DENG J, DONG W, SOCHER R, et al. ImageNet: A large-scale hierarchical image database[C]// 2009 IEEE Conference on Computer Vision and Pattern Recognition. Miami, FL, USA: IEEE, 2009: 10836047. |
| [19] | NOJAVANASGHARI B, GOPINATH D, KOUSHIK J, et al. Deep multimodal fusion for persuasiveness prediction[C]// Proceedings of the 18th ACM International Conference on Multimodal Interaction. Tokyo, Japan: SIGCHI, 2016: 284-288. |
| [20] |
BALTRUŠAITIS T, AHUJA C, MORENCY L P. Multimodal machine learning: A survey and taxonomy[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2019, 41(2):423-443.
doi: 10.1109/TPAMI.2018.2798607 URL |
| [21] | ZARAGOZA H, BUC D. Confidence measures for neural network classifiers[EB/OL]. [2020-04-16]. https://www.semanticscholar.org/paper/Confidence-Measures-for-Neural-Network-Classifiers-Zaragoza-Buc/f19e6e8a06cba5fc8cf234881419de9193bba9d0. |
| [22] | GHOSH P, DAVIS L S. Understanding center loss based network for image retrieval with few training data[C]// Computer Vision-European Conference on Computer Vision 2018 Workshops. Munich, Germany: Springer, 2018: 717-722. |
| [23] | WEN Y D, ZHANG K P, LI Z F, et al. A discriminative feature learning approach for deep face recognition[C]// Computer Vision-European Conference on Computer Vision 2016 Workshops. Amsterdam, The Netherlands: Springer, 2016: 499-515. |
| [24] | SNELL J, SWERSKY K, ZEMEL R. Prototypical networks for few-shot learning[C]// Proceedings of the 31 st International Conference on Neural Information Processing Systems. Red Hook, NY, USA: SIGCHI, 2017: 4080-4090. |
| [25] | VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[EB/OL].(2017-06-12)[2020-04-16]. https://arxiv.org/abs/1706.03762. |
| [26] | WANG F, JIANG M Q, QIAN C, et al. Residual attention network for image classification[C]// 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Honolulu, HI, USA: IEEE, 2017: 6450-6458. |
| [27] | ZHANG H, GOODFELLOW I, METAXAS D, et al. Self-attention generative adversarial networks[J]. Proceedings of Machine Learning Research, 2019, 97:7354-7363. |
| [28] | RAMACHANDRAN P, PARMAR N, VASWANI A, et al. Stand-alone self-attention in vision models[EB/OL].(2019-06-28)[2020-04-16]. https://www.researchgate.net/publication/333815334_Stand-Alone_Self-Attention_in_Vision_Models. |
| [29] | WOO S, PARK J, LEE J Y, et al. CBAM: Convolutional block attention module[C]// Computer Vision-European Conference on Computer Vision 2018 Workshops. Munich, Germany: Springer, 2018: 3-19. |
| [30] |
WILKS D S. Sampling distributions of the Brier score and Brier skill score under serial dependence[J]. Quarterly Journal of the Royal Meteorological Society, 2010, 136(653):2109-2118.
doi: 10.1002/qj.709 URL |
| [1] | 李诗杰, 刘泰序, 刘佳仑, 董智霖, 徐诚祺. 基于GRU-MPC的双全回转推进拖轮轨迹跟踪控制[J]. 上海交通大学学报, 2026, 60(3): 418-426. |
| [2] | 梁佳铭, 余音, 胡祎乐. 基于时空神经网络的金属疲劳裂纹扩展预测[J]. 上海交通大学学报, 2026, 60(3): 511-521. |
| [3] | . DSNet:用于语音情感识别的带有中性校准的解耦孪生网络[J]. J Shanghai Jiaotong Univ Sci, 2026, 31(2): 248-257. |
| [4] | . 类间隙滞后非线性系统复合双通道干扰估计自适应控制器设计[J]. J Shanghai Jiaotong Univ Sci, 2026, 31(1): 106-116. |
| [5] | 夏筱彦, 张宇, 胡锡坤, 钟平. 基于扩散模型的无人机遥感目标检测物理对抗攻击方法研究[J]. 空天防御, 2026, 9(1): 52-62. |
| [6] | 李湘, 陈思远, 张俊, 柯德平, 高杰迈, 杨欢欢. 基于物理信息嵌入的非固定长度电力系统暂态稳定快速评估[J]. 上海交通大学学报, 2025, 59(7): 962-970. |
| [7] | 王晓倩, 周羽生, 毛源军, 李彬, 周文晴, 苏盛. 基于神经网络分位数的分布式光伏发电功率异常识别方法[J]. 上海交通大学学报, 2025, 59(6): 836-844. |
| [8] | . 基于深度学习序列方法的多人姿态估计用来检测人体与关键点位置[J]. J Shanghai Jiaotong Univ Sci, 2025, 30(6): 1103-1113. |
| [9] | 曾金灿, 何耿生, 李姚旺, 杜尔顺, 张宁, 朱浩骏. 基于卷积神经网络与轻量级梯度提升树组合模型的电力行业短期以电折碳方法[J]. 上海交通大学学报, 2025, 59(6): 746-757. |
| [10] | 潘美琪, 贺兴. 基于零样本学习的风力机故障诊断方法[J]. 上海交通大学学报, 2025, 59(5): 561-568. |
| [11] | 黄逸翔, 窦迅, 李林溪, 杨函煜, 于建成, 霍现旭. 基于全局灵敏度分析的综合能源设备响应价值量化方法[J]. 上海交通大学学报, 2025, 59(5): 569-579. |
| [12] | 周添龙, 姚方競, 饶卫雄. 部分已知状态空间模型下的目标状态估计算法[J]. 空天防御, 2025, 8(3): 111-122. |
| [13] | 江宜航, 骆天溯, 陆营波, 周金鹏, 姚方競. 基于内外场混合数据的仿真模型置信度评估方法研究[J]. 空天防御, 2025, 8(3): 59-65. |
| [14] | . 基于基尼不纯度结构优化物理引导神经网络的薄膜型声学超材料传声损失预测[J]. J Shanghai Jiaotong Univ Sci, 2025, 30(3): 613-624. |
| [15] | . 基于毫米波雷达的智能心率提取方法[J]. J Shanghai Jiaotong Univ Sci, 2025, 30(3): 493-498. |
| 阅读次数 | ||||||
|
全文 |
|
|||||
|
摘要 |
|
|||||