多模态特征融合的船舶CAD模型检索技术
网络出版日期: 2025-10-07
Multimodal Feature Fusion for Ship CAD Model Retrieval Technology
Online published: 2025-10-07
刘子昂, 吕超凡, 张丹, 鲍劲松 . 多模态特征融合的船舶CAD模型检索技术[J]. 上海交通大学学报, 0 : 1 . DOI: 10.16183/j.cnki.jsjtu.2025.146
This paper This paper proposes a multimodal feature fusion method for ship CAD (Computer-Aided Design) model retrieval, aiming at solving the problem of limited single-modal expression ability in traditional ship CAD model retrieval - 3D geometric features are difficult to capture semantic information, text descriptions are unable to express the precise Geometric structure and image features are significantly affected by changes in viewing angle and illumination. The method maps reference images to pseudo-word tokens by referring to the Context-I2W network, and fuses BOM (Bill of Materials) information and mesh geometric features of CAD models; and designs a multimodal feature fusion framework based on the WR (Weighted Residual) matrix to align image, text and 3D geometric features in the semantic space; Constructing a combinatorial query mechanism for matching retrieval by calculating the similarity between combinatorial embeddings and candidate model features. The experiments are validated on a dataset containing 204 ship multimodal samples, and the results show that the method achieves an average mAP (Mean Average Precision) of 83.5% on the retrieval task of three types of typical components, namely, hull structure, outfitting parts, and piping arrangement, which is a 16.7% enhancement over the existing zero-sample methods, and the area under the ROC (Receiver Operating Characteristic) curve area under the curve reaches 0.818, which achieves excellent retrieval performance without labeling data.
/
| 〈 |
|
〉 |