A Line-Drawing-Guided and Transformer-Enhanced U-Net Method

Expand
  • 1. Key Laboratory of Linguistic and Cultural Computing of the Ministry of Education, Northwest Minzu University, Lanzhou 730000, China;2. Institute of Mathematics and Computer Science, Northwest Minzu University, Lanzhou 730000, China;3. School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China

Online published: 2025-11-14

Abstract

This paper introduces a line-drawing-guided Transformer-enhanced U-Net (LDG-TEUN) for the digital restoration of Dunhuang murals. A cross-attention module that integrates axial attention with two-dimensional positional encoding is embedded in the encoder to capture global structures and long-range dependencies, thereby alleviating the structural loss caused by large-scale damage. A dual-domain partial convolution (DPConv) unit is then designed to jointly model spatial- and frequency-domain features, enhancing the reconstruction of complex textures and fine edges while addressing challenges in detail recovery. Finally, a composite loss function is formulated to enforce structural consistency, texture fidelity, and color distribution simultaneously, which improves overall restoration quality and, in particular, enables more authentic color reconstruction. Experimental results demonstrate that the proposed method outperforms state-of-the-art approaches in both structural coherence and color restoration, confirming its effectiveness and practical value for the digital conservation of Dunhuang murals.

Cite this article

LI Qiaoqiao1, TAN Qiulin1, LIU Hongcai1, WANG Weilan2, ZHAO Dongdong3 .  A Line-Drawing-Guided and Transformer-Enhanced U-Net Method[J]. Journal of Shanghai Jiaotong University, 0 : 1 . DOI: 10.16183/j.cnki.jsjtu.2025.164

Outlines

/