Journal of Shanghai Jiao Tong University

Previous Articles     Next Articles

 A Line-Drawing-Guided and Transformer-Enhanced U-Net Method

  

  1. 1. Key Laboratory of Linguistic and Cultural Computing of the Ministry of Education, Northwest Minzu University, Lanzhou 730000, China;2. Institute of Mathematics and Computer Science, Northwest Minzu University, Lanzhou 730000, China;3. School of Information Science and Engineering, Lanzhou University, Lanzhou 730000, China

Abstract: This paper introduces a line-drawing-guided Transformer-enhanced U-Net (LDG-TEUN) for the digital restoration of Dunhuang murals. A cross-attention module that integrates axial attention with two-dimensional positional encoding is embedded in the encoder to capture global structures and long-range dependencies, thereby alleviating the structural loss caused by large-scale damage. A dual-domain partial convolution (DPConv) unit is then designed to jointly model spatial- and frequency-domain features, enhancing the reconstruction of complex textures and fine edges while addressing challenges in detail recovery. Finally, a composite loss function is formulated to enforce structural consistency, texture fidelity, and color distribution simultaneously, which improves overall restoration quality and, in particular, enables more authentic color reconstruction. Experimental results demonstrate that the proposed method outperforms state-of-the-art approaches in both structural coherence and color restoration, confirming its effectiveness and practical value for the digital conservation of Dunhuang murals.

Key words: image inpainting, Transformer, U-Net, line-drawing-guided, Dunhuang murals

CLC Number: