Maintenance Strategy for Production Lines from a Multi-Cost Perspective Based on Graph Attention Networks and Reinforcement Learning

Expand
  • School of Mechanical Engineering; State Key Laboratory of Mechanical System and Vibration, Shanghai Jiao Tong University, Shanghai 200240, China

Online published: 2026-05-11

Abstract

In multi-operation manufacturing systems, production and maintenance interact, and machine condition is coupled with performance loss. To quantify the effects of maintenance and improve the economic performance of maintenance policies, this study considers the state evolution of machines and proposes a production-line maintenance strategy from a multi-cost perspective. Maintenance timing and level are taken as decision variables, and an integrated cost optimization model is built that includes preventive maintenance cost, downtime cost, delivery gap penalty, and performance loss cost on both product and equipment sides. To handle the high-dimensional and time-dependent action space, maintenance decision units are represented as graph nodes, and a graph-attention-based proximal policy optimization algorithm (GAT-PPO) is developed. Case studies show that the obtained policy can automatically identify high-risk machines and critical periods. Comparative experiments further indicate that the proposed algorithm achieves a favorable trade-off between solution quality and online computation time under different maintenance planning horizons, demonstrating strong potential for engineering applications and extension.

Cite this article

WANG Zichun, CHEN Zhen, ZHAO Yixi, PAN Ershun . Maintenance Strategy for Production Lines from a Multi-Cost Perspective Based on Graph Attention Networks and Reinforcement Learning[J]. Journal of Shanghai Jiaotong University, 0 : 1 . DOI: 10.16183/j.cnki.jsjtu.2025.406

Outlines

/