面向柔性作业车间动态调度的双系统强化学习方法
刘亚辉, 申兴旺, 顾星海, 彭涛, 鲍劲松, 张丹

A Dual-System Reinforcement Learning Method for Flexible Job Shop Dynamic Scheduling
LIU Yahui, SHEN Xingwang, GU Xinghai, PENG Tao, BAO Jinsong, ZHANG Dan
表4 工序排序决策动作
Tab.4 Decision-making action of process sequencing
符号 描述 量化方式
FIFO 先到先加工优先规则 a t 2 = m i n r k , i ( r k , i )
SPT 工序加工时间最短优先规则 a t 2 = m i n k = 1 K i = 1 n ( e o k , i , j t - s o k , i , j t + r o k , i , j t )
EDD 交货期最早加工优先规则 a t 2 = m i n D P k
SL 松弛时间最短优先规则 a t 2 = m i n D P k - x - j = 1 m B k , i T ( x )
SRPT 剩余加工时间最长优先规则 a t 2 = m a x j = j ' m B k , i T ( j ' )