Systems Engineering and Electronics ›› 2022, Vol. 44 ›› Issue (1): 199-208.doi: 10.12305/j.issn.1001-506X.2022.01.25

• Systems Engineering • Previous Articles     Next Articles

Scheduling strategies research based on reinforcement learning for wartime support force

Bin ZENG1, Rui WANG2,*, Houpu LI3, Xu FAN1   

  1. 1. Department of Management and Economics, Naval University of Engineering, Wuhan 430033, China
    2. Teaching and Research Support Center, Naval University of Engineering, Wuhan 430033, China
    3. Department of Navigation Engineering, Naval University of Engineering, Wuhan 430033, China
  • Received:2020-11-28 Online:2022-01-01 Published:2022-01-19
  • Contact: Rui WANG

Abstract:

Intelligent logistics and equipment support is one of the research hotspots in the current military field, it is necessary for the wartime support to be adaptive in the complicated and changeable battlefield. Aiming at this problem, a reinforcement learning model based on Markov decision process (MDP) is proposed, which can adaptively learn the optimal assignment policy and obtain the scheduling scheme according to historical data and prediction based on current situation. A simulation procedure based on probability statistical model is adopted into the model solution to consider the impact of the uncertainty events. Furthermore, the post decision state is used in the design of Bellman iterative equation to decrease the computation complexity brought by the random incidents. Finally, the approximate function based on composition of basis functions is proposed to overcome the problem of dimensionality curse. Simulation experiment shows that the reinforcement learning capability can significantly improve the scheduling performance of support force.

Key words: wartime support, reinforcement learning, uncertainty, optimal scheduling

CLC Number: 

[an error occurred while processing this directive]