dc.contributor.authorJiang, Zhijin
dc.description.abstractThe primary goal for this research is to obtain the optimal or near-optimal joint production and maintenance scheduling policy by means of reinforcement learning. In this research, we adopted reinforcement algorithm to control the feeding interval and the maintenance state of upstream station in production system. With the help of this algorithm, the work-in-process(WIP) in the production system can be limited to a reasonable level and machines are preventively maintained to be functional. By balancing the reward and cost from WIP, maintenance and the idle loss of bottleneck machine, the reinforcement learning algorithm is able to find the acceptable policy for adjusting the feeding rate and scheduling the preventive maintenance for upstream machine. However reinforcement learning involves in a lot of parameters and in practice parameters may range widely from cases to cases. There are totally five experiments performed in this research, the first and the second is the validation experiments and the third and forth is to explain the property of the algorithm. the fifth experiment describes how fast the algorithm can learn to achieve the target state of upstream station. The developed model consists of reinforcement learning based, decision-making agents with simulation model of the integrated production system. The smart agent determine the optimal or near-optimal action for each system state by interacting with their environment.en_US
dc.format.extent74 p.en_US
dc.subjectDRNTU::Engineering::Systems engineeringen_US
dc.titleApplication of reinforcement learning to production systemen_US
dc.contributor.supervisorRajesh Piplani (MAE)en_US
dc.contributor.schoolSchool of Mechanical and Aerospace Engineeringen_US
dc.description.degreeMaster of Science (Supply Chain & Logistics)en_US

Files in this item


This item appears in the following Collection(s)

Show simple item record