Logotipo ImpactU
Autor

Hybrid algorithm based on reinforcement learning for smart inventory management

Acceso Abierto

Abstract:

Abstract This article proposes a hybrid algorithm based on reinforcement learning and the inventory management methodology called DDMRP (Demand Driven Material Requirement Planning) to determine the optimal time to buy a certain product, and how much quantity should be requested. For this, the inventory management problem is formulated as a Markov Decision Process where the environment with which the system interacts is designed from the concepts raised in the DDMRP methodology, and through the reinforcement learning algorithm—specifically, Q-Learning. The optimal policy is determined for making decisions about when and how much to buy. To determine the optimal policy, three approaches are proposed for the reward function: the first one is based on inventory levels; the second is an optimization function based on the distance of the inventory to its optimal level, and the third is a shaping function based on levels and distances to the optimal inventory. The results show that the proposed algorithm has promising results in scenarios with different characteristics, performing adequately in difficult case studies, with a diversity of situations such as scenarios with discontinuous or continuous demand, seasonal and non-seasonal behavior, and with high demand peaks, among others.

Tópico:

Supply Chain and Inventory Management

Citaciones:

Citations: 23
23

Citaciones por año:

Altmétricas:

Paperbuzz Score: 0
0

Información de la Fuente:

SCImago Journal & Country Rank
FuenteJournal of Intelligent Manufacturing
Cuartil año de publicaciónNo disponible
Volumen34
Issue1
Páginas123 - 149
pISSNNo disponible
ISSN0956-5515

Enlaces e Identificadores:

Artículo de revista