TY - JOUR
T1 - Reinforcement Learning-based Non-Autoregressive Solver for Traveling Salesman Problems
AU - Xiao, Yubin
AU - Wang, Di
AU - Li, Boyang
AU - Pang, Wei
AU - Wu, Xuan
AU - Li, Hao
AU - Xu, Dong
AU - Liang, Yanchun
AU - Zhou, You
PY - 2024/10/13
Y1 - 2024/10/13
N2 - The Traveling Salesman Problem (TSP) is a well known combinatorial optimization problem with broad real world applications. Recently, neural networks have gained popularity in this research area because as shown in the literature, they provide strong heuristic solutions to TSPs. Compared to autoregressive neural approaches, non-autoregressive (NAR)networks exploit the inference parallelism to elevate inference speed but suffer from comparatively low solution quality. In this paper, we propose a novel NAR model named NAR4TSP, which incorporates a specially designed architecture and an enhanced reinforcement learning strategy. To the best of our knowledge,NAR4TSP is the first TSP solver that successfully combines RL and NAR networks. The key lies in the incorporation of NAR network output decoding into the training process.NAR4TSP efficiently represents TSP encoded information as rewards and seamlessly integrates it into reinforcement learning strategies, while maintaining consistent TSP sequence constraints during both training and testing phases. Experimental results on both synthetic and real-world TSPs demonstrate that NAR4TSPoutperforms five state-of-the-art models in terms of solution quality, inference speed, and generalization to unseen scenarios.
AB - The Traveling Salesman Problem (TSP) is a well known combinatorial optimization problem with broad real world applications. Recently, neural networks have gained popularity in this research area because as shown in the literature, they provide strong heuristic solutions to TSPs. Compared to autoregressive neural approaches, non-autoregressive (NAR)networks exploit the inference parallelism to elevate inference speed but suffer from comparatively low solution quality. In this paper, we propose a novel NAR model named NAR4TSP, which incorporates a specially designed architecture and an enhanced reinforcement learning strategy. To the best of our knowledge,NAR4TSP is the first TSP solver that successfully combines RL and NAR networks. The key lies in the incorporation of NAR network output decoding into the training process.NAR4TSP efficiently represents TSP encoded information as rewards and seamlessly integrates it into reinforcement learning strategies, while maintaining consistent TSP sequence constraints during both training and testing phases. Experimental results on both synthetic and real-world TSPs demonstrate that NAR4TSPoutperforms five state-of-the-art models in terms of solution quality, inference speed, and generalization to unseen scenarios.
M3 - Article
SN - 2162-237X
JO - IEEE Transactions on Neural Networks and Learning Systems
JF - IEEE Transactions on Neural Networks and Learning Systems
ER -