Maximizing the Probability of Arriving on Time: A Practical Q-Learning Method

Z. Cao; H. Guo; J. Zhang; F. Oliehoek; U. Fastenrath

Authors	Z. Cao, H. Guo, J. Zhang, F. Oliehoek, U. Fastenrath
Publication date	2017
Book title	Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, Twenty-Ninth Innovative Applications of Artificial Intelligence Conference, Seventh Symposium on Educational Advances in Artificial Intelligence
Book subtitle	4-9 February 2017, San Francisco, California, USA
ISBN	9781577357858
Event	Thirty-First AAAI Conference on Artificial Intelligence
Volume | Issue number	6
Pages (from-to)	4481-4487
Publisher	Palo Alto, California: AAAI Press
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	The stochastic shortest path problem is of crucial importance for the development of sustainable transportation systems. Existing methods based on the probability tail model seek the path that maximizes the probability of arriving at the destination before a deadline. However, they suffer from low accuracy and/or high computational cost. We design a novel Q-learning method in which the converged Q-values have a practical meaning, namely the actual probabilities of arriving on time, which improves accuracy. By further adopting dynamic neural networks to learn the value function, our method scales well to large road networks with arbitrary deadlines. Experimental results on real road networks demonstrate the significant advantages of our method over its counterparts.
Document type	Conference contribution
Language	English
Published at	https://ojs.aaai.org/index.php/AAAI/article/view/11170 (Final published version)