Intent prediction of pedestrians via motion trajectories using stacked recurrent neural networks

Publication Type:
Journal Article
IEEE Transactions on Intelligent Vehicles, 2018, 3, (4), pp. 414-424
Issue Date:
Full metadata record
The problem of intent understanding between highly and fully automated vehicles and vulnerable road users (VRUs) such as pedestrians in urban traffic environment has got some momentum over the past few years. Previous work has been tackling the problem using two common approaches, namely dynamical motion modeling and motion planning. In this paper, a novel radical end-to-end data-driven approach is proposed for long-term intent prediction of VRUs in urban traffic environment based solely on their motion trajectories. In the proposed approach, we utilized the widely adopted architecture of recurrent neural networks with Long-Short Term Memory (LSTM) modules to form a deep stacked LSTM network. Three common approaches used in the literature were compared against our proposed approach over two different real-world datasets involving pedestrians collected from vehicle-based stereo cameras. The results over the testing datasets showed that the proposed approach achieved higher accuracies over most of the scenarios of the testing datasets with a small mean lateral position error of 0.48 m. Moreover, the proposed approach showed also a significant generalization capability over totally unobserved testing scenes during the training phase with only 0.58 m in mean lateral position error.
Please use this identifier to cite or link to this item: