BIBLIOGRAPHY
[101] Dongbin Zhao, Zhaohui Hu, Zhongpu Xia, Cesare Alippi, Yuanheng Zhu, and Ding
Wang. Full-range adaptive cruise control based on supervised adaptive dynamic programming.
Neurocomputing, 125:57–67, February 2014. DOI: 10.1016/j.neucom.2012.09.034 20
[102] Bin Wang, Dongbin Zhao, Chengdong Li, and Yujie Dai. Design and implementation
of an adaptive cruise control system based on supervised actor-critic learning. 5th International
Conference on Information Science and Technology (ICIST), pages 243–248, 2015.
DOI: 10.1109/icist.2015.7288976 20
[103] Rui Zheng, Chunming Liu, and Qi Guo. A decision-making method for autonomous
vehicles based on simulation and reinforcement learning. International Conference on Machine
Learning and Cybernetics, pages 362–369, 2013. DOI: 10.1109/icmlc.2013.6890495 20
[104] Shai Shalev-Shwartz, Nir Ben-Zrihem, Aviad Cohen, and Amnon Shashua. Long-term
planning by short-term prediction. ArXiv Preprint ArXiv:1602.01580, 2016. 22, 26
[105] Wei Xia, Huiyun Li, and Baopu Li. A control strategy of autonomous vehicles based on
deep reinforcement learning. In Computational Intelligence and Design (ISCID), 9th International
Symposium on, vol. 2, pages 198–201, IEEE, 2016. DOI: 10.1109/iscid.2016.2054
22, 26
[106] Ahmad El Sallab, Mohammed Abdou, Etienne Perot, and Senthil Yogamani.
End-to-end deep reinforcement learning for lane keeping assist. ArXiv Preprint
ArXiv:1612.04340, 2016. 22, 26
[107] Jiakai Zhang and Kyunghyun Cho. Query-efficient imitation learning for end-to-end
autonomous driving. ArXiv Preprint ArXiv:1605.06450, 2016. 22, 26
[108] Stéphane Ross, Geoffrey Gordon, and Drew Bagnell. A reduction of imitation learning
and structured prediction to no-regret online learning. In Proc. of the 14th International
Conference on Artificial Intelligence and Statistics, pages 627–635, 2011. 22
[109] Yunpeng Pan, Ching-An Cheng, Kamil Saigol, Keuntaek Lee, Xinyan Yan, Evangelos
Theodorou, and Byron Boots. Agile autonomous driving using end-to-end deep
imitation learning. Proc. of Robotics: Science and Systems, Pittsburgh, PA, 2018. DOI:
10.15607/rss.2018.xiv.056 23, 26
[110] Dequan Wang, Coline Devin, Qi-Zhi Cai, Fisher Yu, and Trevor Darrell. Deep object
centric policies for autonomous driving. ArXiv Preprint ArXiv:1811.05432, 2018. 23, 26
[111] Horia Porav and Paul Newman. Imminent collision mitigation with reinforcement learning
and vision. In 21st International Conference on Intelligent Transportation Systems (ITSC),
pages 958–964, IEEE, 2018. DOI: 10.1109/itsc.2018.8569222 23, 26