- Wikipedia, Creative Commons ShareAlike License
- Watkins, C.J.C.H. (1989), Learning from Delayed Rewards (Ph.D. thesis), Cambridge University
- Rummery, G.A.; Niranjan, M. (1994), On-Line Q-Learning Using Connectionist Systems
- Wiering, Marco; Schmidhuber, Jürgen (1998-10-01), Fast Online Q(λ). Machine Learning. 33 (1): 105-115
- Copyright (c) 2009-2017, Accord.NET Authors
- Kenan Deen, https://kenandeen.wordpress.com/