References

  • Wikipedia, Creative Commons ShareAlike License
  • Watkins, C.J.C.H. (1989), Learning from Delayed Rewards (Ph.D. thesis), Cambridge University
  • Rummery, G.A.; Niranjan, M. (1994), On-line Q-Learning using Connectionist Systems (Technical Report), Cambridge University Engineering Department
  • Wiering, Marco; Schmidhuber, Jürgen (1998), Fast Online Q(λ), Machine Learning, 33 (1): 105-115
  • Accord.NET Framework, Copyright (c) 2009-2017, Accord.NET Authors
  • Kenan Deen, https://kenandeen.wordpress.com/