Deep Auto-Encoder Neural Networks in Reinforcement Learning Sascha Lange and Martin Riedmiller Computer Science Department, Albert-Ludwigs University of Freiburg, D-79194 Freiburg, Germany (IJCNN 2010) 2013/02/15 M1 金子 貴輝
References
• M. Riedmiller and H. Braun, “A Direct Adaptive Method for Faster Backpropagation Learning: The RPROP Algorithm,” in Proc. of the ICNN, 1993, pp. 586–591.
• D. Ernst, P. Geurts, and L. Wehenkel, “Tree-Based Batch Mode Reinforcement Learning,” Journal of Machine Learning Research, vol. 6, no. 1, pp. 503–556, 2006.