Yaroslav Rosokha
@ Purdue University
Projects
Reinforcement Learning
Performance comparison of TD vs LSTD methods on learning a value function task of example 6.2 in Sutton and Barto (1998).
Performance comparison of TD vs LSTD methods on learning a value function task of example 6.2 in Sutton and Barto (1998).