Convergent Temporal-Difference Learning with Arbitrary Differentiable Function Approximator

Maei, Hamid Reza and Szepesvári, Csaba and Bhatnagar, Shalabh and Silver, David and Precup, Doina and Sutton, Richard (2010) Convergent Temporal-Difference Learning with Arbitrary Differentiable Function Approximator. In: Neural Information Processing Systems (NIPS-22).

Full text not available from this repository.
Item Type: Conference or Workshop Item (Paper)
Subjects: Q Science > QA Mathematics and Computer Science > QA75 Electronic computers. Computer science / computing, computer science
Depositing User: EPrints Admin
Date Deposited: 11 Dec 2012 16:04
Last Modified: 11 Dec 2012 16:04
URI: http://eprints.sztaki.hu/id/eprint/5837
