Items where Author is "Szepesvári, Csaba"

Number of items: 45.
Date | Author/Title | Document Type
2012-03 Gelly, Sylvain and Kocsis, Levente and Schoenauer, Marc and Sebag, Michèle and Silver, David and Szepesvári, Csaba and Teytaud, Olivier
The grand challenge of computer Go: Monte Carlo Tree Search and Extensions
ISI Article
2010-05-13 Torma, Péter and György, András and Szepesvári, Csaba
A Markov-chain Monte Carlo approach to simultaneous localization and mapping
Conference or Workshop Item
2010 Antos, András and Grover, Varun and Szepesvári, Csaba
Active learning in heteroscedastic noise
ISI Article
2010 Li, L. and Póczos, B. and Szepesvári, Csaba
Budgeted distribution learning of belief net parameters
Conference or Workshop Item
2010 Maei, Hamid Reza and Szepesvári, Csaba and Bhatnagar, Shalabh and Silver, David and Precup, Doina and Sutton, Richard
Convergent Temporal-Difference Learning with Arbitrary Differentiable Function Approximator
Conference or Workshop Item
2010 Farahmand, A. M. and Munos, R. and Szepesvári, Csaba
Error propagation for approximate policy and value iteration
Conference or Workshop Item
2010 Pál, D. and Póczos, B. and Szepesvári, Csaba
Estimation of Rényi entropy and mutual information based on generalized nearest-neighbor graphs
Conference or Workshop Item
2010 Yu, Yaoliang and Li, Yuxi and Szepesvári, Csaba and Schuurmans, Dale
A General Projection Property for Distribution Families
Conference or Workshop Item
2010 Szita, I. and Szepesvári, Csaba
Model-based reinforcement learning with nearly tight exploration complexity bounds
Conference or Workshop Item
2010 Bartók, Gábor and Szepesvári, Csaba and Zilles, S.
Models of active learning in group-structured state spaces
ISI Article
2010 Neu, Gergely and György, András and Szepesvári, Csaba and Antos, András
Online Markov decision processes under bandit feedback
Conference or Workshop Item
2010 Maei, H. and Szepesvári, Csaba and Bhatnagar, S. and Sutton, R. S.
Toward off-policy learning control with function approximation
Conference or Workshop Item
2010 Neu, Gergely and György, András and Szepesvári, Csaba
The online loop-free stochastic shortest-path problem
Conference or Workshop Item
2009 Audibert, Jean-Yves and Munos, Remi and Szepesvári, Csaba
Exploration-exploitation tradeoff using variance estimates in multi-armed bandits
ISI Article
2009 Yao, Hengshuai and Bhatnagar, Shalabh and Szepesvári, Csaba
LMS-2: Towards an algorithm that is as cheap as LMS and almost as efficient as RLS
Conference or Workshop Item
2009 Farhangfar, Alireza and Greiner, Russ and Szepesvári, Csaba
Learning to segment from a few well-selected training images
Conference or Workshop Item
2009 Póczos, Barnabás and Abbasi-Yadkori, Yasin and Szepesvári, Csaba and Greiner, Russ and Sturtevant, Nathan
Learning when to stop thinking and do something!
Conference or Workshop Item
2009 Farahmand, Amir massoud and Shademan, Azad and Jägersand, Martin and Szepesvári, Csaba
Model-based and model-free reinforcement learning for visual servoing
Conference or Workshop Item
2009 Neu, Gergely and Szepesvári, Csaba
Training parsers by inverse reinforcement learning
ISI Article
2008 Antos, András and Grover, Varun and Szepesvári, Csaba
Active learning in multi-armed bandits
Conference or Workshop Item
2008 Bartók, Gábor and Szepesvári, Csaba and Zilles, Sandra
Active learning of group-structured environments
Conference or Workshop Item
2008 Sutton, Richard S. and Szepesvári, Csaba and Geramifard, Alborz and Bowling, Michael H.
Dyna-style planning with linear function approximation and prioritized sweeping
Conference or Workshop Item
2008 Mnih, Volodymyr and Szepesvári, Csaba and Audibert, Jean-Yves
Empirical Bernstein stopping
Conference or Workshop Item
2008 Munos, Remi and Szepesvári, Csaba
Finite-time bounds for fitted value iteration
ISI Article
2008 Antos, András and Szepesvári, Csaba and Munos, Rémi
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
ISI Article
2008 Farahmand, Amir massoud and Ghavamzadeh, Mohammad and Szepesvári, Csaba and Mannor, Shie
Regularized fitted Q-iteration: application to planning
Conference or Workshop Item
2008 Isaza, Alejandro and Szepesvári, Csaba and Bulitko, Vadim and Greiner, Russell
Speeding up planning in Markov decision processes via automatically constructed abstractions
Conference or Workshop Item
2007 György, András and Kocsis, Levente and Szabó, Ivett and Szepesvári, Csaba
Continuous time associative bandit problems
Conference or Workshop Item
2007 Antos, András and Munos, Rémi and Szepesvári, Csaba
Fitted Q-iteration in continuous action-space MDPs
Conference or Workshop Item
2007 Antos, András and Szepesvári, Csaba and Munos, Rémi
Value-iteration based fitted policy iteration: learning with a single trajectory
Conference or Workshop Item
2006 Kocsis, Levente and Szepesvári, Csaba
Bandit based Monte-Carlo planning
ISI Article
2006 Antos, András and Szepesvári, Csaba and Munos, R.
Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path
ISI Article
2006 Torma, P. and Szepesvári, Csaba
Local importance sampling: a novel technique to enhance particle filtering
Article
2006 Kocsis, Levente and Szepesvári, Csaba and Winands, MHM
RSPSA: enhanced parameter optimisation in games
ISI Article
2006 Kocsis, Levente and Szepesvári, Csaba
Universal parameter Optimisation in games based on SPSA
ISI Article
2005 Szepesvári, Csaba and Munos, R.
Finite time bounds for sampling based fitted value iteration
Conference or Workshop Item
2005 Gerencsér, László and Rásonyi, Miklós and Szepesvári, Csaba and Vágó, Zs
Log-optimal currency portfolios and control Lyapunov exponents
Conference or Workshop Item
2005 Torma, P. and Szepesvári, Csaba
On using likelihood-adjusted proposals in particle filtering: local importance sampling
Conference or Workshop Item
2004 Torma, P. and Szepesvári, Csaba
Enhancing particle filters using local likelihood sampling
Article
2004 Szepesvári, Csaba and Smart, WD
Interpolation-based Q-learning
Conference or Workshop Item
2004 Szepesvári, Csaba and Kocsor, A. and Kovács, K.
Kernel machine based feature extraction algorithms for regression problems
Conference or Workshop Item
2004 Kocsor, A. and Kovács, K. and Szepesvári, Csaba
Margin maximizing discriminant analysis
Article
2004 Szepesvári, Csaba
Shortest path discovery problems: a framework, algorithms and experimental results
Conference or Workshop Item
2003 French, M. and Szepesvári, Csaba and Rogers, E.
Performance of nonlinear approximate adaptive controllers
Book
2003 Torma, P. and Szepesvári, Csaba
Sequential importance sampling for visual tracking reconsidered
Conference or Workshop Item
This list was generated on Thu Dec 5 03:18:32 2024 CET.