Items where Author is "Szepesvári, Csaba"
Number of items: 45.
2003
Date | Author/Title | Document Type | |
---|---|---|---|
2003 | French, M. and Szepesvári, Csaba and Rogers, E. Performance of nonlinear approximate adaptive controllers | Book | |
2003 | Torma, P. and Szepesvári, Csaba Sequential importance sampling for visual tracking reconsidered | Conference or Workshop Item |
2004
Date | Author/Title | Document Type | |
---|---|---|---|
2004 | Torma, P. and Szepesvári, Csaba Enhancing particle filters using local likelihood sampling | Article | |
2004 | Szepesvári, Csaba and Smart, WD Interpolation-based Q-learning | Conference or Workshop Item | |
2004 | Szepesvári, Csaba and Kocsor, A. and Kovács, K. Kernel machine based feature extraction algorithms for regression problems | Conference or Workshop Item | |
2004 | Kocsor, A. and Kovács, K. and Szepesvári, Csaba Margin maximizing discriminant analysis | Article | |
2004 | Szepesvári, Csaba Shortest path discovery problems: a framework, algorithms and experimental results | Conference or Workshop Item |
2005
Date | Author/Title | Document Type | |
---|---|---|---|
2005 | Szepesvári, Csaba and Munos, R. Finite time bounds for sampling based fitted value iteration | Conference or Workshop Item | |
2005 | Gerencsér, László and Rásonyi, Miklós and Szepesvári, Csaba and Vágó, Zs Log-optimal currency portfolios and control Lyapunov exponents | Conference or Workshop Item | |
2005 | Torma, P. and Szepesvári, Csaba On using likelihood-adjusted proposals in particle filtering: local importance sampling | Conference or Workshop Item |
2006
Date | Author/Title | Document Type | |
---|---|---|---|
2006 | Kocsis, Levente and Szepesvári, Csaba Bandit based Monte-Carlo planning | ISI Article | |
2006 | Antos, András and Szepesvári, Csaba and Munos, R. Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path | ISI Article | |
2006 | Torma, P. and Szepesvári, Csaba Local importance sampling: a novel technique to enhance particle filtering | Article | |
2006 | Kocsis, Levente and Szepesvári, Csaba and Winands, MHM RSPSA: enhanced parameter optimisation in games | ISI Article | |
2006 | Kocsis, Levente and Szepesvári, Csaba Universal parameter optimisation in games based on SPSA | ISI Article |
2007
Date | Author/Title | Document Type | |
---|---|---|---|
2007 | György, András and Kocsis, Levente and Szabó, Ivett and Szepesvári, Csaba Continuous time associative bandit problems | Conference or Workshop Item | |
2007 | Antos, András and Munos, Rémi and Szepesvári, Csaba Fitted Q-iteration in continuous action-space MDPs | Conference or Workshop Item | |
2007 | Antos, András and Szepesvári, Csaba and Munos, Rémi Value-iteration based fitted policy iteration: learning with a single trajectory | Conference or Workshop Item |
2008
Date | Author/Title | Document Type | |
---|---|---|---|
2008 | Antos, András and Grover, Varun and Szepesvári, Csaba Active learning in multi-armed bandits | Conference or Workshop Item | |
2008 | Bartók, Gábor and Szepesvári, Csaba and Zilles, Sandra Active learning of group-structured environments | Conference or Workshop Item | |
2008 | Sutton, Richard S. and Szepesvári, Csaba and Geramifard, Alborz and Bowling, Michael H. Dyna-style planning with linear function approximation and prioritized sweeping | Conference or Workshop Item | |
2008 | Mnih, Volodymyr and Szepesvári, Csaba and Audibert, Jean-Yves Empirical Bernstein stopping | Conference or Workshop Item | |
2008 | Munos, Rémi and Szepesvári, Csaba Finite-time bounds for fitted value iteration | ISI Article | |
2008 | Antos, András and Szepesvári, Csaba and Munos, Rémi Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path | ISI Article | |
2008 | Farahmand, Amir massoud and Ghavamzadeh, Mohammad and Szepesvári, Csaba and Mannor, Shie Regularized fitted Q-iteration: application to planning | Conference or Workshop Item | |
2008 | Isaza, Alejandro and Szepesvári, Csaba and Bulitko, Vadim and Greiner, Russell Speeding up planning in Markov decision processes via automatically constructed abstractions | Conference or Workshop Item |
2009
Date | Author/Title | Document Type | |
---|---|---|---|
2009 | Audibert, Jean-Yves and Munos, Rémi and Szepesvári, Csaba Exploration-exploitation tradeoff using variance estimates in multi-armed bandits | ISI Article | |
2009 | Yao, Hengshuai and Bhatnagar, Shalabh and Szepesvári, Csaba LMS-2: Towards an algorithm that is as cheap as LMS and almost as efficient as RLS | Conference or Workshop Item | |
2009 | Farhangfar, Alireza and Greiner, Russ and Szepesvári, Csaba Learning to segment from a few well-selected training images | Conference or Workshop Item | |
2009 | Póczos, Barnabás and Abbasi-Yadkori, Yasin and Szepesvári, Csaba and Greiner, Russ and Sturtevant, Nathan Learning when to stop thinking and do something! | Conference or Workshop Item | |
2009 | Farahmand, Amir massoud and Shademan, Azad and Jägersand, Martin and Szepesvári, Csaba Model-based and model-free reinforcement learning for visual servoing | Conference or Workshop Item | |
2009 | Neu, Gergely and Szepesvári, Csaba Training parsers by inverse reinforcement learning | ISI Article |
2010
Date | Author/Title | Document Type | |
---|---|---|---|
2010-05-13 | Torma, Péter and György, András and Szepesvári, Csaba A Markov-chain Monte Carlo approach to simultaneous localization and mapping | Conference or Workshop Item | |
2010 | Antos, András and Grover, Varun and Szepesvári, Csaba Active learning in heteroscedastic noise | ISI Article | |
2010 | Li, L. and Póczos, B. and Szepesvári, Csaba Budgeted distribution learning of belief net parameters | Conference or Workshop Item | |
2010 | Maei, Hamid Reza and Szepesvári, Csaba and Bhatnagar, Shalabh and Silver, David and Precup, Doina and Sutton, Richard Convergent Temporal-Difference Learning with Arbitrary Differentiable Function Approximator | Conference or Workshop Item | |
2010 | Farahmand, A. M. and Munos, R. and Szepesvári, Csaba Error propagation for approximate policy and value iteration | Conference or Workshop Item | |
2010 | Pál, D. and Póczos, B. and Szepesvári, Csaba Estimation of Rényi entropy and mutual information based on generalized nearest-neighbor graphs | Conference or Workshop Item | |
2010 | Yu, Yaoliang and Li, Yuxi and Szepesvári, Csaba and Schuurmans, Dale A General Projection Property for Distribution Families | Conference or Workshop Item | |
2010 | Szita, I. and Szepesvári, Csaba Model-based reinforcement learning with nearly tight exploration complexity bounds | Conference or Workshop Item | |
2010 | Bartók, Gábor and Szepesvári, Csaba and Zilles, S. Models of active learning in group-structured state spaces | ISI Article | |
2010 | Neu, Gergely and György, András and Szepesvári, Csaba and Antos, András Online Markov decision processes under bandit feedback | Conference or Workshop Item | |
2010 | Maei, H. and Szepesvári, Csaba and Bhatnagar, S. and Sutton, R. S. Toward off-policy learning control with function approximation | Conference or Workshop Item | |
2010 | Neu, Gergely and György, András and Szepesvári, Csaba The online loop-free stochastic shortest-path problem | Conference or Workshop Item |
2012
Date | Author/Title | Document Type | |
---|---|---|---|
2012-03 | Gelly, Sylvain and Kocsis, Levente and Schoenauer, Marc and Sebag, Michèle and Silver, David and Szepesvári, Csaba and Teytaud, Olivier The grand challenge of computer Go: Monte Carlo Tree Search and Extensions | ISI Article |