Items where Author is "Szepesvári, Csaba"

Number of items: 45.

Date	Author/Title	Document Type
2012-03	Gelly, Sylvain and Kocsis, Levente and Schoenauer, Marc and Sebag, Michèle and Silver, David and Szepesvári, Csaba and Teytaud, Olivier The grand challenge of computer Go: Monte Carlo Tree Search and Extensions	ISI Article
2010-05-13	Torma, Péter and György, András and Szepesvári, Csaba A Markov-chain Monte Carlo approach to simultaneous localization and mapping	Conference or Workshop Item
2010	Antos, András and Grover, Varun and Szepesvári, Csaba Active learning in heteroscedastic noise	ISI Article
2010	Li, L. and Póczos, B. and Szepesvári, Csaba Budgeted distribution learning of belief net parameters	Conference or Workshop Item
2010	Maei, Hamid Reza and Szepesvári, Csaba and Bhatnagar, Shalabh and Silver, David and Precup, Doina and Sutton, Richard Convergent Temporal-Difference Learning with Arbitrary Differentiable Function Approximator	Conference or Workshop Item
2010	Farahmand, A. M. and Munos, R. and Szepesvári, Csaba Error propagation for approximate policy and value iteration	Conference or Workshop Item
2010	Pál, D. and Póczos, B. and Szepesvári, Csaba Estimation of Rényi entropy and mutual information based on generalized nearest-neighbor graphs	Conference or Workshop Item
2010	Yu, Yaoliang and Li, Yuxi and Szepesvári, Csaba and Schuurmans, Dale A General Projection Property for Distribution Families	Conference or Workshop Item
2010	Szita, I. and Szepesvári, Csaba Model-based reinforcement learning with nearly tight exploration complexity bounds	Conference or Workshop Item
2010	Bartók, Gábor and Szepesvári, Csaba and Zilles, S. Models of active learning in group-structured state spaces	ISI Article
2010	Neu, Gergely and György, András and Szepesvári, Csaba and Antos, András Online Markov decision processes under bandit feedback	Conference or Workshop Item
2010	Maei, H. and Szepesvári, Csaba and Bhatnagar, S. and Sutton, R. S. Toward off-policy learning control with function approximation	Conference or Workshop Item
2010	Neu, Gergely and György, András and Szepesvári, Csaba The online loop-free stochastic shortest-path problem	Conference or Workshop Item
2009	Audibert, Jean-Yves and Munos, Remi and Szepesvári, Csaba Exploration-exploitation tradeoff using variance estimates in multi-armed bandits	ISI Article
2009	Yao, Hengshuai and Bhatnagar, Shalabh and Szepesvári, Csaba LMS-2: Towards an algorithm that is as cheap as LMS and almost as efficient as RLS	Conference or Workshop Item
2009	Farhangfar, Alireza and Greiner, Russ and Szepesvári, Csaba Learning to segment from a few well-selected training images	Conference or Workshop Item
2009	Póczos, Barnabás and Abbasi-Yadkori, Yasin and Szepesvári, Csaba and Greiner, Russ and Sturtevant, Nathan Learning when to stop thinking and do something!	Conference or Workshop Item
2009	Farahmand, Amir massoud and Shademan, Azad and Jägersand, Martin and Szepesvári, Csaba Model-based and model-free reinforcement learning for visual servoing	Conference or Workshop Item
2009	Neu, Gergely and Szepesvári, Csaba Training parsers by inverse reinforcement learning	ISI Article
2008	Antos, András and Grover, Varun and Szepesvári, Csaba Active learning in multi-armed bandits	Conference or Workshop Item
2008	Bartók, Gábor and Szepesvári, Csaba and Zilles, Sandra Active learning of group-structured environments	Conference or Workshop Item
2008	Sutton, Richard S. and Szepesvári, Csaba and Geramifard, Alborz and Bowling, Michael H. Dyna-style planning with linear function approximation and prioritized sweeping	Conference or Workshop Item
2008	Mnih, Volodymyr and Szepesvári, Csaba and Audibert, Jean-Yves Empirical Bernstein stopping	Conference or Workshop Item
2008	Munos, Remi and Szepesvári, Csaba Finite-time bounds for fitted value iteration	ISI Article
2008	Antos, András and Szepesvári, Csaba and Munos, Rémi Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path	ISI Article
2008	Farahmand, Amir massoud and Ghavamzadeh, Mohammad and Szepesvári, Csaba and Mannor, Shie Regularized fitted Q-iteration: application to planning	Conference or Workshop Item
2008	Isaza, Alejandro and Szepesvári, Csaba and Bulitko, Vadim and Greiner, Russell Speeding up planning in Markov decision processes via automatically constructed abstractions	Conference or Workshop Item
2007	György, András and Kocsis, Levente and Szabó, Ivett and Szepesvári, Csaba Continuous time associative bandit problems	Conference or Workshop Item
2007	Antos, András and Munos, Rémi and Szepesvári, Csaba Fitted Q-iteration in continuous action-space MDPs	Conference or Workshop Item
2007	Antos, András and Szepesvári, Csaba and Munos, Rémi Value-iteration based fitted policy iteration: learning with a single trajectory	Conference or Workshop Item
2006	Kocsis, Levente and Szepesvári, Csaba Bandit based Monte-Carlo planning	ISI Article
2006	Antos, András and Szepesvári, Csaba and Munos, R. Learning near-optimal policies with Bellman-residual minimization based fitted policy iteration and a single sample path	ISI Article
2006	Torma, P. and Szepesvári, Csaba Local importance sampling: a novel technique to enhance particle filtering	Article
2006	Kocsis, Levente and Szepesvári, Csaba and Winands, MHM RSPSA: enhanced parameter optimisation in games	ISI Article
2006	Kocsis, Levente and Szepesvári, Csaba Universal parameter Optimisation in games based on SPSA	ISI Article
2005	Szepesvári, Csaba and Munos, R. Finite time bounds for sampling based fitted value iteration	Conference or Workshop Item
2005	Gerencsér, László and Rásonyi, Miklós and Szepesvári, Csaba and Vágó, Zs Log-optimal currency portfolios and control Lyapunov exponents	Conference or Workshop Item
2005	Torma, P. and Szepesvári, Csaba On using likelihood-adjusted proposals in particle filtering: local importance sampling	Conference or Workshop Item
2004	Torma, P. and Szepesvári, Csaba Enhancing particle filters using local likelihood sampling	Article
2004	Szepesvári, Csaba and Smart, WD Interpolation-based Q-learning	Conference or Workshop Item
2004	Szepesvári, Csaba and Kocsor, A. and Kovács, K. Kernel machine based feature extraction algorithms for regression problems	Conference or Workshop Item
2004	Kocsor, A. and Kovács, K. and Szepesvári, Csaba Margin maximizing discriminant analysis	Article
2004	Szepesvári, Csaba Shortest path discovery problems: a framework, algorithms and experimental results	Conference or Workshop Item
2003	French, M. and Szepesvári, Csaba and Rogers, E. Performance of nonlinear approximate adaptive controllers	Book
2003	Torma, P. and Szepesvári, Csaba Sequential importance sampling for visual tracking reconsidered	Conference or Workshop Item

This list was generated on Wed Jul 22 20:21:56 2026 CEST.
