Adaptive sampling based large-scale stochastic resource control
Csáji, Balázs Csanád and Monostori, László (2006) Adaptive sampling based large-scale stochastic resource control. In: AAAI 2006 - IAAI 2006. 21st national conference on artificial intelligence. 18th conference on innovative applications of artificial intelligence. Boston, 2006.
Full text not available from this repository.

Abstract
We consider closed-loop solutions to stochastic optimization problems of resource allocation type. These problems concern the dynamic allocation of reusable resources over time to non-preemptive, interconnected tasks with stochastic durations. The aim is to minimize the expected value of a regular performance measure. First, we formulate the problem as a stochastic shortest path problem and argue that our formulation has favorable properties, e.g., it has a finite horizon and is acyclic, so all policies are proper; moreover, the space of control policies can be safely restricted. Then, we propose an iterative solution. Essentially, we apply a reinforcement-learning-based adaptive sampler to compute a suboptimal control policy. We suggest several approaches to enhance this solution and make it applicable to large-scale problems. The main improvements are: (1) the value function is maintained by feature-based support vector regression; (2) the initial exploration is guided by rollout algorithms; (3) the state space is partitioned by clustering the tasks while keeping the precedence constraints satisfied; (4) the action space is decomposed and, consequently, the number of available actions in a state is decreased; and, finally, (5) we argue that the sampling can be effectively distributed among several processors. The effectiveness of the approach is demonstrated by experimental results on both artificial (benchmark) and real-world (industry-related) data.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Uncontrolled Keywords: | Dynamic programming, stochastic resource control |
Subjects: | Q Science > QA Mathematics and Computer Science > QA75 Electronic computers. Computer science / számítástechnika, számítógéptudomány |
Divisions: | Research Laboratory on Engineering & Management Intelligence |
Depositing User: | Eszter Nagy |
Date Deposited: | 11 Dec 2012 15:26 |
Last Modified: | 25 Jul 2018 12:57 |
URI: | https://eprints.sztaki.hu/id/eprint/4460 |