Publication

Application of Learning Automata for Stochastic Online Scheduling

Book contribution - Chapter

We look at a stochastic online scheduling problem in which exact job lengths are unknown and jobs arrive over time. Heuristics exist that perform very well, but they do not extend to multi-stage problems where all jobs must be processed by a sequence of machines.
We successfully apply Learning Automata (LA), a Reinforcement Learning technique, to such a multi-stage scheduling setting. We use a Learning Automaton at each decision point in the production chain. Each Learning Automaton has a probability distribution over the machines it can choose. The difference from simple randomization algorithms is the update rule used by the LA. Whenever a job is finished, the LA are notified and update their probability distributions: if the job finished faster than expected, the probability of selecting the same action is increased; otherwise it is decreased.
Due to this adaptation, the LA can learn the processing capacities of the machines or, more precisely, of the entire downstream production chain.
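The update rule described above can be sketched roughly as follows. This is a minimal illustration, not the chapter's implementation: the abstract does not name the exact update scheme, so the sketch assumes a standard linear reward-penalty rule. The class name `LearningAutomaton`, the parameters `reward_rate` and `penalty_rate`, and the `simulate` helper (which ignores queueing and uses a running average as the "expected" duration) are all hypothetical.

```python
import random


class LearningAutomaton:
    """Hypothetical automaton that picks one of n_actions machines (n_actions >= 2)."""

    def __init__(self, n_actions, reward_rate=0.1, penalty_rate=0.1, seed=None):
        # Start from a uniform distribution over the available machines.
        self.probs = [1.0 / n_actions] * n_actions
        self.reward_rate = reward_rate    # assumed step size after a fast job
        self.penalty_rate = penalty_rate  # assumed step size after a slow job
        self.rng = random.Random(seed)

    def choose(self):
        # Sample a machine index according to the current probabilities.
        return self.rng.choices(range(len(self.probs)), weights=self.probs)[0]

    def update(self, action, faster_than_expected):
        # Linear reward-penalty update (an assumption; the source only says
        # the probability is increased or decreased).
        n = len(self.probs)
        for i in range(n):
            p = self.probs[i]
            if faster_than_expected:
                # Reward: move probability mass towards the chosen machine.
                if i == action:
                    self.probs[i] = p + self.reward_rate * (1.0 - p)
                else:
                    self.probs[i] = p - self.reward_rate * p
            else:
                # Penalty: move probability mass away from the chosen machine.
                if i == action:
                    self.probs[i] = p - self.penalty_rate * p
                else:
                    self.probs[i] = p + self.penalty_rate * (1.0 / (n - 1) - p)


def simulate(mean_times, n_jobs=1000, seed=0):
    """Toy single-stage run: durations are exponential, queueing is ignored."""
    rng = random.Random(seed)
    la = LearningAutomaton(len(mean_times), seed=seed)
    expected = None  # running average of all observed durations
    for k in range(1, n_jobs + 1):
        machine = la.choose()
        duration = rng.expovariate(1.0 / mean_times[machine])
        if expected is not None:
            la.update(machine, duration < expected)
        expected = duration if expected is None else expected + (duration - expected) / k
    return la.probs
```

Calling, for example, `simulate([1.0, 2.0])` returns the automaton's final probabilities for a fast and a slow machine. Both update branches keep the probabilities summing to one, so the distribution stays valid after every notification.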
Book: The 14th Belgian-French-German Conference on Optimization
Series: Recent Advances in Optimization and its Applications in Engineering
Pages: 491-498
Number of pages: 8
ISBN: 978-3-642-12597-3
Year of publication: 2010
Keywords: Reinforcement Learning, Scheduling
  • ORCID: /0000-0001-6346-4564/work/65577368
  • Scopus Id: 84865781178