Distributed value function approximation for multi-agent reinforcement learning

Publication: Sinteza 2018 International Scientific Conference on Information Technology and Data Related Research

Area: Abstract Preview

Pages: 339

Abstract:
In this work we propose a novel class of distributed algorithms for iterative multi-agent value function approximation for reinforcement learning in Markov decision processes. The algorithms do not require a fusion center and are based on consensus-based collaboration between the agents over a time-varying communication network. The agents' local learning strategies may belong to a unified class of existing single-agent algorithms that are based on stochastic gradient descent minimization of appropriately defined local cost functions and that include off-policy learning with eligibility traces. The off-policy local schemes are particularly important because they allow the agents in the resulting distributed algorithm to follow different behavior policies while evaluating the response to a single target policy. We discuss the convergence properties of the algorithms and show that, by a proper design of the network parameters and/or network topology, the convergence point (if it exists) can be tuned to coincide with the globally optimal point. The properties and the effectiveness of the proposed algorithms are illustrated by simulations.
Keywords: reinforcement learning, value function approximation, multi-agent system, convergence, distributed stochastic approximation
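
The general idea in the abstract (a local stochastic-gradient-style value update at each agent, followed by a consensus step over a time-varying network, with no fusion center) can be illustrated with a minimal sketch. This is not the algorithm from the paper: it swaps in plain on-policy TD(0) with linear features, without eligibility traces or off-policy corrections, and lets every agent observe the same Markov chain. The function name `consensus_td0`, the three-state chain, the reward vector, the step size, and the two alternating averaging matrices `A1` and `A2` are all assumptions chosen only for illustration.

```python
import numpy as np

def consensus_td0(P, r, Phi, A_seq, gamma=0.5, alpha=0.02, steps=5000, seed=0):
    """Illustrative sketch: local TD(0) updates followed by consensus averaging.

    P      : state transition matrix under the evaluated policy (assumed known
             here only so that trajectories can be simulated)
    r      : reward received in each state
    Phi    : feature matrix, one row per state
    A_seq  : list of row-stochastic consensus matrices, cycled over time to
             mimic a time-varying communication network
    """
    rng = np.random.default_rng(seed)
    n_agents = A_seq[0].shape[0]
    n_states, n_feat = Phi.shape
    theta = np.zeros((n_agents, n_feat))           # one parameter vector per agent
    states = rng.integers(n_states, size=n_agents)  # each agent has its own trajectory
    for t in range(steps):
        for i in range(n_agents):
            s = states[i]
            s_next = rng.choice(n_states, p=P[s])
            # Local temporal-difference error and TD(0) update for agent i
            delta = r[s] + gamma * Phi[s_next] @ theta[i] - Phi[s] @ theta[i]
            theta[i] += alpha * delta * Phi[s]
            states[i] = s_next
        # Consensus step: each agent averages with its current neighbours
        theta = A_seq[t % len(A_seq)] @ theta
    return theta

# Hypothetical three-state chain and four agents (all values are assumptions)
P = np.array([[0.5, 0.5, 0.0],
              [0.25, 0.5, 0.25],
              [0.0, 0.5, 0.5]])
r = np.array([0.0, 0.0, 1.0])
Phi = np.eye(3)  # tabular features, so the exact value function is representable

# Two alternating pairwise-averaging topologies: {0-1, 2-3} and {0-3, 1-2}
A1 = np.array([[0.5, 0.5, 0.0, 0.0],
               [0.5, 0.5, 0.0, 0.0],
               [0.0, 0.0, 0.5, 0.5],
               [0.0, 0.0, 0.5, 0.5]])
A2 = np.array([[0.5, 0.0, 0.0, 0.5],
               [0.0, 0.5, 0.5, 0.0],
               [0.0, 0.5, 0.5, 0.0],
               [0.5, 0.0, 0.0, 0.5]])

theta = consensus_td0(P, r, Phi, [A1, A2])
v_true = np.linalg.solve(np.eye(3) - 0.5 * P, r)  # exact values for comparison
print("agent estimates:\n", theta)
print("exact values:", v_true)
```

Neither topology connects all four agents on its own, but the two together do, which is the kind of situation a time-varying network is meant to cover. With a constant step size the estimates fluctuate around the exact values rather than converging exactly.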

Citation download:

BibTeX format
@inproceedings{article,
  author    = {M. Stanković},
  title     = {Distributed value function approximation for multi-agent reinforcement learning},
  booktitle = {Sinteza 2018 International Scientific Conference on Information Technology and Data Related Research},
  year      = 2018,
  pages     = {339},
  doi       = {}
}
RefWorks Tagged format
RT Conference Proceedings
A1 Miloš Stanković
T1 Distributed value function approximation for multi-agent reinforcement learning
AD Univerzitet Singidunum, Beograd, Beograd, Srbija
YR 2018
NO doi: 
Preformatted citation
M. Stanković, Distributed value function approximation for multi-agent reinforcement learning, Univerzitet Singidunum, Beograd, 2018, doi: