Distributed value function approximation for multi-agent reinforcement learning
Distributed value function approximation for multi-agent reinforcement learning
Autori:
Izdanje: Sinteza 2018 International Scientific Conference on Information Technology and Data Related Research
Oblast: Abstract Preview
Stranice: 339339
Apstrakt:
In this work we propose a novel class of distributed algorithms for iterative multi-agent value function approximation for reinforcement learning in Markov decision processes. The algorithms do not require any fusion center and are based on incorporating consensus-based collaborations between the agents over time-varying communication network. We allow local learning strategies of the agents to belong to a unified class of the existing single-agent algorithms which are based on stochastic gradient descent minimization of appropriately defined local cost functions, and which include off-policy learning with eligibility traces. The off-policy local schemes are particularly important since they allow the agents in the resulting distributed algorithm to have different behavior policies while evaluating the response to a single target policy. We discuss the convergence properties of the algorithms, and show that, by a proper design of the network parameters and/or network topology, the convergence point (if exists) can be tuned to coincide with the globally optimal point. The properties and the effectiveness of the proposed algorithms are illustrated by simulations.
Ključne reči: reinforcement learning, value function approximation, multi-agent system, convergence, distributed stochastic approximation
Priložene datoteke:
- 339 ( veličina: 149,57 KB, broj pregleda: 369 )
Zahvaljujemo se što ste preuzeli publikaciju sa portala Singipedia.
Ukoliko želite da se prijavite za obaveštenja o sadržajima iz oblasti ove publikacije, možete nam ostaviti adresu svoje elektronske pošte.
Preuzimanje citata:
BibTeX format
RefWorks Tagged format
Unapred formatirani prikaz citata
BibTeX format
@article{article, author = {M. Stanković}, title = {Distributed value function approximation for multi-agent reinforcement learning}, journal = {Sinteza 2018 International Scientific Conference on Information Technology and Data Related Research}, year = 2018, pages = {339}, doi = {} }
RT Conference Proceedings A1 Miloš Stanković T1 Distributed value function approximation for multi-agent reinforcement learning AD Univerzitet Singidunum, Beograd, Beograd, Srbija YR 2018 NO doi:
M. Stanković, Distributed value function approximation for multi-agent reinforcement learning, Univerzitet Singidunum, Beograd, 2018, doi: