Distributed Value Function Approximation for Collaborative Multi-Agent Reinforcement Learning Autori: Miloš Stanković, Marko Beko, Srdjan Stanković Časopis: IEEE Transactions on Control of Network Systems DOI: 10.1109/TCNS.2021.3061909