1. Learn adaptive dynamic policy under mixed multi-agent environment
    Zheng Xiao & Shiyong Zhang 2008. 2008 8th IEEE International Conference on Computer and Information Technology p.249
    doi : 10.1109/CIT.2008.4594682
  2. Markov Decision Processes in Artificial Intelligence
    Andriy Burkov et al. 2013. p.229
    doi : 10.1002/9781118557426.ch8

Mise-à-jour / Updated: 2018-10-17