TY - JOUR
T1 - Tensor decomposition for multi-agent predictive state representation
AU - Ma, Biyang
AU - Chen, Bilian
AU - Zeng, Yifeng
AU - Tang, Jing
AU - Cao, Langcai
N1 - Funding Information: This work was supported in part by the National Natural Science Foundation of China (Grants No. 61772442 and 61836005). Yifeng Zeng receives the EPSRC, UK New Investigator Award.
PY - 2022/3/1
Y1 - 2022/3/1
N2 - Predictive state representation (PSR) uses a vector of action-observation sequence to represent the system dynamics and subsequently predicts the probability of future events. It is a concise knowledge representation that is well studied in a single-agent planning problem domain. To the best of our knowledge, there is no existing work on using PSR to solve multi-agent planning problems. Learning a multi-agent PSR model is quite difficult especially with the increasing number of agents, not to mention the complexity of a problem domain. In this paper, we resort to tensor techniques to tackle the challenging task of multi-agent PSR model development problems. By first focusing on a two-agent setting, we construct the system dynamics matrix as a high order tensor for a PSR model, learn the prediction parameters and deduce state vectors directly through two different tensor decomposition methods respectively, and derive the transition parameters via linear regression. Subsequently we generalize the PSR learning approaches in a multi-agent setting. Experimental results show that our methods can effectively solve multi-agent PSR modelling problems in multiple problem domains.
AB - Predictive state representation (PSR) uses a vector of action-observation sequence to represent the system dynamics and subsequently predicts the probability of future events. It is a concise knowledge representation that is well studied in a single-agent planning problem domain. To the best of our knowledge, there is no existing work on using PSR to solve multi-agent planning problems. Learning a multi-agent PSR model is quite difficult especially with the increasing number of agents, not to mention the complexity of a problem domain. In this paper, we resort to tensor techniques to tackle the challenging task of multi-agent PSR model development problems. By first focusing on a two-agent setting, we construct the system dynamics matrix as a high order tensor for a PSR model, learn the prediction parameters and deduce state vectors directly through two different tensor decomposition methods respectively, and derive the transition parameters via linear regression. Subsequently we generalize the PSR learning approaches in a multi-agent setting. Experimental results show that our methods can effectively solve multi-agent PSR modelling problems in multiple problem domains.
KW - Learning approaches
KW - Predictive state representations
KW - Tensor optimization
UR - http://www.scopus.com/inward/record.url?scp=85118481239&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2021.115969
DO - 10.1016/j.eswa.2021.115969
M3 - Article
AN - SCOPUS:85118481239
SN - 0957-4174
VL - 189
JO - Expert Systems with Applications
JF - Expert Systems with Applications
M1 - 115969
ER -