Diverse Generation for Multi-agent Sports Games

What is it?

Proposes a new trajectory generation model that uses multi-agent tracking data.

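What makes it better than prior work?

It combines GNNs with a variational recurrent neural network (VRNN) to obtain a permutation-equivariant model suited to sports.

It overcomes the limitations of earlier role-based methods.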

What is the key technique?

GNNs and a VRNN are combined and applied to sports scenes.

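How was it validated?

Proposes several new evaluation metrics that quantify the quality of generated samples at the distribution level.

Negative log-likelihood (NLL): the negative log-likelihood on the test set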
Mean and best L2-error
Max L2-error (Best)
Miss rate:
Conditional L2-error (Best):
Distributional metrics:
Average player speed
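
To make "Mean and best L2-error" concrete, here is a minimal Python sketch. It assumes the L2-error of one sample is the average Euclidean distance to the ground-truth trajectory, and that "mean" and "best" are taken over a set of sampled trajectories. These definitions, and the names `l2_error` and `mean_and_best_l2`, are assumptions for illustration; the paper's exact definitions are not recorded in these notes.

```python
import math


def l2_error(pred, truth):
    """Average Euclidean distance between two trajectories of 2D points (assumed definition)."""
    dists = [math.dist(p, q) for p, q in zip(pred, truth)]
    return sum(dists) / len(dists)


def mean_and_best_l2(samples, truth):
    """Return (mean, best) L2-error over a set of sampled trajectories."""
    errors = [l2_error(s, truth) for s in samples]
    return sum(errors) / len(errors), min(errors)


# Toy data: one ground-truth trajectory and two generated samples.
truth = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)]
samples = [
    [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)],  # matches the ground truth exactly
    [(0.0, 3.0), (1.0, 3.0), (2.0, 3.0)],  # shifted by 3 along y at every step
]
mean_err, best_err = mean_and_best_l2(samples, truth)
```

The "best" value rewards a model that produces at least one sample close to the ground truth, whereas the mean penalizes every sample that drifts away from it.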

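Any discussion?

Trajectory modeling in soccer does not work as well as in basketball. The hypothesis is that soccer has fewer stochastic elements.
Sudden acceleration is a sparse, bursty phenomenon that a Gaussian likelihood captures poorly, so none of the models adequately models player acceleration.
The ball can also be modeled, but performance was not as strong as for basketball.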

Which papers to read next?

Where Will They Go? Predicting Fine-Grained Adversarial Multi-Agent Motion using Conditional Variational Autoencoders (ECCV 2018)
Generative multiagent behavioral cloning (arXiv 2018)
Representing and discovering adversarial team behaviors using player roles (CVPR 2013)

GNNs applied to pedestrians: [1, 15, 24, 30, 40]

A recurrent latent variable model for sequential data (NeurIPS 2015)

Memo

Reading the paper: 44 min

Details (checking equations, etc.): 5 min

Notes: 44 min

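CVPR 2019 paper.
It solves the trajectory prediction task with a GNN.
It looks usable for retrieval tasks as well, so I want to understand it.
Graph VRNN variants: a graph variational recurrent neural network whose graph is either fully connected (Full) or contains only self-loops, i.e. has a diagonal adjacency matrix (Diag).
Problem formulation: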

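Let

\mathbf{x}_{k}^{t} \in \mathbb{R}^{2}

denote the 2D position of agent $k$ at time $t$, and let

\mathbf{x}_{k}=\left(\mathbf{x}_{k}^{1}, \ldots, \mathbf{x}_{k}^{T}\right)

be the trajectory of agent $k$. One segment of a game collects the trajectories of all $K$ agents, $\mathbf{x}=\mathbf{x}_{1: K}$, and $\mathcal{D}=\{\mathbf{x}\}$ contains all segments.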

The proposed method builds on the VRNN.

A VRNN is characterized by the following distributions and RNN update equation.

\begin{aligned}p_{\theta}\left(\mathbf{z}^{t} \mid \mathbf{x}^{<t}, \mathbf{z}^{<t}\right) & =\mathcal{N}\left(\mathbf{z}^{t} \mid \boldsymbol{\mu}_{\mathrm{pri}}^{t},\left(\boldsymbol{\sigma}_{\mathrm{pri}}^{t}\right)^{2}\right) & & \text { (prior), } \\q_{\phi}\left(\mathbf{z}^{t} \mid \mathbf{x}^{\leq t}, \mathbf{z}^{<t}\right) & =\mathcal{N}\left(\mathbf{z}^{t} \mid \boldsymbol{\mu}_{\mathrm{enc}}^{t},\left(\boldsymbol{\sigma}_{\text {enc }}^{t}\right)^{2}\right) & & \text { (inference), } \\p_{\theta}\left(\mathbf{x}^{t} \mid \mathbf{x}^{<t}, \mathbf{z}^{\leq t}\right) & =\mathcal{N}\left(\mathbf{x}^{t} \mid \boldsymbol{\mu}_{\text {dec }}^{t},\left(\boldsymbol{\sigma}_{\text {dec }}^{t}\right)^{2}\right) & & \text { (generation), } \\h^{t} & =f_{\mathrm{rnn}}\left(\mathbf{x}^{t}, \mathbf{z}^{t}, h^{t-1}\right) & & \text { (recurrence), }\end{aligned}

where

\begin{aligned}{\left[\boldsymbol{\mu}_{\mathrm{pri}}^{t}, \boldsymbol{\sigma}_{\mathrm{pri}}^{t}\right] } & =f_{\mathrm{pri}}\left(h^{t-1} ; \theta\right), \\{\left[\boldsymbol{\mu}_{\mathrm{enc}}^{t}, \boldsymbol{\sigma}_{\mathrm{enc}}^{t}\right] } & =f_{\mathrm{enc}}\left(\mathbf{x}^{t}, h^{t-1} ; \phi\right), \\{\left[\boldsymbol{\mu}_{\mathrm{dec}}^{t}, \boldsymbol{\sigma}_{\mathrm{dec}}^{t}\right] } & =f_{\mathrm{dec}}\left(\mathbf{z}^{t}, h^{t-1} ; \theta\right),\end{aligned}

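Here $\mathcal{N}\left(\cdot \mid \boldsymbol{\mu}, \boldsymbol{\sigma}^{2}\right)$ denotes a multivariate normal distribution with mean $\boldsymbol{\mu}$ and covariance matrix $\operatorname{diag}\left(\boldsymbol{\sigma}^{2}\right)$, and $f_{\mathrm{pri}}$, $f_{\mathrm{enc}}$, $f_{\mathrm{dec}}$ are deep networks for the prior, encoder, and decoder, with learnable parameters $\theta$ (prior and decoder) and $\phi$ (encoder).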
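
The generative path of these equations (prior, then generation, then recurrence) can be sketched in plain Python as follows. This is a one-dimensional toy, not the paper's implementation: `f_pri`, `f_dec`, and `f_rnn` are hypothetical fixed-weight stand-ins for the learned networks, and the softplus that keeps the standard deviations positive is an assumption.

```python
import math
import random


def softplus(a):
    """Smooth positive transform, used here to keep standard deviations > 0."""
    return math.log1p(math.exp(a))


# Hypothetical toy stand-ins for the learned networks (scalar, fixed weights).
def f_pri(h_prev):
    return 0.5 * h_prev, softplus(0.1 * h_prev)


def f_dec(z, h_prev):
    return z + 0.5 * h_prev, softplus(0.2 * z)


def f_rnn(x, z, h_prev):
    return math.tanh(0.3 * x + 0.3 * z + 0.4 * h_prev)


def generate(T, h0=0.0, seed=0):
    """Sample x^1..x^T by ancestral sampling through prior, decoder, recurrence."""
    rng = random.Random(seed)
    h, xs = h0, []
    for _ in range(T):
        mu_pri, sigma_pri = f_pri(h)       # prior parameters from h^{t-1}
        z = rng.gauss(mu_pri, sigma_pri)   # z^t ~ N(mu_pri, sigma_pri^2)
        mu_dec, sigma_dec = f_dec(z, h)    # decoder parameters from z^t, h^{t-1}
        x = rng.gauss(mu_dec, sigma_dec)   # x^t ~ N(mu_dec, sigma_dec^2)
        h = f_rnn(x, z, h)                 # h^t = f_rnn(x^t, z^t, h^{t-1})
        xs.append(x)
    return xs


trajectory = generate(T=5)
```

The encoder $f_{\mathrm{enc}}$ does not appear because it is only needed for inference during training; at generation time $\mathbf{z}^{t}$ is drawn from the prior.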
The VRNN is trained by maximizing the evidence lower bound (ELBO):

\begin{array}{r}\sum_{\mathbf{x} \in \mathcal{D}} \sum_{t} \mathbb{E}_{q_{\phi}\left(\mathbf{z}^{t} \mid \mathbf{x}^{\leq t}, \mathbf{z}^{<t}\right)}\left[\log p_{\theta}\left(\mathbf{x}^{t} \mid \mathbf{x}^{<t}, \mathbf{z}^{\leq t}\right)-\right. \\\left.D_{\mathrm{KL}}\left(q_{\phi}\left(\mathbf{z}^{t} \mid \mathbf{x}^{\leq t}, \mathbf{z}^{<t}\right) \| p_{\theta}\left(\mathbf{z}^{t} \mid \mathbf{x}^{<t}, \mathbf{z}^{<t}\right)\right)\right] .\end{array}
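
Because the prior, encoder, and decoder are all diagonal Gaussians, each per-step term of the ELBO has a closed form once $\mathbf{z}^{t}$ has been sampled. The sketch below computes one such term in plain Python. Approximating the expectation with a single sample of $\mathbf{z}^{t}$ is an assumption here (a common choice, not something these notes state), and the function names are illustrative.

```python
import math


def gaussian_log_pdf(x, mu, sigma):
    """log N(x | mu, diag(sigma^2)), summed over dimensions."""
    return sum(
        -0.5 * math.log(2 * math.pi) - math.log(s) - 0.5 * ((xi - m) / s) ** 2
        for xi, m, s in zip(x, mu, sigma)
    )


def gaussian_kl(mu_q, sigma_q, mu_p, sigma_p):
    """KL( N(mu_q, diag(sigma_q^2)) || N(mu_p, diag(sigma_p^2)) ) in closed form."""
    return sum(
        math.log(sp / sq) + (sq ** 2 + (mq - mp) ** 2) / (2 * sp ** 2) - 0.5
        for mq, sq, mp, sp in zip(mu_q, sigma_q, mu_p, sigma_p)
    )


def elbo_term(x, mu_dec, sigma_dec, mu_enc, sigma_enc, mu_pri, sigma_pri):
    """One time step of the ELBO: reconstruction log-likelihood minus KL(q || prior).

    mu_dec and sigma_dec are assumed to come from a decoder evaluated on a single
    sample z^t drawn from the encoder distribution.
    """
    reconstruction = gaussian_log_pdf(x, mu_dec, sigma_dec)
    kl = gaussian_kl(mu_enc, sigma_enc, mu_pri, sigma_pri)
    return reconstruction - kl


# Toy 2D example: the encoder mean is off by 1 from the prior in the first dimension.
term = elbo_term(
    x=[0.0, 0.0], mu_dec=[0.0, 0.0], sigma_dec=[1.0, 1.0],
    mu_enc=[1.0, 0.0], sigma_enc=[1.0, 1.0],
    mu_pri=[0.0, 0.0], sigma_pri=[1.0, 1.0],
)
```

Training maximizes the sum of these terms over all time steps and all segments in $\mathcal{D}$.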

Consistent representation:

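The VRNN above implicitly encodes the ordering of the trajectories $k$. In the multi-agent setting, however, it matters that this ordering is consistent.
D1 and D2 (the same data with the agents listed in a different order) are different representations, and the model does not assign them the same probability. This ordering mismatch has to be dealt with.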

Graph networks

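Start from a feature vector $\mathrm{v}_{i}$ for each node.
Then, for each edge $\mathrm{e}_{(i, j)}$, derive a feature vector from the nodes it connects.
Next, send each edge feature vector as a "message" to the connected node and compute that node's new output state $\mathrm{o}_{i}$.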

\begin{aligned}& v \rightarrow e: \quad \mathrm{e}_{(i, j)}=f_{e}\left(\left[\mathrm{v}_{i}, \mathrm{v}_{j}\right]\right), \\& e \rightarrow v: \quad \mathrm{o}_{i}=f_{v}\left(\sum_{j \in N(i)} \mathrm{e}_{(i, j)}\right),\end{aligned}

A GNN takes the feature vectors $\mathrm{v}_{1: K}$ together with an adjacency matrix and outputs the node vectors $\mathrm{o}_{1: K}$, written $\mathrm{o}_{1: K}=\operatorname{GNN}\left(\mathrm{v}_{1: K}\right)$.
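
The two message-passing equations can be written out in a few lines of plain Python. This is a sketch under assumptions: `f_e` and `f_v` are hypothetical fixed functions standing in for learned networks, and the fully connected graph is built without self-loops, which these notes do not specify.

```python
def f_e(v_i, v_j):
    """Hypothetical edge function applied to the pair [v_i, v_j]."""
    return [a + 2.0 * b for a, b in zip(v_i, v_j)]


def f_v(message_sum):
    """Hypothetical node function (a ReLU) applied to the summed messages."""
    return [max(0.0, m) for m in message_sum]


def gnn(v, adjacency):
    """One round of message passing: v -> e, then e -> v."""
    K = len(v)
    out = []
    for i in range(K):
        total = [0.0] * len(v[i])
        for j in range(K):
            if adjacency[i][j]:                       # j is in N(i)
                e_ij = f_e(v[i], v[j])                # v -> e
                total = [t + e for t, e in zip(total, e_ij)]
        out.append(f_v(total))                        # e -> v
    return out


def fully_connected(K):
    """Adjacency of a fully connected graph (no self-loops: an assumption)."""
    return [[i != j for j in range(K)] for i in range(K)]


v = [[1.0, 0.0], [0.0, 2.0], [3.0, -1.0]]
o = gnn(v, fully_connected(3))

# Reordering the agents reorders the outputs in the same way.
perm = [2, 0, 1]
o_perm = gnn([v[p] for p in perm], fully_connected(3))
```

Because the messages are aggregated with a sum over neighbors, the output for a node does not depend on the order in which the other nodes are listed, which is what makes the layer permutation-equivariant on a fully connected graph.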

Graph VRNNs

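The model has an independent RNN and its own observations for each agent.
The interactions between agents at each step are modeled with a GNN in which every node is an agent and the graph is fully connected.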

\begin{aligned}p_{\theta}\left(\mathbf{z}^{t} \mid \mathbf{x}^{<t}, \mathbf{z}^{<t}\right) & =\prod_{k} \mathcal{N}\left(\mathbf{z}_{k}^{t} \mid \boldsymbol{\mu}_{\mathrm{pri}, k}^{t},\left(\boldsymbol{\sigma}_{\mathrm{pri}, k}^{t}\right)^{2}\right), \\q_{\phi}\left(\mathbf{z}^{t} \mid \mathbf{x}^{\leq t}, \mathbf{z}^{<t}\right) & =\prod_{k} \mathcal{N}\left(\mathbf{z}_{k}^{t} \mid \boldsymbol{\mu}_{\mathrm{enc}, k}^{t},\left(\boldsymbol{\sigma}_{\mathrm{enc}, k}^{t}\right)^{2}\right), \\p_{\theta}\left(\mathbf{x}^{t} \mid \mathbf{x}^{<t}, \mathbf{z}^{\leq t}\right) & =\prod_{k} \mathcal{N}\left(\mathbf{x}_{k}^{t} \mid \boldsymbol{\mu}_{\mathrm{dec}, k}^{t},\left(\boldsymbol{\sigma}_{\mathrm{dec}, k}^{t}\right)^{2}\right), \\h_{k}^{t} & =f_{\mathrm{rnn}}\left(\mathbf{x}_{k}^{t}, \mathbf{z}_{k}^{t}, h_{k}^{t-1}\right)\end{aligned}

where

\begin{aligned}{\left[\boldsymbol{\mu}_{\mathrm{pri}, 1: K}^{t}, \boldsymbol{\sigma}_{\mathrm{pri}, 1: K}^{t}\right] } & =\operatorname{GNN}_{\mathrm{pri}}\left(h_{1: K}^{t-1}\right), \\{\left[\boldsymbol{\mu}_{\mathrm{enc}, 1: K}^{t}, \boldsymbol{\sigma}_{\mathrm{enc}, 1: K}^{t}\right] } & =\operatorname{GNN}_{\mathrm{enc}}\left(\left[\mathbf{x}_{1: K}^{t}, h_{1: K}^{t-1}\right]\right), \\{\left[\boldsymbol{\mu}_{\mathrm{dec}, 1: K}^{t}, \boldsymbol{\sigma}_{\mathrm{dec}, 1: K}^{t}\right] } & =\operatorname{GNN}_{\mathrm{dec}}\left(\left[\mathbf{z}_{1: K}^{t}, h_{1: K}^{t-1}\right]\right)\end{aligned}
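
Since each distribution factorizes over agents, the joint log-density at one time step is a sum of per-agent Gaussian log-densities. The plain-Python sketch below illustrates this for the generation term. The per-agent means and standard deviations are given as toy numbers rather than computed by $\operatorname{GNN}_{\mathrm{dec}}$, and the function names are illustrative.

```python
import math


def agent_log_pdf(x_k, mu_k, sigma_k):
    """log N(x_k | mu_k, diag(sigma_k^2)) for one agent's 2D position."""
    return sum(
        -0.5 * math.log(2 * math.pi) - math.log(s) - 0.5 * ((xi - m) / s) ** 2
        for xi, m, s in zip(x_k, mu_k, sigma_k)
    )


def joint_log_pdf(x, mu, sigma):
    """log of prod_k N(x_k | mu_k, sigma_k^2): a sum over the K agents."""
    return sum(agent_log_pdf(xk, mk, sk) for xk, mk, sk in zip(x, mu, sigma))


# Toy step with K = 3 agents (positions, decoder means, decoder std devs).
x = [[0.0, 1.0], [2.0, 2.0], [5.0, -1.0]]
mu = [[0.1, 0.9], [2.5, 1.5], [4.0, 0.0]]
sigma = [[1.0, 1.0], [0.5, 0.5], [2.0, 2.0]]
log_p = joint_log_pdf(x, mu, sigma)

# Listing the agents in another order, consistently across x, mu, and sigma,
# leaves the joint log-density unchanged up to floating-point rounding.
perm = [2, 0, 1]
log_p_perm = joint_log_pdf(
    [x[p] for p in perm], [mu[p] for p in perm], [sigma[p] for p in perm]
)
```

Combined with a permutation-equivariant GNN that produces the per-agent parameters, this is what lets the model assign the same probability to a segment regardless of how its agents are ordered.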