1. Equations
\[p_ {\theta} (y_ t \vert \mathbf{X}_ {\text{I}}, \mathbf{X}_ {\text{V}}, y_ {\lt t}) = [ (p_ {\theta} ^ w) ] _ {w \in \mathcal{W}}\]
\[\hat{\theta}\]
\[\alpha_ {q, k} ^ \ell = 0 \quad \text{where} \quad \ell \in \mathcal{L}_ {\text{chunk}}, k \in \mathcal{I}\]
\[w^ \star = \arg \max_ {w \in \mathcal{W}} p_ {\theta} ^ w (y_ t \vert \mathbf{X}_ {\text{I}}, \mathbf{X}_ {\text{V}}, y_ {\lt t})\]
\[\Delta p_ t = p_ {\hat{\theta}} ^ {w^ \star} - p_ {\theta} ^ {w^ \star}\]
\[\text{JSD}_ t = \text{JSD} (p_ {\hat{\theta}} \Vert p_ {\theta} )\]
\[\Delta H_ t = H (p_ {\hat{\theta}} ) - H (p_ {\theta} )\]
Hidden 카테고리 내 다른 글 보러가기
댓글 남기기