Doorkeeper

Talk by Paul Pu Liang (CMU)

2019-01-08(火)11:00 - 12:30 JST

理化学研究所 革新知能統合研究センター

〒103-0027 東京都中央区日本橋1-4-1 日本橋一丁目三井ビルディング 15階 会議室3

申し込む

申し込み受付は終了しました

今後イベント情報を受け取る

参加費無料

詳細

Speaker: Paul Pu Liang (Carnegie Mellon University)

http://www.cs.cmu.edu/~pliang/

Title: Computational Modeling of Human Multimodal Language

Abstract: Computational modeling of human multimodal language is an emerging research area in natural language processing spanning the language, visual and acoustic modalities. Comprehending multimodal language requires not only the modeling of interactions within each modality (intra-modal interactions) but more importantly the interactions between modalities (cross-modal interactions). Modeling these interactions lie at the core of multimodal language analysis. This talk will describe several recent advances in modeling multimodal language from a machine learning perspective. We will cover models that involve synchronized recurrent networks, tensor products, gating mechanisms, Bayesian ranking algorithms, hybrid generative-discriminative objectives, and robust representation learning via modality translations. From a resource perspective, there is also a genuine need for large-scale datasets that allow for in-depth studies of human multimodal language. We will introduce the CMU-Multimodal Opinion Sentiment and Emotion Intensity (MOSEI), the largest dataset for multimodal sentiment analysis and emotion recognition. The talk will conclude with several open research directions in human language modeling and multimodal machine learning.

コミュニティについて

RIKEN AIP Public

RIKEN AIP Public

Public events of RIKEN Center for Advanced Intelligence Project (AIP)

メンバーになる