DeepCU: Integrating both Common and Unique Latent Information for Multimodal Sentiment Analysis

Publisher:
International Joint Conferences on Artificial Intelligence Organization
Publication Type:
Conference Proceeding
Citation:
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI-19), August 2019, pp. 3627-3634
Issue Date:
2019
© 2019 International Joint Conferences on Artificial Intelligence. All rights reserved. Multimodal sentiment analysis combines information available from visual, textual, and acoustic representations for sentiment prediction. Recent multimodal fusion schemes combine multiple modalities as a tensor and obtain either the common information, by utilizing neural networks, or the unique information, by modeling a low-rank representation of the tensor. However, both kinds of information are essential, as they capture the inter-modal and intra-modal relationships in the data. In this research, we first propose a novel deep architecture to extract the common information from the multi-mode representations. Furthermore, we propose unique networks to obtain the modality-specific information that enhances the generalization performance of our multimodal system. Finally, we integrate these two aspects of information via a fusion layer and propose a novel multimodal data fusion architecture, which we call DeepCU (Deep network with both Common and Unique latent information). The proposed DeepCU consolidates the two networks for joint utilization and discovery of all the important latent information. Comprehensive experiments on multiple real-world datasets demonstrate the effectiveness of utilizing both the common and the unique information discovered by DeepCU. The source code of the proposed DeepCU is available at https://github.com/sverma88/DeepCU-IJCAI19.
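To make the common-plus-unique idea concrete, below is a minimal PyTorch sketch of a fusion network in that spirit: per-modality ("unique") subnetworks, a "common" subnetwork over the outer-product tensor of the modalities, and a final fusion layer. All class names, layer sizes, and the choice of simple MLP subnetworks are illustrative assumptions, not the authors' exact architecture; refer to the linked repository for the actual implementation.

```python
import torch
import torch.nn as nn

class CommonUniqueFusionSketch(nn.Module):
    """Hypothetical sketch of a common + unique multimodal fusion network.
    Dimensions and layers are illustrative, not taken from the DeepCU paper."""
    def __init__(self, dim_v=32, dim_t=32, dim_a=32, hidden=64, num_outputs=1):
        super().__init__()
        # Unique (modality-specific) subnetworks: capture intra-modal information.
        self.unique_v = nn.Sequential(nn.Linear(dim_v, hidden), nn.ReLU(), nn.Linear(hidden, hidden))
        self.unique_t = nn.Sequential(nn.Linear(dim_t, hidden), nn.ReLU(), nn.Linear(hidden, hidden))
        self.unique_a = nn.Sequential(nn.Linear(dim_a, hidden), nn.ReLU(), nn.Linear(hidden, hidden))
        # Common subnetwork: consumes the flattened outer-product tensor of all
        # three modalities to model inter-modal (common) information.
        self.common = nn.Sequential(
            nn.Linear(dim_v * dim_t * dim_a, hidden), nn.ReLU(), nn.Linear(hidden, hidden)
        )
        # Fusion layer: integrates the common and unique representations.
        self.fusion = nn.Linear(hidden * 4, num_outputs)

    def forward(self, v, t, a):
        # Outer product across modalities -> 3-mode interaction tensor per sample.
        tensor = torch.einsum('bi,bj,bk->bijk', v, t, a).flatten(start_dim=1)
        common = self.common(tensor)
        uniques = [self.unique_v(v), self.unique_t(t), self.unique_a(a)]
        joint = torch.cat([common] + uniques, dim=1)
        return self.fusion(joint)  # sentiment prediction

# Toy usage with random features standing in for visual, textual, acoustic inputs.
model = CommonUniqueFusionSketch()
v, t, a = torch.randn(8, 32), torch.randn(8, 32), torch.randn(8, 32)
print(model(v, t, a).shape)  # torch.Size([8, 1])
```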