Improving Multimodal fusion via Mutual Dependency Maximisation

Colombo, Pierre; Chapuis, Emile; Labeau, Matthieu; Clavel, Chloe

arXiv:2109.00922 (cs)

[Submitted on 31 Aug 2021 (v1), last revised 9 Sep 2021 (this version, v2)]

Abstract: Multimodal sentiment analysis is a trending area of research, and multimodal fusion is one of its most active topics. Acknowledging that humans communicate through a variety of channels (i.e., visual, acoustic, linguistic), multimodal systems aim to integrate different unimodal representations into a single synthetic one. So far, considerable effort has gone into developing complex architectures that fuse these modalities. However, such systems are mainly trained by minimising simple losses such as $L_1$ or cross-entropy. In this work, we investigate unexplored penalties and propose a set of new objectives that measure the dependency between modalities. We demonstrate that our new penalties lead to a consistent improvement (up to $4.3$ in accuracy) across a large variety of state-of-the-art models on two well-known sentiment analysis datasets: CMU-MOSI and CMU-MOSEI. Our method not only achieves a new SOTA on both datasets but also produces representations that are more robust to modality drops. Finally, a by-product of our methods is a statistical network that can be used to interpret the high-dimensional representations learnt by the model.
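
As a rough illustration of what a dependency-based penalty can look like, the sketch below subtracts a weighted dependency estimate between two modality representations from a plain $L_1$ loss. It is not taken from the paper: the InfoNCE-style estimator, the dot-product critic, the function names `infonce_lower_bound` and `penalised_loss`, and the weight `lam` are all assumptions chosen for illustration, and the abstract does not say which dependency measures the authors actually use.

```python
import numpy as np


def infonce_lower_bound(x: np.ndarray, y: np.ndarray) -> float:
    """InfoNCE-style lower bound on the mutual information between paired rows.

    x, y: arrays of shape (n, d); row i of x is paired with row i of y.
    The critic is a plain dot product (an assumption made for this sketch).
    The returned value never exceeds log(n).
    """
    n = x.shape[0]
    scores = x @ y.T  # scores[i, j]: critic value for the pair (x_i, y_j)
    # Numerically stable log-sum-exp over each row of scores.
    row_max = scores.max(axis=1, keepdims=True)
    log_norm = row_max[:, 0] + np.log(np.exp(scores - row_max).sum(axis=1))
    # Matched pairs sit on the diagonal; the other entries act as negatives.
    return float(np.mean(np.diag(scores) - log_norm) + np.log(n))


def penalised_loss(pred, target, x, y, lam=0.1):
    """L1 task loss minus lam times the dependency estimate (hypothetical form)."""
    l1 = float(np.mean(np.abs(np.asarray(pred) - np.asarray(target))))
    return l1 - lam * infonce_lower_bound(x, y)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    text = rng.normal(size=(8, 16))                 # stand-in text features
    audio = text + 0.1 * rng.normal(size=(8, 16))   # strongly dependent stand-in
    print(infonce_lower_bound(text, audio))
    print(penalised_loss(rng.normal(size=8), rng.normal(size=8), text, audio))
```

Minimising such a loss pushes the model to fit the labels while keeping the two representations statistically dependent; in practice the critic would be learnt rather than fixed.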

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computation and Language (cs.CL)
Cite as:	arXiv:2109.00922 [cs.LG]
	(or arXiv:2109.00922v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2109.00922
Journal reference:	EMNLP 2021

Submission history

From: Pierre Colombo
[v1] Tue, 31 Aug 2021 06:26:26 UTC (421 KB)
[v2] Thu, 9 Sep 2021 16:14:59 UTC (785 KB)
