LSMI-Sinkhorn: Semi-supervised Mutual Information Estimation with Optimal Transport

Springer International Publishing
Machine Learning and Knowledge Discovery in Databases: Research Track, 2021, LNAI 12975, pp. 655-670
Estimating mutual information is an important problem in statistics and machine learning. A common practice for estimating mutual information from data is to prepare a set of paired samples {(x_i, y_i)}_{i=1}^{n} drawn i.i.d. from p(x, y). However, in many situations it is difficult to obtain a large number of data pairs. To address this problem, we propose a semi-supervised Squared-loss Mutual Information (SMI) estimation method that uses a small number of paired samples together with the available unpaired ones. We first represent SMI through the density-ratio function, where the expectation is approximated using samples from the marginals and their assignment parameters. The objective is formulated as an optimal transport problem combined with quadratic programming. We then introduce the Least-Squares Mutual Information with Sinkhorn (LSMI-Sinkhorn) algorithm for efficient optimization. Through experiments, we first demonstrate that the proposed method can estimate SMI without a large number of paired samples. We then show the effectiveness of the proposed LSMI-Sinkhorn algorithm on various machine learning problems such as image matching and photo album summarization. Code can be found at
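To make the optimal transport component concrete, the following is a minimal NumPy sketch of entropic-regularized optimal transport solved by Sinkhorn iterations, the subroutine that gives LSMI-Sinkhorn its name. It is an illustration only, not the authors' implementation: the full method additionally fits the density-ratio model and couples it with the assignment parameters, which is omitted here; all names and parameter values below are assumptions.

```python
import numpy as np

def sinkhorn(C, a, b, eps=0.5, n_iter=500):
    """Entropic-regularized optimal transport via Sinkhorn iterations (sketch).

    C: (n, m) cost matrix between unpaired samples of x and y.
    a, b: marginal weights (non-negative, each summing to 1).
    Returns a transport plan P whose row sums match a and column sums match b.
    """
    K = np.exp(-C / eps)        # Gibbs kernel from the cost matrix
    u = np.ones_like(a)
    for _ in range(n_iter):
        v = b / (K.T @ u)       # scale columns toward marginal b
        u = a / (K @ v)         # scale rows toward marginal a
    return u[:, None] * K * v[None, :]

# Toy example: soft assignment between two small point clouds.
rng = np.random.default_rng(0)
x = rng.normal(size=(5, 2))
y = rng.normal(size=(4, 2))
C = ((x[:, None, :] - y[None, :, :]) ** 2).sum(-1)  # squared Euclidean costs
a = np.full(5, 1 / 5)
b = np.full(4, 1 / 4)
P = sinkhorn(C, a, b)
```

The resulting plan P plays the role of soft pairing weights: in the paper's setting, such weights let the expectation over p(x, y) be approximated from unpaired marginal samples.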