How unlabeled web videos help complex event detection?

Publisher:
IJCAI-INT JOINT CONF ARTIF INTELL
Publication Type:
Conference Proceeding
Citation:
IJCAI International Joint Conference on Artificial Intelligence, 2017, 0, pp. 4040-4046
Issue Date:
2017-01-01
Abstract:
The lack of labeled exemplars is an important factor that makes the task of multimedia event detection (MED) complicated and challenging. Utilizing manually selected and labeled external sources is an effective way to enhance MED performance. However, building such data usually requires professional human annotators, and the procedure is too time-consuming and costly to scale. In this paper, we propose a new robust dictionary learning framework for complex event detection that handles both labeled videos and easy-to-obtain unlabeled web videos by sharing the same dictionary. By employing an lq-norm based loss jointly with structured sparsity based regularization, our model shows strong robustness against the substantial noise and outlier videos found in open web sources. We develop an effective optimization algorithm for the resulting highly non-smooth and non-convex problem. Extensive experimental results on the standard TRECVID MEDTest 2013 and TRECVID MEDTest 2014 datasets demonstrate the effectiveness and superiority of the proposed framework for complex event detection.
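The general technique described in the abstract — fitting one shared dictionary to labeled and unlabeled video features under a robust lq-norm loss with an l2,1 structured sparsity regularizer — can be sketched with an iteratively reweighted alternating least-squares scheme. This is a minimal illustrative sketch, not the paper's exact algorithm: the function name, the IRLS-style solver, and all parameter values are assumptions introduced here for illustration.

```python
# Illustrative sketch (NOT the authors' exact method): robust dictionary
# learning, min_{D,A} sum_i ||x_i - D a_i||_2^q + lam * ||A||_{2,1},
# approximated by iteratively reweighted alternating least squares (IRLS).
import numpy as np

def robust_dictionary_learning(X, n_atoms=8, q=0.5, lam=0.01,
                               n_iters=40, eps=1e-6, seed=0):
    """X: (d, n) feature matrix pooling labeled and unlabeled videos.
    Returns a dictionary D of shape (d, n_atoms) and codes A of (n_atoms, n)."""
    rng = np.random.default_rng(seed)
    d, n = X.shape
    D = rng.standard_normal((d, n_atoms))
    D /= np.linalg.norm(D, axis=0, keepdims=True)   # unit-norm atoms
    A = rng.standard_normal((n_atoms, n)) * 0.01
    for _ in range(n_iters):
        R = X - D @ A
        # lq loss -> per-sample IRLS weight w_i ~ (||r_i||^2 + eps)^(q/2 - 1):
        # large residuals (noisy/outlier videos) get small weight.
        w = (q / 2.0) * (np.sum(R**2, axis=0) + eps) ** (q / 2.0 - 1.0)
        # l2,1 regularizer -> per-row reweighting of the codes.
        g = lam / (2.0 * (np.sqrt(np.sum(A**2, axis=1)) + eps))
        # Code update, column by column:
        # (w_i D^T D + diag(g)) a_i = w_i D^T x_i
        DtD, DtX = D.T @ D, D.T @ X
        for i in range(n):
            A[:, i] = np.linalg.solve(w[i] * DtD + np.diag(g),
                                      w[i] * DtX[:, i])
        # Dictionary update: weighted least squares with the same weights.
        W = A * w                                       # A diag(w)
        G = W @ A.T + 1e-8 * np.eye(n_atoms)
        D = (X @ W.T) @ np.linalg.inv(G)
        # Renormalize atoms and rescale codes so D @ A is unchanged.
        norms = np.linalg.norm(D, axis=0, keepdims=True)
        norms[norms < eps] = 1.0
        D /= norms
        A *= norms.T
    return D, A
```

Because the per-sample weights shrink as residual norms grow, heavily corrupted web videos contribute little to either update, which is the intuition behind using a non-smooth lq loss (q < 1) for robustness.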