Very low complexity convolutional neural network for quadtree structures

Caruana, A; Vidal-Calleja, T

Publication Type: Conference Proceeding
Citation: Australasian Conference on Robotics and Automation, ACRA, 2018, 2018-December
Issue Date: 2018-01-01

This item is open access.

Full metadata record

Field	Value	Language
dc.contributor.author	Caruana, A	en_US
dc.contributor.author	Vidal-Calleja, T https://orcid.org/0000-0002-5763-9644	en_US
dc.date.available	2018-11-12	en_US
dc.date.issued	2018-01-01	en_US
dc.identifier.citation	Australasian Conference on Robotics and Automation, ACRA, 2018, 2018-December	en_US
dc.identifier.issn	1448-2053	en_US
dc.identifier.uri	http://hdl.handle.net/10453/129917
dc.relation.ispartof	Australasian Conference on Robotics and Automation, ACRA	en_US
dc.rights	info:eu-repo/semantics/openAccess
dc.title	Very low complexity convolutional neural network for quadtree structures	en_US
dc.type	Conference Proceeding
utslib.citation.volume	2018-December	en_US
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Mechanical and Mechatronic Engineering
pubs.organisational-group	/University of Technology Sydney/Strength - CAS - Centre for Autonomous Systems
utslib.copyright.status	open_access	*
pubs.publication-status	Published	en_US
pubs.volume	2018-December	en_US

Abstract:

© 2018 Australasian Robotics and Automation Association. All rights reserved.

In this paper, we present a Very Low Complexity Convolutional Neural Network (VLC-CNN) for generating quadtree data structures for image segmentation. Quadtree encodings of images are used in video encoding and robotic perception; examples include the Coding Tree Unit in the High Efficiency Video Coding (HEVC) standard and Occupancy Grid Maps (OGM), which represent environments with a variable grid size. Whereas existing methods for determining quadtree structures include brute-force algorithms and heuristics, this paper describes the use of a Convolutional Neural Network (CNN) to predict the quadtree structure. CNNs traditionally require substantial computational and memory resources; VLC-CNN, however, exploits downsampling and integer-only quantised arithmetic to achieve minimal complexity. This minimal design makes VLC-CNN feasible to implement in real-time or memory-constrained processing applications.
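For readers unfamiliar with the data structure, the following is a minimal sketch of a heuristic quadtree decomposition of the kind the abstract contrasts with the CNN approach. It is not the authors' VLC-CNN and is not taken from the paper. The names `QuadNode`, `block_range`, `build_quadtree` and `leaves`, the max-minus-min split criterion, and the `threshold` and `min_size` parameters are all assumptions chosen for illustration. The sketch also assumes a square greyscale image whose side length is a power of two.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Sequence


@dataclass
class QuadNode:
    """One square block of the image (hypothetical helper type)."""
    x: int
    y: int
    size: int
    children: List["QuadNode"] = field(default_factory=list)

    @property
    def is_leaf(self) -> bool:
        return not self.children


def block_range(image: Sequence[Sequence[int]], x: int, y: int, size: int) -> int:
    """Return max minus min pixel value inside the square block."""
    values = [image[r][c] for r in range(y, y + size) for c in range(x, x + size)]
    return max(values) - min(values)


def build_quadtree(
    image: Sequence[Sequence[int]],
    x: int = 0,
    y: int = 0,
    size: Optional[int] = None,
    threshold: int = 0,
    min_size: int = 1,
) -> QuadNode:
    """Recursively split a block into four quadrants while it is non-uniform.

    Assumed heuristic: split when the pixel range exceeds `threshold`.
    A learned predictor would replace this test with a model's split decision.
    """
    if size is None:
        size = len(image)  # assumes a square, power-of-two image
    node = QuadNode(x, y, size)
    if size > min_size and block_range(image, x, y, size) > threshold:
        half = size // 2
        for dy in (0, half):
            for dx in (0, half):
                node.children.append(
                    build_quadtree(image, x + dx, y + dy, half, threshold, min_size)
                )
    return node


def leaves(node: QuadNode) -> List[QuadNode]:
    """Collect the leaf blocks, i.e. the final variable-size segmentation."""
    if node.is_leaf:
        return [node]
    result: List[QuadNode] = []
    for child in node.children:
        result.extend(leaves(child))
    return result
```

Calling `leaves(build_quadtree(image))` on a small nested list of pixel values returns the variable-size blocks. Uniform regions stay as large blocks, while regions containing detail are subdivided down to `min_size`.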

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/129917