Classification of very high resolution aerial photos using spectral-spatial convolutional neural networks

Sameen, MI; Pradhan, B; Aziz, OS

Classification of very high resolution aerial photos using spectral-spatial convolutional neural networks

Sameen, MI Pradhan, B

Aziz, OS

Permalink

Publication Type:: Journal Article
Citation:: Journal of Sensors, 2018, 2018
Issue Date:: 2018-01-01

Open Access

Copyright Clearance Process

Recently Added
In Progress
Open Access

This item is open access.

Adobe PDF

Download Published VersionAdobe PDF (34.52 MB)

View on publisher's site

View statistics

Full metadata record

Field	Value	Language
dc.contributor.author	Sameen, MI	en_US
dc.contributor.author	Pradhan, B https://orcid.org/0000-0001-9863-2054	en_US
dc.contributor.author	Aziz, OS	en_US
dc.date.issued	2018-01-01	en_US
dc.identifier.citation	Journal of Sensors, 2018, 2018	en_US
dc.identifier.issn	1687-725X	en_US
dc.identifier.uri	http://hdl.handle.net/10453/130221
dc.description.abstract	© 2018 Maher Ibrahim Sameen et al. Classification of aerial photographs relying purely on spectral content is a challenging topic in remote sensing. A convolutional neural network (CNN) was developed to classify aerial photographs into seven land cover classes such as building, grassland, dense vegetation, waterbody, barren land, road, and shadow. The classifier utilized spectral and spatial contents of the data to maximize the accuracy of the classification process. CNN was trained from scratch with manually created ground truth samples. The architecture of the network comprised of a single convolution layer of 32 filters and a kernel size of 3 × 3, pooling size of 2 × 2, batch normalization, dropout, and a dense layer with Softmax activation. The design of the architecture and its hyperparameters were selected via sensitivity analysis and validation accuracy. The results showed that the proposed model could be effective for classifying the aerial photographs. The overall accuracy and Kappa coefficient of the best model were 0.973 and 0.967, respectively. In addition, the sensitivity analysis suggested that the use of dropout and batch normalization technique in CNN is essential to improve the generalization performance of the model. The CNN model without the techniques above achieved the worse performance, with an overall accuracy and Kappa of 0.932 and 0.922, respectively. This research shows that CNN-based models are robust for land cover classification using aerial photographs. However, the architecture and hyperparameters of these models should be carefully selected and optimized.	en_US
dc.relation.ispartof	Journal of Sensors	en_US
dc.relation.isbasedon	10.1155/2018/7195432	en_US
dc.rights	info:eu-repo/semantics/openAccess
dc.title	Classification of very high resolution aerial photos using spectral-spatial convolutional neural networks	en_US
dc.type	Journal Article
utslib.citation.volume	2018	en_US
utslib.for	0801 Artificial Intelligence and Image Processing	en_US
utslib.for	0303 Macromolecular and Materials Chemistry	en_US
utslib.for	0306 Physical Chemistry (incl. Structural)	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Information, Systems and Modelling
pubs.organisational-group	/University of Technology Sydney/Strength - CAMGIS - Centre for Advanced Modelling and Geospatial lnformation Systems
utslib.copyright.status	open_access	*
pubs.publication-status	Published	en_US
pubs.volume	2018	en_US

Abstract:

© 2018 Maher Ibrahim Sameen et al. Classification of aerial photographs relying purely on spectral content is a challenging topic in remote sensing. A convolutional neural network (CNN) was developed to classify aerial photographs into seven land cover classes such as building, grassland, dense vegetation, waterbody, barren land, road, and shadow. The classifier utilized spectral and spatial contents of the data to maximize the accuracy of the classification process. CNN was trained from scratch with manually created ground truth samples. The architecture of the network comprised of a single convolution layer of 32 filters and a kernel size of 3 × 3, pooling size of 2 × 2, batch normalization, dropout, and a dense layer with Softmax activation. The design of the architecture and its hyperparameters were selected via sensitivity analysis and validation accuracy. The results showed that the proposed model could be effective for classifying the aerial photographs. The overall accuracy and Kappa coefficient of the best model were 0.973 and 0.967, respectively. In addition, the sensitivity analysis suggested that the use of dropout and batch normalization technique in CNN is essential to improve the generalization performance of the model. The CNN model without the techniques above achieved the worse performance, with an overall accuracy and Kappa of 0.932 and 0.922, respectively. This research shows that CNN-based models are robust for land cover classification using aerial photographs. However, the architecture and hyperparameters of these models should be carefully selected and optimized.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/130221