Deep local-to-global feature learning for medical image super-resolution.

Huang, W; Liao, X; Chen, H; Hu, Y; Jia, W; Wang, Q

Deep local-to-global feature learning for medical image super-resolution.

Huang, W Liao, X Chen, H Hu, Y Jia, W

Wang, Q

Permalink

Publisher:: Elsevier
Publication Type:: Journal Article
Citation:: Comput Med Imaging Graph, 2024, 115, pp. 102374
Issue Date:: 2024-03-26

Embargoed

	Filename	Description	Size
	Deep local-to-global feature learning for medical image super-resolution.pdf	Accepted version	16.1 MB	Adobe PDF	View/Open

Copyright Clearance Process

Recently Added
In Progress
Embargoed
Open Access

This item is currently unavailable due to the publisher's embargo.

The embargo period expires on 1 Jul 2025

Full metadata record

Field	Value	Language
dc.contributor.author	Huang, W
dc.contributor.author	Liao, X
dc.contributor.author	Chen, H
dc.contributor.author	Hu, Y
dc.contributor.author	Jia, W https://orcid.org/0000-0002-0940-3338
dc.contributor.author	Wang, Q
dc.date.accessioned	2024-05-04T06:11:03Z
dc.date.available	2024-03-19
dc.date.available	2024-05-04T06:11:03Z
dc.date.issued	2024-03-26
dc.identifier.citation	Comput Med Imaging Graph, 2024, 115, pp. 102374
dc.identifier.issn	0895-6111
dc.identifier.issn	1879-0771
dc.identifier.uri	http://hdl.handle.net/10453/178652
dc.description.abstract	Medical images play a vital role in medical analysis by providing crucial information about patients' pathological conditions. However, the quality of these images can be compromised by many factors, such as limited resolution of the instruments, artifacts caused by movements, and the complexity of the scanned areas. As a result, low-resolution (LR) images cannot provide sufficient information for diagnosis. To address this issue, researchers have attempted to apply image super-resolution (SR) techniques to restore the high-resolution (HR) images from their LR counterparts. However, these techniques are designed for generic images, and thus suffer from many challenges unique to medical images. An obvious one is the diversity of the scanned objects; for example, the organs, tissues, and vessels typically appear in different sizes and shapes, and are thus hard to restore with standard convolution neural networks (CNNs). In this paper, we develop a dynamic-local learning framework to capture the details of these diverse areas, consisting of deformable convolutions with adjustable kernel shapes. Moreover, the global information between the tissues and organs is vital for medical diagnosis. To preserve global information, we propose pixel-pixel and patch-patch global learning using a non-local mechanism and a vision transformer (ViT), respectively. The result is a novel CNN-ViT neural network with Local-to-Global feature learning for medical image SR, referred to as LGSR, which can accurately restore both local details and global information. We evaluate our method on six public datasets and one large-scale private dataset, which include five different types of medical images (i.e., Ultrasound, OCT, Endoscope, CT, and MRI images). Experiments show that the proposed method achieves superior PSNR/SSIM and visual performance than the state of the arts with competitive computational costs, measured in network parameters, runtime, and FLOPs. What is more, the experiment conducted on OCT image segmentation for the downstream task demonstrates a significantly positive performance effect of LGSR.
dc.format	Print-Electronic
dc.language	eng
dc.publisher	Elsevier
dc.relation.ispartof	Comput Med Imaging Graph
dc.relation.isbasedon	10.1016/j.compmedimag.2024.102374
dc.rights	info:eu-repo/semantics/embargoedAccess
dc.subject	0903 Biomedical Engineering, 1103 Clinical Sciences
dc.subject.classification	Nuclear Medicine & Medical Imaging
dc.subject.classification	3202 Clinical sciences
dc.subject.classification	4003 Biomedical engineering
dc.subject.classification	4603 Computer vision and multimedia computation
dc.title	Deep local-to-global feature learning for medical image super-resolution.
dc.type	Journal Article
utslib.citation.volume	115
utslib.location.activity	United States
utslib.for	0903 Biomedical Engineering
utslib.for	1103 Clinical Sciences
pubs.organisational-group	University of Technology Sydney
pubs.organisational-group	University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	University of Technology Sydney/Strength - GBDTC - Global Big Data Technologies
pubs.organisational-group	University of Technology Sydney/Faculty of Engineering and Information Technology/School of Electrical and Data Engineering
utslib.copyright.status	embargoed	*
utslib.copyright.embargo	2025-07-01T00:00:00+1000Z
dc.date.updated	2024-05-04T06:10:33Z
pubs.publication-status	Published online
pubs.volume	115

Abstract:

Medical images play a vital role in medical analysis by providing crucial information about patients' pathological conditions. However, the quality of these images can be compromised by many factors, such as limited resolution of the instruments, artifacts caused by movements, and the complexity of the scanned areas. As a result, low-resolution (LR) images cannot provide sufficient information for diagnosis. To address this issue, researchers have attempted to apply image super-resolution (SR) techniques to restore the high-resolution (HR) images from their LR counterparts. However, these techniques are designed for generic images, and thus suffer from many challenges unique to medical images. An obvious one is the diversity of the scanned objects; for example, the organs, tissues, and vessels typically appear in different sizes and shapes, and are thus hard to restore with standard convolution neural networks (CNNs). In this paper, we develop a dynamic-local learning framework to capture the details of these diverse areas, consisting of deformable convolutions with adjustable kernel shapes. Moreover, the global information between the tissues and organs is vital for medical diagnosis. To preserve global information, we propose pixel-pixel and patch-patch global learning using a non-local mechanism and a vision transformer (ViT), respectively. The result is a novel CNN-ViT neural network with Local-to-Global feature learning for medical image SR, referred to as LGSR, which can accurately restore both local details and global information. We evaluate our method on six public datasets and one large-scale private dataset, which include five different types of medical images (i.e., Ultrasound, OCT, Endoscope, CT, and MRI images). Experiments show that the proposed method achieves superior PSNR/SSIM and visual performance than the state of the arts with competitive computational costs, measured in network parameters, runtime, and FLOPs. What is more, the experiment conducted on OCT image segmentation for the downstream task demonstrates a significantly positive performance effect of LGSR.

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/178652