High-Performance and Interpretable 3D Point Cloud Analysis

Publication Type: Thesis
Issue Date: 2025
Abstract:
Three-dimensional computer vision is advancing rapidly, yet processing large, dynamic scenes still faces limits in feature extraction, clustering-based learning, efficiency, and model interpretability. This thesis proposes four complementary methods to address these gaps. Cluster3D is a clustering-driven representation learning approach that discovers fine-grained subclass patterns in point clouds and improves supervised segmentation on both static and dynamic data. LSK3DNet is a 3D backbone with large sparse kernels that uses spatial-wise dynamic sparsity and channel-wise weight selection to reduce computation while boosting accuracy for semantic segmentation and object detection. Interpretable3D is a prototype-based classifier that embeds interpretability directly into the architecture, providing transparent, case-level explanations while maintaining competitive results on shape classification and part segmentation. Shape2Scene is a scalable pretraining strategy that bridges shape-level learning and scene-level tasks, delivering stronger transfer and better scalability than prior approaches. Experiments across multiple benchmarks validate these contributions with state-of-the-art or competitive accuracy and notable efficiency gains. Overall, the thesis advances feature learning, clustering-based learning, efficiency, and interpretability for 3D perception, enabling more robust and accountable systems for real-world applications such as autonomous driving, robotics, and augmented reality.
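The general idea behind a prototype-based classifier, as described for Interpretable3D above, can be sketched as follows. This is a minimal illustrative example, not the thesis's actual architecture: the prototype shapes, distance metric, and function names here are assumptions.

```python
import numpy as np

# Minimal sketch of prototype-based classification: each class is represented
# by a prototype vector in feature space; a sample is assigned to the class
# whose prototype is nearest. The per-class distances themselves serve as a
# transparent, case-level explanation ("this sample looks most like prototype k").
# All names and shapes are illustrative, not Interpretable3D's actual API.

rng = np.random.default_rng(0)

num_classes, feat_dim = 3, 8
prototypes = rng.normal(size=(num_classes, feat_dim))  # one prototype per class

def classify(feature: np.ndarray):
    """Return the predicted class index and per-class distances (the explanation)."""
    dists = np.linalg.norm(prototypes - feature, axis=1)
    return int(np.argmin(dists)), dists

# A feature lying exactly on prototype 1 is assigned to class 1 with distance 0.
pred, dists = classify(prototypes[1])
assert pred == 1 and dists[1] == 0.0
```

In practice the prototypes would be learned jointly with a point-cloud feature encoder, but the nearest-prototype decision rule is what makes each prediction directly attributable to an interpretable exemplar.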