VQSVM: A case study for incorporating prior domain knowledge into inductive machine learning

Yu, T; Simoff, S; Jan, T

Publication Type: Journal Article
Citation: Neurocomputing, 2010, 73 (13-15), pp. 2614 - 2623
Issue Date: 2010-08-01

Closed Access

	Filename	Description	Size	Format
	2009007499OK.pdf		460.73 kB	Adobe PDF

This item is closed access and not available.

Full metadata record

Field	Value	Language
dc.contributor.author	Yu, T	en_US
dc.contributor.author	Simoff, S	en_US
dc.contributor.author	Jan, T	en_US
dc.date.issued	2010-08-01	en_US
dc.identifier.citation	Neurocomputing, 2010, 73 (13-15), pp. 2614 - 2623	en_US
dc.identifier.issn	0925-2312	en_US
dc.identifier.uri	http://hdl.handle.net/10453/13780
dc.description.abstract	When dealing with real-world problems, there is a considerable amount of prior domain knowledge that can provide insights on various aspects of the problem. On the other hand, many machine learning methods rely solely on the data sets for their learning phase and do not take into account any explicitly expressed domain knowledge. This paper proposes a framework that investigates and enables the incorporation of prior domain knowledge with respect to three key characteristics of inductive machine learning algorithms: consistency, generalization and convergence. The framework is used to review, classify and analyse key existing approaches to incorporating domain knowledge into inductive machine learning, as well as to consider the risks of doing so. The paper also demonstrates the design of a novel hierarchical semi-parametric machine learning method capable of incorporating prior domain knowledge. The method, VQSVM, extends the support vector machine (SVM) family of methods with vector quantization (VQ) techniques to address the problem of learning from imbalanced data sets. The paper presents the results of testing the method on a collection of imbalanced data sets with various imbalance ratios and various numbers of subclasses. The learning process of the VQSVM method utilizes some domain knowledge to solve the problem of fitting imbalanced data. The experiments in the paper demonstrate that enabling the incorporation of prior domain knowledge into the SVM framework is an effective way to overcome the sensitivity of SVM towards the imbalance ratio in a data set. © 2010 Elsevier B.V.	en_US
dc.relation.ispartof	Neurocomputing	en_US
dc.relation.isbasedon	10.1016/j.neucom.2010.05.007	en_US
dc.subject.classification	Artificial Intelligence & Image Processing	en_US
dc.title	VQSVM: A case study for incorporating prior domain knowledge into inductive machine learning	en_US
dc.type	Journal Article
utslib.citation.volume	13-15	en_US
utslib.citation.volume	73	en_US
utslib.for	110999 Neurosciences not elsewhere classified	en_US
utslib.for	08 Information and Computing Sciences	en_US
utslib.for	09 Engineering	en_US
utslib.for	17 Psychology and Cognitive Sciences	en_US
pubs.embargo.period	Not known	en_US
pubs.organisational-group	/University of Technology Sydney
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Electrical and Data Engineering
pubs.organisational-group	/University of Technology Sydney/Faculty of Engineering and Information Technology/School of Software
utslib.copyright.status	closed_access
pubs.issue	13-15	en_US
pubs.publication-status	Published	en_US
pubs.volume	73	en_US

Abstract:

When dealing with real-world problems, there is a considerable amount of prior domain knowledge that can provide insights on various aspects of the problem. On the other hand, many machine learning methods rely solely on the data sets for their learning phase and do not take into account any explicitly expressed domain knowledge. This paper proposes a framework that investigates and enables the incorporation of prior domain knowledge with respect to three key characteristics of inductive machine learning algorithms: consistency, generalization and convergence. The framework is used to review, classify and analyse key existing approaches to incorporating domain knowledge into inductive machine learning, as well as to consider the risks of doing so. The paper also demonstrates the design of a novel hierarchical semi-parametric machine learning method capable of incorporating prior domain knowledge. The method, VQSVM, extends the support vector machine (SVM) family of methods with vector quantization (VQ) techniques to address the problem of learning from imbalanced data sets. The paper presents the results of testing the method on a collection of imbalanced data sets with various imbalance ratios and various numbers of subclasses. The learning process of the VQSVM method utilizes some domain knowledge to solve the problem of fitting imbalanced data. The experiments in the paper demonstrate that enabling the incorporation of prior domain knowledge into the SVM framework is an effective way to overcome the sensitivity of SVM towards the imbalance ratio in a data set. © 2010 Elsevier B.V.
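
The abstract does not spell out how VQ and SVM are combined, so the following is only a rough sketch of one plausible reading: the majority class is compressed into a small codebook by a vector quantizer, and an SVM is then trained on the codebook plus the untouched minority class. The k-means quantizer, the linear SVM trained by subgradient descent, the synthetic two-cluster data, and every function name and parameter below are illustrative assumptions, not the authors' implementation; the hierarchical, semi-parametric aspects of the method are not modelled.

```python
import random


def kmeans(points, k, iters=20, seed=0):
    """Lloyd's algorithm, used here as a basic vector quantizer (assumed choice)."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            # Assign each point to its nearest codebook vector.
            nearest = min(
                range(k),
                key=lambda c: sum((a - v) ** 2 for a, v in zip(p, centers[c])),
            )
            clusters[nearest].append(p)
        # Move each center to the mean of its cluster; keep it if the cluster is empty.
        centers = [
            tuple(sum(col) / len(cl) for col in zip(*cl)) if cl else centers[j]
            for j, cl in enumerate(clusters)
        ]
    return centers


def train_linear_svm(X, y, lam=0.01, lr=0.01, epochs=200, seed=0):
    """Linear soft-margin SVM fitted by stochastic subgradient descent on hinge loss."""
    rng = random.Random(seed)
    w, b = [0.0] * len(X[0]), 0.0
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)
        for i in order:
            margin = y[i] * (sum(wj * xj for wj, xj in zip(w, X[i])) + b)
            w = [(1 - lr * lam) * wj for wj in w]  # L2 shrinkage step
            if margin < 1:  # hinge loss is active for this sample
                w = [wj + lr * y[i] * xj for wj, xj in zip(w, X[i])]
                b += lr * y[i]
    return w, b


def predict(w, b, x):
    return 1 if sum(wj * xj for wj, xj in zip(w, x)) + b >= 0 else -1


# Synthetic imbalanced data (hypothetical): 200 majority vs 10 minority points.
rng = random.Random(0)
majority = [(rng.gauss(0, 0.5), rng.gauss(0, 0.5)) for _ in range(200)]
minority = [(rng.gauss(3, 0.5), rng.gauss(3, 0.5)) for _ in range(10)]

# Quantize the majority class down to as many prototypes as there are minority points.
codebook = kmeans(majority, k=len(minority))

# Train on the rebalanced set: codebook vectors (label -1) and minority points (label +1).
X = codebook + minority
y = [-1] * len(codebook) + [1] * len(minority)
w, b = train_linear_svm(X, y)
```

In this sketch the 20:1 imbalance is reduced to 1:1 before the SVM sees any data, which is the intuition behind using quantization to reduce sensitivity to the imbalance ratio; the codebook size is a free parameter that prior knowledge about the number of subclasses could inform.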

Please use this identifier to cite or link to this item:

http://hdl.handle.net/10453/13780