Symbolic and Statistical Learning Approaches to Speech Summarization: A Scoping Review

Publisher: Elsevier BV
Publication Type: Journal Article
Published in: Computer Speech and Language, 2022, 72, pp. 101305
Speech summarization techniques take human speech as input and output an abridged version as text or speech. Speech summarization has applications in many domains, from information technology to health care, for example improving speech archives or reducing clinical documentation burden. This scoping review maps close to two decades of speech summarization literature, spanning the early machine-learning work through ensemble models, with no restrictions on the language summarized, the research method, or the paper type. We reviewed a total of 110 papers out of a set of 188 found through a literature search and extracted the speech features used, methods, scope, and training corpora. Most studies employ one of four speech summarization architectures: (1) sentence extraction and compaction; (2) feature extraction and classification or rank-based sentence selection; (3) sentence compression and compression summarization; and (4) language modelling. We also discuss the strengths and weaknesses of these different methods and speech features. Overall, supervised methods (e.g. hidden Markov support vector machines, ranking support vector machines, conditional random fields) performed better than unsupervised methods. Because supervised methods require manually annotated training data, which can be costly to produce, there has been more interest in unsupervised methods. Recent research into unsupervised methods focuses on extending language modelling, for example by combining unigram modelling with deep neural networks. This review does not include recent work in deep learning.
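To make the second architecture above (feature extraction followed by rank-based sentence selection) concrete, here is a minimal, hypothetical sketch that ranks sentences by a single lexical feature (average term frequency) and keeps the top-k in original order. It is an illustrative toy, not any specific system from the reviewed literature; real speech summarizers additionally use acoustic and prosodic features over ASR transcripts.

```python
from collections import Counter


def extractive_summary(sentences, k=2):
    """Toy rank-and-select summarizer (illustrative only): score each
    sentence by the mean document-level frequency of its tokens, then
    keep the k highest-scoring sentences in their original order."""
    # Document-level term frequencies over lowercased whitespace tokens.
    tf = Counter(tok for s in sentences for tok in s.lower().split())
    # Feature extraction: one lexical score per sentence.
    scores = [
        sum(tf[tok] for tok in s.lower().split()) / max(len(s.split()), 1)
        for s in sentences
    ]
    # Rank-based selection: indices of the top-k sentences, reordered
    # back into document order so the summary reads coherently.
    top = sorted(sorted(range(len(sentences)), key=lambda i: -scores[i])[:k])
    return [sentences[i] for i in top]


sents = [
    "speech summarization takes speech as input",
    "the weather is nice today",
    "summarization outputs an abridged version of speech",
]
print(extractive_summary(sents, k=2))
```

Supervised variants of this architecture replace the hand-set score with a trained classifier or ranker (e.g. a ranking SVM) over richer feature vectors.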