Multi-document summarization based on sentence cluster using non-negative matrix factorization
- Publication Type:
- Journal Article
- Journal of Intelligent and Fuzzy Systems, 2017, 33 (3), pp. 1867 - 1879
- Issue Date:
© 2017 - IOS Press and the authors. All rights reserved. Multi-document summarization aims to produce a concise summary that contains salient information from a set of source documents. Many approaches use statistics and machine learning techniques to extract sentences from documents. In this paper, we propose a new multi-document summarization framework based on sentence cluster using Nonnegative Matrix Tri-Factorization (NMTF). The proposed framework employs NMTF to cluster sentences using inter-type relationships among documents, sentences and terms, and incorporate the intra-type information through manifold regularization. The most informative sentences are selected from each sentence cluster to form the summary. When evaluated on the DUC2004 and TAC2008 datasets, the performance of the proposed framework is comparable with that of the top three systems.
Please use this identifier to cite or link to this item: