Unsupervised decomposition of a multi-author document based on naive-Bayesian model

Publication Type:
Conference Proceeding
ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference, 2015, vol. 2, pp. 501-505
© 2015 Association for Computational Linguistics. This paper proposes a new unsupervised method for decomposing a multi-author document into its authorial components. Nothing is assumed to be known about the document or its authors except the number of authors. The key idea is to exploit differences in the posterior probabilities of a naive Bayesian model to increase both the precision of the clustering assignment and the accuracy of the classification step. Experimental results show that the proposed method outperforms two state-of-the-art methods.
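The abstract does not give implementation details, so the following is only a minimal sketch of the general idea of using posterior-probability differences as a confidence signal. It is not the authors' implementation. It assumes that segments have already received provisional cluster labels, fits a multinomial naive Bayes model with add-one smoothing on them, and scores each segment by the gap between its two highest log posteriors. The function names, the smoothing choice, and the toy segments are all hypothetical.

```python
import math
from collections import Counter


def train_naive_bayes(docs, labels, alpha=1.0):
    """Fit a multinomial naive Bayes model with add-alpha smoothing.

    docs is a list of token lists; labels holds one provisional
    author label per document (an assumption of this sketch).
    """
    classes = sorted(set(labels))
    vocab = {w for d in docs for w in d}
    word_counts = {c: Counter() for c in classes}
    class_counts = Counter(labels)
    for d, y in zip(docs, labels):
        word_counts[y].update(d)
    model = {}
    for c in classes:
        total = sum(word_counts[c].values()) + alpha * len(vocab)
        log_prior = math.log(class_counts[c] / len(labels))
        log_like = {w: math.log((word_counts[c][w] + alpha) / total)
                    for w in vocab}
        model[c] = (log_prior, log_like)
    return model


def log_posteriors(model, doc):
    """Unnormalised log posterior per class; unseen words are skipped."""
    return {c: lp + sum(ll[w] for w in doc if w in ll)
            for c, (lp, ll) in model.items()}


def posterior_margin(model, doc):
    """Return the best class and the gap to the runner-up (needs >= 2 classes)."""
    ranked = sorted(log_posteriors(model, doc).items(),
                    key=lambda kv: kv[1], reverse=True)
    (best, top), (_, second) = ranked[0], ranked[1]
    return best, top - second


# Toy segments with provisional labels (hypothetical data).
segments = [["sea", "ship", "storm"], ["ship", "sail", "sea"],
            ["court", "law", "judge"], ["law", "judge", "trial"],
            ["sea", "law"]]
provisional = ["A", "A", "B", "B", "A"]

model = train_naive_bayes(segments, provisional)
for seg in segments:
    print(seg, posterior_margin(model, seg))
```

Under this sketch, a segment with a large margin is one the model assigns to a single author with little competition, so it could be kept as a reliable training example, while a small margin flags a segment whose assignment is uncertain.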