Unraveling Scientific Evolutionary Paths: An Embedding-Based Topic Analysis

Publisher:
Institute of Electrical and Electronics Engineers (IEEE)
Publication Type:
Journal Article
Citation:
IEEE Transactions on Engineering Management, 2023, PP, (99), pp. 1-15
Issue Date:
2023-01-01
Filename Description Size
Unraveling Scientific Evolutionary Paths An Embedding-Based Topic Analysis.pdfAccepted version1.95 MB
Adobe PDF
Full metadata record
Understanding the evolution of knowledge has been and will continue to be the key task of science, technology, and innovation management. Existing research on evolutionary path identification relies primarily on traditional co-occurrence analysis and bag-of-words (BOW)–based models for topic extraction. However, these approaches have limitations in effectively capturing the underlying semantics and linkages of the topics. In this article, we propose a novel embedding-based methodology for scientific evolution analysis, in which word embedding, document embedding, clustering, and network analysis are applied to extract topics, measure topical semantic similarities, and quantitatively distinguish topics’ evolutionary states. We first perform benchmark experiments to demonstrate that doc2vec generally outperforms the BOW-based models in topic extraction before evolution analysis. We then consider topic consistency in vector spaces to identify evolutionary states including newborn, convergence, inheritance, and extinction. Scientific evolutionary paths are finally unraveled based on topic similarity matrixes and evolutionary states. We conduct a case study on object detection research to validate the effectiveness of our methodology. The empirical results, validated by domain experts, demonstrate that the proposed methodology is capable of effectively revealing patterns of knowledge inheritance and integration. Consequently, this methodology can be used to improve decision-making processes in future innovation management.
Please use this identifier to cite or link to this item: