Molecular cavity topological representation for pattern analysis: A NLP analogy-based word2vec method

Publication Type:
Journal Article
International Journal of Molecular Sciences, 2019, 20 (23)
© 2019 by the authors. Licensee MDPI, Basel, Switzerland.
Cavity analysis in molecular dynamics is important for understanding molecular function. However, analyzing the dynamic patterns of molecular cavities remains difficult. In this paper, we propose a novel method that represents the topology of molecular cavities as vectors. First, we characterize cavities with a Word2Vec model, based on an analogy between cavities and natural language processing (NLP) terms. Then, we apply dimension reduction and clustering to conduct an exploratory analysis of the vectorized cavities. On a real data set, we demonstrate that our approach preserves the topological characteristics of the cavities and can identify patterns of change across a large number of cavities.
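The idea of treating cavities like words can be illustrated with a small sketch. It is not the authors' implementation. It assumes that each trajectory frame is a "sentence" and that the "words" are made-up labels (`a1`, `b1`, ...) for the elements lining a cavity. It also replaces Word2Vec with plain co-occurrence counts so that it runs with the standard library alone. The dimension reduction and clustering steps are omitted. The function names `cooccurrence_vectors` and `cosine` are hypothetical.

```python
from collections import Counter
from math import sqrt

# Hypothetical toy data: one list per trajectory frame ("sentence"); each
# token ("word") is an invented label for an element lining a cavity.
frames = [
    ["a1", "a2", "a3"],
    ["a1", "a2", "a4"],
    ["a1", "a2", "a3"],
    ["b1", "b2", "b3"],
    ["b1", "b2", "b4"],
]


def cooccurrence_vectors(sentences, window=2):
    """Count-based stand-in for Word2Vec: one context-count vector per token."""
    vocab = sorted({tok for sent in sentences for tok in sent})
    counts = {tok: Counter() for tok in vocab}
    for sent in sentences:
        for i, tok in enumerate(sent):
            lo = max(0, i - window)
            hi = min(len(sent), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    counts[tok][sent[j]] += 1
    # Lay the counts out in a fixed vocabulary order so vectors are comparable.
    return {tok: [counts[tok][ctx] for ctx in vocab] for tok in vocab}


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = sqrt(sum(x * x for x in u))
    norm_v = sqrt(sum(y * y for y in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


vectors = cooccurrence_vectors(frames)
# Tokens that appear in the same cavity context get similar vectors.
sim_same = cosine(vectors["a3"], vectors["a4"])
# Tokens from unrelated cavities share no context.
sim_diff = cosine(vectors["a3"], vectors["b3"])
```

In this toy data, `a3` and `a4` always occur alongside `a1` and `a2`, so their vectors point in the same direction, while `a3` and `b3` have no shared context. A trained Word2Vec model would learn dense vectors instead of raw counts, but the grouping of tokens by shared context is the property that the later clustering step relies on.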