Sequencing dropout-and-batch effect normalization for single-cell mRNA profiles: a survey and comparative analysis.

Publisher:
Oxford University Press (OUP)
Publication Type:
Journal Article
Citation:
Briefings in bioinformatics, 2020
Issue Date:
2020-10-19
Filename Description Size
Briefings in Bioinformatics_2020.pdfPublished version6.83 MB
Adobe PDF
Full metadata record
Single-cell mRNA sequencing has been adopted as a powerful technique for understanding gene expression profiles at the single-cell level. However, challenges remain due to factors such as the inefficiency of mRNA molecular capture, technical noises and separate sequencing of cells in different batches. Normalization methods have been developed to ensure a relatively accurate analysis. This work presents a survey on 10 tools specifically designed for single-cell mRNA sequencing data preprocessing steps, among which 6 tools are used for dropout normalization and 4 tools are for batch effect correction. In this survey, we outline the main methodology for each of these tools, and we also compare these tools to evaluate their normalization performance on datasets which are simulated under the constraints of dropout inefficiency, batch effect or their combined effects. We found that Saver and Baynorm performed better than other methods in dropout normalization, in most cases. Beer and Batchelor performed better in the batch effect normalization, and the Saver-Beer tool combination and the Baynorm-Beer combination performed better in the mixed dropout-and-batch effect normalization. Over-normalization is a common issue occurred to these dropout normalization tools that is worth of future investigation. For the batch normalization tools, the capability of retaining heterogeneity between different groups of cells after normalization can be another direction for future improvement.
Please use this identifier to cite or link to this item: