IR-ER- A Hybrid Pipeline for Classifying COVID-19 RNA Seq Data

Publisher: ASSOC COMPUTING MACHINERY
Publication Type: Conference Proceeding
Citation: ACM International Conference Proceeding Series, 2023, pp. 183-189
Issue Date: 2023-01-30
File: IR-ER Hybrid publication.pdf (Published version, Adobe PDF, 847.95 kB)
Bioinformatics offers numerous approaches for evaluating similarities between RNA-seq data for disease classification. Processing RNA sequencing (RNA-seq) data with clustering or classification approaches is extremely challenging, although RNA-seq analysis helps identify differentially expressed genes and classify patients in a risk-free manner. In this study, we present a hybrid end-to-end pipeline for analyzing, processing, and classifying RNA-seq data, with a major focus on a COVID-19 data set. The pipeline has three phases. First, the raw data is normalized. Next, the normalized data is passed to a colonization algorithm to remove noisy data. Finally, the optimized data set is passed to a deep learning (DL) classifier. A comparative analysis is then performed against state-of-the-art methods discussed in the literature. The results show that the proposed hybrid pipeline achieved the best accuracy among the compared methods. Gene set enrichment analysis was also performed to analyze the genes that are informative for COVID-19 identification.
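
The three-phase structure described in the abstract (normalize, filter, classify) might be sketched as follows. This is only an illustration of how the stages compose and is not the authors' implementation: the abstract does not specify the normalization, so log2 counts-per-million is assumed; a simple variance filter stands in for the colonization algorithm; and a nearest-centroid rule stands in for the deep learning classifier. All function names and the toy counts are hypothetical.

```python
import math
from statistics import pvariance


def log_cpm(counts):
    """Phase 1 (assumed): log2 counts-per-million for each sample (row)."""
    out = []
    for row in counts:
        total = sum(row) or 1  # guard against an all-zero sample
        out.append([math.log2(c / total * 1e6 + 1) for c in row])
    return out


def top_variance_genes(matrix, k):
    """Phase 2 stand-in: indices of the k most variable genes.

    The paper uses a colonization algorithm here; a variance filter is
    swapped in purely to keep the sketch short.
    """
    n_genes = len(matrix[0])
    variances = [pvariance([row[g] for row in matrix]) for g in range(n_genes)]
    ranked = sorted(range(n_genes), key=lambda g: variances[g], reverse=True)
    return sorted(ranked[:k])


def fit_centroids(matrix, labels):
    """Phase 3 stand-in: mean expression profile per class.

    The paper uses a deep learning classifier; nearest-centroid is a
    placeholder so the example stays dependency-free.
    """
    groups = {}
    for row, label in zip(matrix, labels):
        groups.setdefault(label, []).append(row)
    return {
        label: [sum(col) / len(rows) for col in zip(*rows)]
        for label, rows in groups.items()
    }


def predict(centroids, row):
    """Assign the class whose centroid is closest in Euclidean distance."""
    return min(centroids, key=lambda label: math.dist(centroids[label], row))


def run_pipeline(counts, labels, k):
    """Chain the three phases; returns kept gene indices and predictions."""
    normalized = log_cpm(counts)
    keep = top_variance_genes(normalized, k)
    reduced = [[row[g] for g in keep] for row in normalized]
    centroids = fit_centroids(reduced, labels)
    return keep, [predict(centroids, row) for row in reduced]


# Hypothetical toy counts: 4 samples (rows) x 4 genes (columns).
counts = [
    [500, 10, 100, 390],
    [480, 12, 110, 398],
    [100, 11, 105, 784],
    [90, 9, 95, 806],
]
labels = ["case", "case", "control", "control"]
keep, predicted = run_pipeline(counts, labels, k=2)
```

On this toy input the filter keeps the two genes whose expression differs most between the groups, and the placeholder classifier is evaluated on its own training samples, so the result says nothing about real-world accuracy. Each stage is a plain function, so a different normalization, selection method, or classifier could be substituted without changing `run_pipeline`.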