Home >> Research >> Papers

Identification of gene and protein signatures associated with long-term effects of COVID-19 on the immune system after patient recovery by analyzing single-cell multi-omics data using a machine learning approach

Identification of gene and protein signatures associated with long-term effects of COVID-19 on the immune system after patient recovery by analyzing single-cell multi-omics data using a machine learning approach
Author Ren, JX; Gao, Q; Zhou, XC; Chen, L; Guo, W; Feng, KY; Hu, JRY; Huang, T; Cai, YD
Journal VACCINE
Pub Year 2024
Type
Abstract Viral infections significantly impact the immune system, and impact will persist until recovery. However, the influence of severe acute respiratory syndrome coronavirus 2 infection on the homeostatic immune status and secondary immune response in recovered patients remains unclear. To investigate these persistent alterations, we employed five feature-ranking algorithms (LASSO, MCFS, RF, CATBoost, and XGBoost), incremental feature selection, synthetic minority oversampling technique and two classification algorithms (decision tree and knearest neighbors) to analyze multi-omics data (surface proteins and transcriptome) from coronavirus disease 2019 (COVID-19) recovered patients and healthy controls post-influenza vaccination. The single-cell multi-omics dataset was divided into five subsets corresponding to five immune cell subtypes: B cells, CD4+ T cells, CD8+ T cells, Monocytes, and Natural Killer cells. Each cell was represented by 28,402 scRNA-seq (RNA) features, 3 Hash Tag Oligo (HTO) features, 138 Cellular indexing of transcriptomes and epitopes by sequencing (CITE) features and 23,569 Single Cell Transform (SCT) features. Some multi-omics markers were identified and effective classifiers were constructed. Our findings indicate a distinct immune status in COVID-19 recovered patients, characterized by low expression of ribosomal protein (RPS26) and high expression of immune cell surface proteins (CD33, CD48). Notably, TMEM176B, a membrane protein, was highly expressed in monocytes of COVID19 convalescent patients. These observations aid in discerning molecular differences among immune cell subtypes and contribute to understanding the prolonged effects of COVID-19 on the immune system, which is valuable for treating infectious diseases like COVID-19.
Issue 42
Volume 42
SCI 4.5