Research Paper Volume 16, Issue 15 pp 11606—11625

Exploration of bacterial lipopolysaccharide-related genes signature based on T cells for predicting prognosis in colorectal cancer

Lichao Cao1,2,3, *, , Ying Ba1,2,3, *, , Fang Chen1,2,3, , Shenrui Zhang1,2,3, , Hezi Zhang1,2,3, ,

* Equal contribution

Received: April 3, 2024       Accepted: July 19, 2024       Published: August 6, 2024      

https://doi.org/10.18632/aging.206041

Copyright: © 2024 Cao et al. This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Abstract

Purpose: The intratumoral microorganisms participates in the progression and immunotherapy of colorectal cancer (CRC). However, due to technical limitations, the impact of microorganisms on CRC has not been fully understood. Therefore, we conducted a systematic analysis of relationship between bacterial lipopolysaccharide (LPS)-associated genes and immune cells to explore new biomarkers for predicting the prognosis of CRC.

Methods: The single-cell RNA sequencing data and the Comparative Toxicogenomics Database were used to screen T cells-associated LPS-related genes (TALRGs). Then, we established and validated the TALRGs risk signature in The Cancer Genome Atlas Colon Adenocarcinoma (TCGA-COAD) cohort and GSE39582 cohort. Besides, we compared the differences in tumor-infiltrating immune cell types, immunotherapeutic response, somatic mutation profiles, and tumor mutation burden (TMB) between high-risk group and low-risk group. In addition, the immunotherapeutic cohort (Imvigor210) treated with an anti-PD-L1 agent was performed to explore the potential value of the TALRGs signature on immunotherapy.

Results: Five prognostic TALRGs were identified and selected to build the prognostic model. The high-risk group had poor prognosis in both TCGA-COAD cohort (P < 0.0001) and GSE39582 cohort (P = 0.00019). The areas under the curves (AUCs) of TALRGs signature were calculated (TCGA-COAD cohort: 0.624 at 1 years, 0.639 at 3 years, 0.648 at 5 years; anti-PD-L1 cohort was 0.59). The high-risk group had advanced pathological stages and higher TMN stages in both TCGA-COAD cohort and GSE39582 cohort. The high-risk group had the higher infiltration of immunosuppressive cells, the expressions of immune checkpoint molecules, the IC50 values of chemotherapy drugs, and TP53 mutation rate (P < 0.05). In addition, patients with high TMB had worse prognosis (P < 0.05). Furthermore, the Imvigor210 also showed patients with high-risk scores had poor prognosis (platinum-treated cohort: P = 0.0032; non-platinum-treated cohort: P = 0.00017).

Conclusions: Microorganisms are closely related to the tumor microenvironment to influence the progression and immune response of CRC via stimulating T cells through LPS-related genes. The TALRGs signature contributed to predict the prognosis and immunotherapy of CRC, and became new therapeutic targets and biomarkers of CRC.

Introduction

Colorectal cancer (CRC) is the second leading cause of cancer-related death worldwide, with high morbidity and mortality [1]. With the development of economy, the incidence and death rate of CRC are gradually increasing, especially in individuals under 50 years of age [2]. Treatment options for CRC are complex, usually with surgery as the primary treatment, radiotherapy, chemotherapy, targeted therapy, and combination therapy as adjuvant treatments [3]. Unfortunately, not all patients benefit from these treatments, and there is also an increased rate of treatment failure due to high recurrence rates and resistance to anti-cancer drugs, which is closely associated with the heterogeneity of the CRC tumor microenvironment (TME) [4, 5]. Therefore, predicting the prognosis and determining the best treatment for poor prognosis of CRC from the perspective of TME is of great significance for improving the survival time and quality of patients.

The role of microorganisms in the occurrence, diagnosis, prognosis and treatment of cancer has been increasingly recognized. Some studies have found that the infections of gut microbiota may be associated with the development of CRC [6]. For example, a potential association between streptococcal infections and gastrointestinal tumors was discovered as early as 1950s [7]. In addition, gut and intratumoral microorganisms affected the immune infiltrating cells of TME via their derived metabolites, genotoxins, and antigens, thereby regulating the antitumor immune response [8, 9]. For instance, the immunogenic intestinal bacteria (Helicobacter hepaticus) inhibited tumor growth of colon adenocarcinoma (COAD) by activating CD4+ T cells- and B cells-associated anti-tumor immunity [10]. Bacterial lipopolysaccharides (LPS) are present in both cancer cells and immune cells of TME [11], which can bind to the TLR4 of monocytes, causing them to differentiate into an immunosuppressive M2 phenotype [12, 13]; and can also promote the recruitment of CD11b+Gr-1+ myeloid-derived suppressor cells (MDSCs) and CD1d+CD5+ regulatory B (Breg) cells on tumor cells, which together inhibit the local anti-tumor T cells response [14]. Actually, unlike gut microbes, the comprehensive characterization of intratumoral microorganisms’ signatures related to immune responses is still in its infancy due to technical limitations. Therefore, through immune infiltrating cells in TME, we can explore the role of major pathogenic components of intratumoral microorganisms on the occurrence, development, prognosis and treatment of cancer, laying a foundation for the clinical prevention and treatment of cancer.

In this study, we aim to identify a T cells-associated LPS-related gene (TALRG) signature as a biomarker for the diagnosis and prognosis of CRC. We first obtained differentially expressed genes (DEGs) in T cells from the single-cell RNA sequencing (scRNA-seq) data and LPS-related genes from the Comparative Toxicogenomics Database (CTD). Then, TALRGs were obtained by making the intersection of DEGs in T cells and LPS-related genes. Subsequently, Univariate and multivariate Cox analyses were used to identify a prognostic 5-TALRGs signature. This study revealed TALRGs’ clinical value in CRC and provided recommendations for finding new treatment options of CRC.

Methods

Acquisition and processing of data

The scRNA-seq dataset GSE200997 was obtained from the Gene Expression Omnibus (GEO, https://www.ncbi.nlm.nih.gov) database, which included 16 CRC samples and 7 adjacent samples of normal colon tissue. There was a total of 49,859 cells in all samples, including 31,586 cells from tumor samples and 18,273 cells from normal samples. A total of 5286 LPS-related genes were downloaded from CTD database (http://ctdbase.org/).

The Cancer Genome Atlas Colon Adenocarcinoma (TCGA-COAD) cohort was downloaded from UCSC Xena platform (https://xenabrowser.net/datapages/), which is used to construct the prognosis model, including the mRNA expression data, clinical information (gender, age, microsatellite status, overall survival (OS), pathological stage, TNM stage), and mutation profiling data. The detailed information was shown in Supplementary Table 1. The validation dataset (GSE39582) was downloaded from the GEO database, and the detailed information was shown in Supplementary Table 2.

According to the published report [15], we gained an immunotherapeutic cohort (IMvigor210), which was treated with atezolizumab (anti-PD-L1 agent). Based on the guideline on http://research-pub.gene.com/IMvigor210CoreBiologies, the IMvigor210CoreBiologies R package was used to acquire Expression sets and clinical information of this cohort [16]. The samples were divided into platinum-treated cohort and non-platinum-treated cohort based on whether or not they received platinum-based chemotherapy.

scRNA-seq dataset analysis and TALRGs identification

The cell filtration criteria in the scRNA-seq dataset (GSE200997) were consistent with previous study [17]. According to the previous report [17], the filtered data was used to reduce the dimensionality of the features. The cells were then clustered using the FindNeighbors and FindClusters functions. The t-distributed stochastic neighbor embedding, and uniform manifold approximation and projection (UMAP) algorithms were used to reduce the dimensionality and visualize clustering classification based on the first 20 principal components selected. Then, we used SingleR package of R software to annotate the cell types of clusters, with “HumanPrimaryCellAtlasData” as reference [18]. Subsequently, we identified DEGs between tumor and normal cells in T cells using the FindMarkers function with logfc.threshold = 0.585. Moreover, the Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses of DEGs in T cells were conducted using the Database for Annotation, Visualization and Integrated Discovery (DAVID) database (https://david.ncifcrf.gov/conversion.jsp). Finally, we selected the intersection of DEGs in T cells and bacterial LPS-related genes to obtain TALRGs.

Development and validation of the TALRGs prognostic model

TCGA-COAD cohort was used to recognize the prognostic TALRGs and established a prognostic risk model. Univariate Cox analysis was used to screen the prognostic signature using Survival R package. Kaplan–Meier analysis was performed to evaluate the relationship between mRNA expression of prognostic TALRGs and overall survival (OS). According to the each TALRG’ coefficients (which obtained from the multivariate Cox results), the risk score was calculated by the following formula [17]:

Kaplan–Meier analysis and time-dependent receiver operating characteristic (ROC) curve were applied to validate the predictive ability of the TALRGs prognostic model using the survminer R package and survivalROC R package, respectively. To further confirm the prediction model, we introduced a validation dataset: GSE39582. The prediction ability of the model was verified by the above method.

Assessment of clinicopathological characteristics

TCGA-COAD cohort and GSE39582 dataset were performed to validate the prognostic capability of TALRGs prognostic model. According to the constructed risk scoring model, the dataset was divided into high-risk group and low-risk group. Kruskal-Wallis test and Wilcoxon test were used to investigate the relationship between TALRGs prognostic model and clinicopathological characteristics, including pathological stage and TNM stage.

Establishment of nomogram

Nomogram is a multivariate regression analysis based on clinicopathological information and risk score. Based on the risk score and clinical factors such as age, sex, microsatellite status, and pathological stage, the nomogram was constructed using the TCGA-COAD cohort using rms R package. The calibration curve was employed to evaluate the suitability of clinical use.

Estimation of immunological properties of TALRGs signature

Similar to the previous approach [16], we employed CIBERSORT algorithm to evaluate the infiltration of 22 types of tumor-infiltrating immune cells [19]. The Wilcoxon test was applied to compare to the proportions of immune cells between low- and high-risk groups (P-value < 0.05). Then, we employed unpaired t-test and Wilcoxon test to assess the difference of the mRNA levels of immune checkpoints and IC50 value of chemotherapy drugs between the two risk groups, respectively.

Analysis of mutation characteristics of TALRGs signature

We employed the maftools R package to analyze and visualize the somatic mutation profiles. Subsequently, the correlation between the mutation status of gene and OS was evaluated. The value of tumor mutation burden (TMB) was calculated and visualized by the maftools R package. The Kruskal-Wallis test was performed to compare the TMB value between the two risk groups. The surv_cutpoint algorithm of survival R package was used to determine the optimal cutoff value of TMB, and then all samples were divided into high-TMB group and low-TMB group. Kaplan–Meier analysis was used to evaluate the relationship between TMB values and OS. P < 0.05 was considered statistically significant.

The role of the TALRGs signature in immunotherapy

Similar to the previous study [16], we validated the constructed TALRGs signature using platinum-treated cohort and non-platinum-treated cohort, respectively. Secondly, patients with complete response (CR) or partial response (PR) were classified as responders and compared with non-responders (stable disease (SD) or progressive disease (PD)), and the risk score for each patient was calculated according to the built risk score model. Kaplan–Meier analysis and ROC curve were applied to assess predictive ability of the risk model. Finally, we employed the Wilcoxon test to compare the tumor mutation load and neoantigen load between the two risk groups.

Results

scRNA-seq analysis of CRC and identification of TALRGs

In all, 47,958 high-quality cells were obtained after strict quality control (Figure 1A), including 30,341 tumor cells and 17,617 normal cells. The dimensionality reduction by principal component analysis (PCA) showed that tumor cells and normal cells were significantly different (Figure 1B). Subsequently, 22 distinct cell subgroups were identified and visualized by using UMAP method (Figure 1C). Through the annotation of 22 cell subgroups, a total of 6 major cell types were identified, mainly including T cells, B cells, NK cells, endothelial cells, hepatocytes, and monocytes (Figure 1D). Subsequently, we compared and analyzed the genes between tumor and normal cells in individual T cell clusters. T cells revealed 82 DEGs between tumor and normal cells (Supplementary Table 3). KEGG enrichment analysis showed that DEGs in T cells are mainly involved in “Antigen processing and presentation”, “Intestinal immune network for IgA production”, “Staphylococcus aureus infection” and “Cell adhesion molecules” (Figure 1E), which are correlative signaling pathways of bacterial LPS-related genes. A total of 5,286 bacterial LPS-related genes were obtained from the CTD database. Then LPS-related genes were overlapped with DEGs of T cells, and 44 TALRGs were obtained (Figure 1F).

TALRGs identification from scRNA-seq data and CTD. (A) The quality control of scRNA-seq data. (B) The dimensionality of tumor cells and normal cells was reduced by PCA. (C) UMAP distribution of the 22 cell clusters. (D) The cell clusters were annotated into 6 main cell types. (E) KEGG analysis of DEGs in T cells. (F) The Venn diagram of DEGs in T cells and LPS-related genes.

Figure 1. TALRGs identification from scRNA-seq data and CTD. (A) The quality control of scRNA-seq data. (B) The dimensionality of tumor cells and normal cells was reduced by PCA. (C) UMAP distribution of the 22 cell clusters. (D) The cell clusters were annotated into 6 main cell types. (E) KEGG analysis of DEGs in T cells. (F) The Venn diagram of DEGs in T cells and LPS-related genes.

Establishment and validation of TALRGs signature

We further studied the association of TALRGs with prognosis, and 5 optimal prognosis-related TALRGs with P < 0.05 were identified using univariate Cox analysis (Table 1), namely HSPA1A, HSPB1, SERPINA1, SLC2A3, and TIMP1. Compared with normal samples, the expressions of HSPA1A, HSPB1 and SERPINA1 were down-regulated in tumor samples, while TIMP1 was up-regulated in tumor samples. SLC2A3 expression had no significant difference between normal and tumor samples (Figure 2A). Furthermore, Kaplan–Meier survival analyses demonstrated that patients with increased HSPA1A (P = 0.018, Supplementary Figure 1A), SLC2A3 (P = 0.0073, Supplementary Figure 1D), and TIMP1 (P = 0.0031, Supplementary Figure 1E) showed poor survival, while patients with increased SERPINA1 indicated favorable survival (P = 0.02, Supplementary Figure 1C). There was no significant relationship between HSPB1 expression and OS (P = 0.28, Supplementary Figure 1B). Subsequently, multivariate Cox analysis established a TALRGs signature based on the TCGA-COAD cohort, and the hazard ratio of each gene was shown in Figure 2B. All patients were then divided into high- and low-risk groups based on the median risk score. The formula for the risk score model was as follows: risk score = (0.15872 × expression value of HSPA1A) + (0.04431 × expression value of HSPB1) + (−0.19961 × expression value of SERPINA1) + (0.0404 × expression value of SLC2A3) + (0.22075 × expression value of TIMP1). The Kaplan–Meier analysis manifested low-risk patients had a better survival advantage (P < 0.0001, Figure 2C). In addition, the area under the curve (AUC) of the TCGA-COAD cohort was 0.624 (at 1-years), 0.639 (at 3-years), and 0.648 (at 5-years), respectively (Figure 2D). The model was validated in the GSE39582 cohort. The OS rate of the low-risk group was higher than that of the high-risk group (P = 0.00019, Figure 2E). The AUC of anti-PD-L1 cohort was 0.59 with P = 0.0366463 (Figure 2F). These results suggested that TALRGs prognostic model is effective in predicting the prognosis of CRC patients.

Table 1. The detailed information on the identified prognostic TALRGs.

TALRGsHRHR.95LHR.95HP-value
HSPA1A1.2345648361.0898869421.3984481110.0009216
HSPB11.2330814831.0309517911.4748409730.02180952
SERPINA10.8698819520.7719665630.9802168220.022142324
SLC2A31.1738869461.0205139681.3503103410.024818267
TIMP11.3043911571.0604031911.6045182670.011901692
Construction and validation of the TALRGs signature. (A) Comparison of the TALRGs expression between tumor and normal samples. (B) The multivariate Cox analysis. (C) Kaplan–Meier analysis of the TALRGs signature in TCGA-COAD cohort. (D) ROC curve analysis of the TALRGs signature in TCGA-COAD cohort. (E) Kaplan–Meier analysis of the TALRGs signature in GSE39582 cohort. (F) ROC curve analysis of the TALRGs signature in anti-PD-L1 cohort.

Figure 2. Construction and validation of the TALRGs signature. (A) Comparison of the TALRGs expression between tumor and normal samples. (B) The multivariate Cox analysis. (C) Kaplan–Meier analysis of the TALRGs signature in TCGA-COAD cohort. (D) ROC curve analysis of the TALRGs signature in TCGA-COAD cohort. (E) Kaplan–Meier analysis of the TALRGs signature in GSE39582 cohort. (F) ROC curve analysis of the TALRGs signature in anti-PD-L1 cohort.

Evaluation of clinicopathological characteristics

To explore the relationship between prognostic values and clinicopathological characteristics of CRC patients, clinical traits in TCGA-COAD and GSE39582 cohorts were analyzed. The CRC patients with more advanced clinical stages in TCGA-COAD cohort, such as advanced pathological stage (P = 0.00022, Figure 3A), later T stage (P = 0.0032, Figure 3B), later M stage (P = 0.0042, Figure 3C), and later N stage (P = 0.0017, Figure 3D) had higher risk scores, indicating patients with low-risk scores have a better prognosis. Analogously, there were differences in the risk scores of different pathological stages (P = 0.018, Figure 3E) and different T stages (P = 0.0051, Figure 3F) in GSE39582 cohort, suggesting that risk scores were correlated with pathological stages and T stages. Besides, the risk score in M1 was not significantly different from M0 (P = 0.26, Figure 3G), while the risk score in N2/N3 was significantly higher than N0/N1 (P = 0.00045, Figure 3H). These results indicated that TALRGs signature has good clinical performance.

Analysis of clinicopathological characteristics. The Kruskal-Wallis test was used to analyze the relationship between risk score and (A) advanced pathological stages, (B) T stages, (C) M stages, and (D) N stages in TCGA-COAD cohort. The Kruskal-Wallis test was used to analyze the relationship between risk score and (E) advanced pathological stages and (F) T stages in GSE39582 cohort. The Wilcoxon test was performed to analyze the relationship between risk score and (G) M stages and (H) N stages in GSE39582 cohort.

Figure 3. Analysis of clinicopathological characteristics. The Kruskal-Wallis test was used to analyze the relationship between risk score and (A) advanced pathological stages, (B) T stages, (C) M stages, and (D) N stages in TCGA-COAD cohort. The Kruskal-Wallis test was used to analyze the relationship between risk score and (E) advanced pathological stages and (F) T stages in GSE39582 cohort. The Wilcoxon test was performed to analyze the relationship between risk score and (G) M stages and (H) N stages in GSE39582 cohort.

Construction of a nomogram

As shown in Figure 4A, age (≤ 60), stage III and IV, and risk score were observably associated with OS (P < 0.05), while gender, microsatellite status, stage I and II were not (P > 0.05). Then, we constructed a nomogram plot to ameliorate the accuracy of TALRGs signature and provided a quantitative and intuitive way to predict 1-, 3-, and 5-years survival probabilities for CRC patients (Figure 4B). We can determine the total score based on the individual score and thus predict the probability of survival of the patient at 1-, 3-, and 5-years. In addition, the calibration curves of the model showed that the predicted curves are close to the ideal curves, indicating that the nomogram has a good prediction performance (Figure 4C4E).

Construction and validation of a nomogram. (A) The forest map demonstrated the association between risk score, clinical characteristics, and OS. (B) Nomogram based on risk score and clinical characteristics in predicting 1-, 3-, and 5-years OS. Calibration curves were used to predict the probability of 1-year (C), 3-year (D), and 5-year (E) OS.

Figure 4. Construction and validation of a nomogram. (A) The forest map demonstrated the association between risk score, clinical characteristics, and OS. (B) Nomogram based on risk score and clinical characteristics in predicting 1-, 3-, and 5-years OS. Calibration curves were used to predict the probability of 1-year (C), 3-year (D), and 5-year (E) OS.

Immune infiltrate analysis and efficacy prediction of TALRGs signature

Given that the characteristics of tumor immune infiltrating cells are associated with tumor development and progression and may affect the prognosis of CRC, we used the CIBERSORT method to compare the immune infiltration of 22 immune cells in different risk groups. The results showed B cells, plasma cells, resting CD4+ T cells, activated CD4+ T cells, and resting dendritic cell (DC) cells were more abundant in the low-risk group, while CD8+ T cells, T regulatory cells (Tregs), and M0/M1/M2 macrophages were more abundant in the high-risk group (Figure 5A). In addition, we analyzed correlations between the risk model and CD274 (PDL1), PDCD1 (PD1), CTLA4, HAVCR2 (TIM3), LAG3, and TIGIT. We found observably higher expressions of all immune checkpoint molecules in the high-risk group (Figure 5B), indicating that immune function is suppressed in high-risk patients. Moreover, we discovered the IC50 of dactinomycin, docetaxel, mitoxantrone, vinorelbine, camptothecin, and teniposide was higher in the high-risk group, while the IC50 of rapamycin was higher in the low-risk group (Figure 5C5J), suggesting that patients in the high-risk group are more resistant to most chemotherapy drugs.

Immune infiltrate analysis and efficacy prediction of TALRGs signature. (A) The Wilcoxon test was applied to compare the proportions of immune cells between low- and high-risk groups. (B) Comparing the difference of the mRNA levels of immune checkpoints unpaired between the two risk groups using unpaired t-test. (C–J) Comparing the difference of IC50 value of chemotherapy drugs between the two risk groups using Wilcoxon test.

Figure 5. Immune infiltrate analysis and efficacy prediction of TALRGs signature. (A) The Wilcoxon test was applied to compare the proportions of immune cells between low- and high-risk groups. (B) Comparing the difference of the mRNA levels of immune checkpoints unpaired between the two risk groups using unpaired t-test. (CJ) Comparing the difference of IC50 value of chemotherapy drugs between the two risk groups using Wilcoxon test.

Mutation analysis of TALRGs signature

Genetic alteration analysis showed that APC, TP53, TTN, KRAS, SYNE1, MUC16, PIK3CA, FAT4, RYR2 and OBSCN were the mutation rates of the top 10 genes in CRC patients (Figure 6A). The mutant profiles of high-risk and low-risk patients were shown in Supplementary Figure 2A, 2B, respectively. Besides, patients in the high-risk group had a higher median number of mutations than those in the low-risk group (118 vs. 110). In the high-risk group, single nucleotide polymorphism (SNP) was the most variant type, missense mutation was the most common SNP classification, and C > T was the main single nucleotide variants (SNV) class, and TTN, APC, MUC16, SYNE1, TP53, RYR2, DNAH11, ABCA13, KRAS, PIK32CA were the top 10 mutated genes in patients in the high-risk group (Figure 6B). Except for the top 10 mutated genes (TTN, APC, MUC16, SYNE1, TP53, FAT4, KRAS, OBSCN, PIK3CA, ZFHX4), the mutation profiles of the low-risk group were similar to that of the high-risk group (Figure 6C). Moreover, the mutation rate of TP53 in the high-risk group was significantly higher than in low-risk group (P = 0.0257, Supplementary Figure 2C). The mutation status of TP53 didn’t affect patients’ OS (P = 0.183, Supplementary Figure 2D), while TP53 combined with mutations in other genes (e.g. LRP18, SYNE1, TTN) significantly reduced survival time (Figure 6D). In addition, the median TMB value was 2.22/Mb in CRC patients (Supplementary Figure 2E), and there was no significant difference in TMB value between low-risk group and high-risk group (P = 0.8, Supplementary Figure 2F). However, patients with lower TMB had better prognosis (P = 0.028, Figure 6E).

Analysis of mutation characteristics of TALRGs signature. (A) The mutation profiles of CRC patients. (B) The summary information of high-risk patients’ mutation profiles. (C) The summary information of low-risk patients’ mutation profiles. (D) Kaplan–Meier analysis of the two genes mutation status. (E) CRC patients with higher TMB values had poor OS.

Figure 6. Analysis of mutation characteristics of TALRGs signature. (A) The mutation profiles of CRC patients. (B) The summary information of high-risk patients’ mutation profiles. (C) The summary information of low-risk patients’ mutation profiles. (D) Kaplan–Meier analysis of the two genes mutation status. (E) CRC patients with higher TMB values had poor OS.

Effect of immunotherapy on prognosis

In order to evaluate the predictive effect of TALRGs signature on immunotherapy, we further validated constructed risk score model using the platinum-treated cohort and the non-platinum-treated cohort. The results exhibited patients with low-risk scores had significantly longer survival times (platinum-treated cohort: P = 0.0032, Figure 7A; non-platinum-treated cohort: P = 0.00017, Figure 7B). The AUCs of constructed model in anti-PD-L1 cohorts were 0.639 (platinum-treated cohort, Figure 7C) and 0.627 (non-platinum-treated cohort, Figure 7D), respectively. The clinical benefit of immunotherapy was not significant in patients receiving platinum therapy (P = 0.27, Figure 7E), while patients receiving non-platinum therapy marked clinical benefits from immunotherapy in the low-risk score group compared to those with a high-risk score group (P = 0.023, Figure 7F). In addition, higher tumor mutation load and higher neoantigen burden in platinum-treated patients was not significantly associated with risk score (P = 0.29, Figure 7G; P = 0.94, Figure 7H), while non-platinum-treated patients were associated with a low-risk score (P = 5.1e-06, Figure 7I; P = 1.7e-05, Figure 7J). These findings demonstrated that TALRGs signature is effective in predicting the prognosis of immunotherapy patients.

Effect of immunotherapy on prognosis. Kaplan–Meier analysis of the TALRGs signature in the platinum-treated cohort (A) and non-platinum-treated cohort (B). ROC curve analysis of anti-PD-L1 cohort in the platinum-treated cohort (C) and non-platinum-treated cohort (D). Comparing the difference of the risk score between responders group and non-responders group in the platinum-treated cohort (E) and non-platinum-treated cohort (F). Comparing the difference of the tumor mutation load and neoantigen load between low- and high-risk groups in the platinum-treated cohort (G, H) and non-platinum-treated cohort (I, J).

Figure 7. Effect of immunotherapy on prognosis. Kaplan–Meier analysis of the TALRGs signature in the platinum-treated cohort (A) and non-platinum-treated cohort (B). ROC curve analysis of anti-PD-L1 cohort in the platinum-treated cohort (C) and non-platinum-treated cohort (D). Comparing the difference of the risk score between responders group and non-responders group in the platinum-treated cohort (E) and non-platinum-treated cohort (F). Comparing the difference of the tumor mutation load and neoantigen load between low- and high-risk groups in the platinum-treated cohort (G, H) and non-platinum-treated cohort (I, J).

Discussion

Microorganisms play an important role in cancer occurrence, diagnosis, prognosis and treatment, especially CRC [20, 21]. Our previous research has identified 2 pathogenic microorganisms that can be used as biomarkers for COAD diagnosis and prediction [22]. However, due to technical limitations, the types of microorganisms in CRC and the depth of their role are not fully understood. Considering that microorganisms in tumors drive tumor progression by promoting immune tolerance [23], we applied bioinformatics tools for the first time to identify TALRGs based on LPS-related genes and T cells, and establish the prognostic risk model and nomogram for CRC. Then, the performance of TALRGs signature and its effect on immunotherapy were examined in multiple cohorts.

For this study, the prognostic 5-TALRGs genes HSPA1A, HSPB1, SERPINA1, SLC2A3, and TIMP1 were obtained. HSPA1A is a member of the heat shock protein group A (HSP70), which plays an important role in regulating the correct folding of proteins and maintaining protein homeostasis [24]. HSPA1A protects cell membrane integrity from bacterial LPS stimulation [25]. HSPA1A was down-regulated in CRC compared to adjacent nontumor tissues, which was consistent with other findings [26]. Additionally, Guan et al. [26] reported that increased HSPA1A in CRC was related to poor prognosis, which was consistent with our study. These findings indicated that HSPA1A may be related to progression rather than tumorigenesis of CRC. Besides, HSPB1 (also called human HSP27) as a member of HSPs, is involved in regulating molecular chaperones of cytoskeletal tissue [27] or stabilizing abnormally folded proteins to prevent aggregation [28]. LPS caused endothelial barrier dysfunction by promoting phosphorylation of HSPB1 [29]. HSPB1 is associated with the progression of CRC and can influence the response of CRC to different treatments [30]. SERPINA1 gene encodes alpha1-antitrypsin (A1AT), in which A1AT knockdown aggravates LPS-induced inflammation and cell death in human bronchial epithelial cells [31]. A1AT has been reported to bind complement components, which regulate T cell function [32]. Furthermore, SERPINA1 was reported to affect the CRC prognosis by regulating Th17 cells’ infiltration [33], which is considered as a biomarker of CRC diagnosis and prognosis [34]. SLC2A3, a member of the glucose transporter superfamily, was significantly up-regulated in LPS-induced human umbilical vein endothelial cells compared with control [35]. Gao and his team showed that elevated SLC2A3 is associated with poor survival of CRC [36], which is same as our study. Interestingly, SLC2A3 expression was positively related to CD4+ T cells and CD8+ T cells in COAD, respectively [36]. TIMP1 is a natural inhibitor of matrix metalloproteinases (MMPs), and increased LPS stimulation of macrophages during hepatitis C virus infection can lead to MMP/TIMP1 imbalance [37]. TIMP1 was markedly correlated with poor prognosis of CRC with its high expression [38]. Our results lead to a similar conclusion where TIMP1 is the prognostic biomarker for CRC. Meanwhile, TIMP1 expression was positively associated with CD8+ T cells in CRC [39]. From the findings above, the biological function of these 5-TALRGs were influenced by LPS and may affect the progression and immune response of CRC by modulating immune cells, especially T cells.

We constructed a prognostic model with high accuracy based on the combination of 5 prognostic TALRGs. Unlike other previous LPS-related prognostic models [40, 41], our 5-TALRGs signature contained the smallest number of genes and was relatively easy to apply clinically. As with other good prognostic signatures [42, 43], we also established a nomogram with better accuracy to contribute to predicting the prognosis of CRC using age, sex, microsatellite status, pathological stage, and risk score. Besides, TALRGs signature had a good performance in understanding immunological properties for CRC. For example, in the terms of immune cells infiltration, low-risk group had the higher infiltration of B cells, plasma cells, CD4+ T cells and DC cells, as well as lower infiltration of immunosuppressive cells, such as macrophages and Tregs. Studies have shown that the infiltration of B cells is associated with good prognosis of patients [44], which has anti-tumor effects for CRC [45]. Additionally, B cells are a major component and initiator of tertiary lymphoid structures (TLS), and tumors with mature TLS, high density of B cells and plasma cells typically have good clinical outcomes and respond to immunotherapy [46]. Besides, CD4+ T cells acted as an antitumor factor to provide clinical benefit to patients [47]. Furthermore, Tregs can promote chemotherapy resistance in CRC patients via FOXO1/CEBPB/NF-κB signaling pathway [48]. Therefore, the infiltration of different immune cells in different risk populations affects the response of colon cancer patients to immunotherapy.

Recent studies have shown that immunotherapy and chemotherapy can be used as adjunctive therapy for tumor patients [49], and have important implications for the treatment of colon cancer. In this study, risk stratification based on the TALRGs signature has been shown to help predict immune and chemotherapy responses. We found that patients with high-risk score had higher expression of immune checkpoint molecules and no sensitivity to multiple chemotherapy drugs of CRC, meaning that high-risk patients are more suitable for immunosuppressive therapy and chemotherapy drugs are more beneficial to low-risk patients. SNP is the most common variant type in the human genome [50]. Our study found SNP was the most variant type in both high-risk and low-risk patients, with the most missense mutations in the common SNP classification and C > T being the main SNV class, which is similar to previously study [17]. Missense mutation can be associated with a number of pathological conditions and can affect susceptibility to disease and drug treatment [51]. These findings suggested that the molecular mechanism of mutational effects can be revealed to understand the occurrence and development of CRC in order to improve the prognosis and clinical treatment of patients. In addition, immunotherapy efficacy is connected with TMB, usually MHC I molecules with high mutation load of tumors can produce more peptide neoantigens, which are recognized as “non-self”, thus triggering T cells to activate and kill tumor cells [52]. However, we found patients with lower TMB had better prognosis. The main mechanism of immune response of TMB is still being explored. More importantly, TALRGs signature was also significantly associated with OS in patients treated with anti-PD-L1. Similar to previous results [16], we speculated that patients with low-risk scores may be more sensitive to immune checkpoint inhibitor therapy. Briefly, based on the TALRGs signature, we can assign more reasonable treatment plans to patients, resulting in improving the survival rate of CRC patients.

Conclusions

In conclusion, TALRGs signature constructed by the combination of T cells and LPS-related genes has a good performance in predicting prognosis and immune response of CRC, providing a promising strategy for guiding individualized therapy and improving prognosis prediction. This could aid in our understanding of the important role of microorganisms-mediated immune responses in the progression and prognosis of CRC.

Author Contributions

Lichao Cao, Ying Ba and Hezi Zhang designed the work; Lichao Cao draft the manuscript; Ying Ba reviewed and edited the manuscript; Hezi Zhang, Ying Ba and Fang Chen prepared the acquisition, analysis, and interpretation of data; Shengrui Zhang prepared figures and tables. All authors reviewed the manuscript.

Conflicts of Interest

All authors declare that the research was conducted without any commercial or financial relationships that could be construed as a potential conflict of interest.

Funding

No funding was used for this paper.

References

View Full Text Download PDF


Copyright © 2025 Rapamycin Press LLC dba Impact Journals
Impact Journals is a registered trademark of Impact Journals, LLC