Single-cell transcriptomic analysis of normal and pathological tissues from the same patient uncovers colon cancer progression
Cell & Bioscience volume 13, Article number: 62 (2023)
The aim of the present study was to elucidate the evolutionary trajectory of colon cells from normal colon mucosa, to adenoma, then to carcinoma in the same microenvironment. Normal colon, adenoma and carcinoma tissues from the same patient were analyzed by single-cell sequencing, which perfectly simulated the process of time-dependent colon cancer due to the same microenvironment. A total of 22 cell types were identified. Results suggest the presence of dominant clones of same cells including C2 goblet cell, epithelial cell subtype 1 (Epi1), enterocyte cell subset 0 (Entero0), and Entero5 in carcinoma. Epi1 and Entero0 were Co-enriched in antibacterial and IL-17 signaling, Entero5 was enriched in immune response and mucin-type O-glycan biosynthesis. We discovered new colon cancer related genes including AC007952.4, NEK8, CHRM3, ANO7, B3GNT6, NEURL1, ODC1 and KCNMA1. The function of TBC1D4, LTB, C2CD4A, AND GBP4/5 in T cells needs to be clarified. We used colon samples from the same person, which provide new information for colon cancer therapy.
We performed single-cell transcriptome sequencing in one case carrying adenoma and carcinoma, and explored process of colon cancer progression.
Results suggest the presence of dominant clones of same cells including C2 goblet cell, epithelial cell subtype 1 (Epi1), enterocyte cell subset 0 (Entero0), and Entero5 in colon carcinoma
We discovered new colon cancer related genes including AC007952.4, NEK8, CHRM3, ANO7, B3GNT6, NEURL1, ODC1 and KCNMA1.
The function of TBC1D4, LTB, C2CD4A, AND GBP4/5 in T cells needs to be clarified.
The occurrence and development of colon cancer is a multi-stage process from adenoma to carcinoma. Colon cancer is an ideal model to study the multi-stages of carcinogenesis . Adenomatous polyps are definite precancerous lesions . Clinical research showed that the adenoma detection rate was inversely associated with the risks of interval colorectal cancer (CRC), advanced-stage interval cancer, and fatal interval cancer [3, 4].
Single-cell RNA sequencing (scRNA-seq) is a powerful analysis tool to reveal the gene expression state in a single cell. Research on colon cancer by scRNA-seq has been partially carried out. CRC showed obvious heterogeneity in single-cell level . Researchers have performed scRNA-seq analyses on immune and stromal populations from patients with CRC and identified specific macrophage and conventional dendritic cell subsets as key mediators of cellular cross-talk in the tumor microenvironment . Li et al.  performed scRNA-seq on the tumors and normal mucosa of patients with CRC, developed a new clustering method called reference component analysis.
Existing scRNA-seq studies of colon cancer have limitations, that is, samples were collected from different patients, and few works collected continuously different pathological stages of colon cancer in the same patient. The tumor microenvironment also varies indifferent patients, and mixed-microenvironment cells must mask this difference. In this study, we obtained the normal colon, adenoma, and carcinoma tissues from the same patient which carries adenoma and carcinoma at the same time and explored via scRNA-seq. We analyzed the overall evolution during transition from normal colon to adenoma and to carcinoma and the cell numbers and expression change patterns of epithelial cells, enterocyte cells, etc. Results describe the overall situation of transcriptome map in single-cell level and reflect the gradual change in colon cancer, which is of great significance to clarify dynamically the occurrence of colon cancer and patient’s individualized medical treatment.
Materials and methods
The normal tissue, adenoma tissue, and colon cancer tissue of the same patient were collected. The patient had not received chemotherapy or radiotherapy before operation. Clinical information was obtained from the patient’s medical record. Patient’s informed consent was obtained. The study was approved by the ethics committee of Xi’an Jiaotong University.
Library construction and sequencing
The prepared single-cell suspension and the antibody solution with the sample label were incubated. The sample was loaded into the U-card chip of BD Rhapsody (BD, USA), the magnetic beads were spread, and the excess beads were washed off. After standing for a period of time, cell lysis was conducted to combine free mRNA with the magnetic beads. The magnetic beads carrying the mRNA of single cells were recovered, reverse transcribed, and used to synthesize cDNA. After the completion of cDNA amplification, the enzyme sections were segmented, end repaired, added with A, and spliced. The amplified cDNA and library were evaluated by Agilent 4200. Finally, the sequencing was completed by pe150 strategy in Novaseq 6000 (Illumina, USA).
Single-cell data processing
A single-cell software suite was used to process the raw data. FastQC software was used to evaluate the quality of the raw sequencing data, and FASTX-Toolkit was used to control the quality of fastq sequence. Seurat 3.0 (R package) was used for clustering. After filtering out cells with gene number < 200, gene number ranked in the top 1% or a mitochondrial gene ratio of > 25%. The dimension was reduced by principal component analysis (PCA), and the data were visualized by TSNE or UMAP.
Copy number variation analysis
InferCNV package (https://github.com/broadinstitute/infercnv, v1.3.3) was used to infer the profiles of copy number variations based on the average expression of large group of genes in each chromosome. Single cell RNA-seq data of the normal tissue was served as the reference for CNV calling. Genes were arranged based on their chromosome coordinates. CNV for each cell was estimated by applying a moving average window with a length of 101 within each chromosome. Single cells from carcinoma tissue were clustered into three groups using the hierarchical clustering method. The cluster whose CNV pattern differs from that of reference cells and meanwhile displays high-frequency copy number alterations was assigned as cancer cells.
Cluster marker recognition, cell type annotation and pseudotime analysis
SingleR was used to analyze cell type annotation. SingleR (https://bioconductor.org/packages/devel/bioc/html/SingleR.html) was used for unbiased cell-type recognition from single-cell RNA sequencing data by leveraging the reference transcriptomic datasets of pure cell types to infer the origin of each single cell independently. Human Cell Landscape reference datasets were used.
Gene set variation analysis (GSVA)
Using hallmark gene sets in MsigDB as the gene set, we applied the standard settings in GSVA software to assign the estimated value of pathway activity to a single cell. We compared the activity score of each cell with the Limma software package and then evaluated the differential activity of pathways between cell subsets. The differential activity of the pathways of each identified cluster was calculated.
KEGG enrichment and DISEASE enrichment (human only) of cluster markers were performed with KOBAS software and Benjamini–Hochberg multiple testing adjustment by using the top 20 marker genes of the cluster. The results were visualized using R package.
Gene expression profiling interactive analysis (GEPIA) analysis
GEPIA software can provide rapid and customizable analysis using TCGA (The Cancer Genome Atlas, https://www.cancer.gov/about-nci/organization/ccg/research/structural-genomics/tcga) and GTEX data. We used GEPIA to detect the expression of (selected genes) in colon cancer and its relationship to the prognosis of colon cancer.
Hematoxylin–eosin staining (HE)
Paraffin sections of normal colon, adenoma, and carcinoma tissues were prepared. The paraffin specimens were treated with xylene and gradient ethanol for dewaxing and hydration. Hematoxylin semen and subsequent 1% hydrochloric acid ethanol were added. Finally, neutral gum sealing was performed. Two pathologists determined the diagnosis independently.
Some IHC results were obtained from The HPA (The Human Protein Atlas, https://www.proteinatlas.org/). Some IHC were performed by our lab. In brief, the paraffin sections were deparaffinized with xylene and hydrated using an alcohol gradient. Endogenous-peroxidase-blocking and antigen retrieval were conducted sequentially. The sections were then incubated with primary antibody followed by secondary antibody conjugated with horseradish peroxidase. Detection was conducted by 3,3′-diaminobenzidine (DAB) and hematoxylin. The score of immunostaining was evaluated by two independent pathologists [8, 9].
R and SPSS 23.0 statistical software were used for data analysis. All data are presented as mean ± SD. Student’s t-test or one-way ANOVA was used to evaluate differences among groups. Data were considered to be statistically significant when P < 0. 05.
Normal colon, adenoma, and carcinoma of the same patient via scRNA-seq
We performed a time sequential single-cell transcriptome profile analysis of colonic lesions at three different pathological stages including colon tissue, adenoma tissue, and carcinoma tissue (Fig. 1A, B). The sample diagnosis was conducted by Hematoxylin–eosin (HE) staining (Fig. 1A). A total of 17,397 cells were obtained. The results of the tSNE test showed the cell population distribution composed of cells from different sample sources (Fig. 1C). According to the expression of canonical markers and T-distribution random neighbor embedding (tSNE), we divided the cells into T cells, B cells, macrophages, dendritic cells, mast cells (and other immune cells), as well as enterocyte cells, epithelial cell, granulocytes cells, fibrocyte cells (and other non-immune cells) (Fig. 1D). KEGG analysis showed, compared with normal samples, carcinoma and adenoma tissues were preferentially enriched and suppressed the pathway of Th17 cell differentiation (Fig. 1E,F, G).
Cell annotation and number composition ratio
Colon cancer cells were identified by using copy number variation analysis, and 22 groups were identified (Fig. 2A, B). The marker genes of the different cell types were presented in Fig. 2C, Additional file 2: Table S1. We stratified 22 groups of cells from three samples (Fig. 2D) and counted the cell proportion (Fig. 2E–I, Additional file 2: Table S2). Firstly, we separate these cells into large clusters, including enterocyte cells, epithelial cells, malignant cells, immune cells, fibrocyte cells. The results showed that in carcinoma tissues, the proportion of enterocyte cells and malignant cells were increased, while the immune cells were decreased. Secondly, we performed the subclustering analysis. In enterocyte cells, the proportion of C2 goblet cells were increased in carcinoma tissues, and proportion of enteroendocrine cells were decreased. In epithelial cells, the proportion of C9 absorptive cells were increased in carcinoma tissues. In immune cells, the proportion of C5 plasma cells, C10 B cells, C13 macrophage cells, C19 granulocyte cells were increased in carcinoma tissues. In T cells, the proportion of C4 effector memory T cells, C14 CD8 + T cells, C16 natural killer T (NKT) cells were increased in carcinoma cells, while C6 regulatory T (Treg) cells, C11 T helper cell were decreased in carcinoma cells.
C2 goblet cells are the key cell group in the process of carcinogenesis
DISEASE enrichment results showed that C2 cells significantly enriched in intestinal cancer, biliary tract cancer, appendix cancer, pseudomyxoma peritonei, bile duct adenocarcinoma, cholangiocarcinoma, bile duct cancer, bile duct carcinoma, colon cancer, et al. (Additional file 1: Fig.S1A). Hence, C2 goblet cells may be the main carcinogenic cell group. KEGG analysis showed that C2 group cells were significantly enriched in mucin-type O − glycan biosynthesis, Vibrio cholerae infection, complement and coagulation cascades, cardiac muscle contraction, hypertrophic cardiomyopathy (HCM). The main enriched functional genes are MUC2 , ST6GALNAC1 , SERPINA1 , and TPM1  (Additional file 1: Fig.S1B). And the 6 coincident genes significantly enriched in adenoma and carcinoma were DEFA6 , SOX9 [15, 16], ID1 , L1TD1 , CXCL3 , and ODC1  (Additional file 1: Fig.S1C, S1D). In addition, the similar analysis was performed in C3 cancer cells (Additional file 1: Fig.S2). The main enriched functional genes in C3 groups are SLC12A2 , ATP1B1 , PLA2G2A , and HBEGF . (Additional file 1: Fig.S2B). And the 9 coincident genes significantly enriched in adenoma and carcinoma were APCDD1, DUOX2, NKD1, LYZ, CXCL1, TFF3, MSX1, CCL20, and IFI6 (Additional file 1: Fig.S2C, S2D).
Gene expression and characteristics of epithelial cells during the progression of normal to adenoma and to carcinoma
We identified 402 epithelial cells in all three stages (Fig. 3A), which were divided into four subtypes, named as Epi0, Epi1, Epi2, and Epi3 (Fig. 3B). Epithelial cells in three different pathological stages were analyzed in layers (Fig. 3C), and the number of cells in each layer was counted (Fig. 3D). In addition, we homogenized the proportion of cells between different groups compared with the normal group (Additional file 3: Table S2). Compared with normal tissues, the proportion of Epi1 cells in the adenoma tissues increased significantly and continued to increase in the carcinoma tissues, while the proportion of Epi3 cells was only increased slightly in carcinoma tissues. In addition, compared with normal tissues, the proportion of cells in Epi0 and Epi2 groups decreased in adenoma tissues and continued to decrease in carcinoma tissues (Fig. 3E). We made a histogram of the expression of top marker genes in different cell subsets at three pathological stages (Fig. 3F) and the heatmap of marker gene in all subsets (Additional file 1: Fig.S3A). We used the data of TCGA and HPA to verify the expression of these marker genes in colon cancer (Fig. 3H). The main marker genes are AQP8 , CA7 , MS4A12 , LRMP , LCN2 , and PI3 , BEST4, SH2D6 (no report is available about BEST4, SH2D6 and colon cancer).
We also conducted enrichment analysis of the four subsets of epithelial cells by GSVA method. Epi0 cells were enriched in myc targets, G2M check point, and E2F targets. Epi1 cells were enriched in bile acid metabolism. Epi2 and Epi3 cells showed similar expression patterns, Co-enriched in hypoxia, TNFa signaling via NFKB, KRAS signaling up, estrogen response, inflammatory response, apical junction, p53 pathway, UV response, interferon gamma response, IL2 stat5 signaling, epithelial mesenchymal transition, and bile acid metabolism (Fig. 3G). We also showed the expression of several marker genes in different epithelial cell subsets in the form of violin diagram (Fig. 3I).
As shown in Fig. 3D, E, the proportion of Epi1 cells increased significantly in the carcinoma tissue, suggesting that this group of cells may play a more important role in the evolution of tumor. The marker gene of the Epi1 subgroup is shown in Additional file 4: Table S3, Additional file 1: Fig.S3B. Some of them are involved in immuno reaction, such as IGKC , IGHA1, IGHA2, and JCHAIN . In addition, evidence of involvement in carcinogenesis was found,some of them function as oncogenes in colon cancer including LCN2 , DUOX2 , CEACAM6 , CD55 , MUC5B , TM4SF1 , REG4 , TFF1 , GDF15 , and ANXA2 . Some genes function as tumor suppressor including PLA2G2A [23, 42], PI3 , and NOS2 . An association was found between TNFRSF6B SNP with Crohn’s disease susceptibility .
Gene expression and characteristics of enterocyte cells during the progression of normal to adenoma and to carcinoma
We identified 4835 enterocyte cells in all three stages (Fig. 4A). These enterocyte cells were divided into 11 subtypes named Entero0 to Entero10 (Fig. 4B). The enterocyte cells in three different pathological stages were analyzed in layers (Fig. 4C), and the number of cells in each layer was counted (Fig. 4D). In addition, we homogenized the proportion of cells in different groups in this sample (Additional file 3: Table S2, Fig. 4E). Compared with the normal tissue, the proportion of Entero4 and Entero5 groups in the adenoma tissue increased significantly and continued to increase significantly in the carcinoma tissue. The proportion of cells in Entero7 group increased significantly in adenoma and carcinoma tissues, while the proportion of cells was very close. Compared with the normal tissue, the proportion of Entero0 cells in the adenoma tissue increased significantly but decreased in the carcinoma tissue.
We made a histogram of the top gene expression of marker genes in different cell subsets at three pathological stages (Fig. 4G) and a heat map of marker genes in all subsets of cells (Additional file 1: Fig.S4A). We used the data of TCGA and HPA to verify the expression of these marker genes in colon cancer (Fig. 4H). The main marker genes are DUOX2 , OLFM4 , MUC2 , DEFA6 , MKI67, GDF15 , SELENBP1 , IGLC1 (no related study with colon cancer has been reported), CA1 , CHGA , and GCG .
Moreover, CA1, CHGA, and GCG decreased significantly from normal to adenoma and continued to decrease significantly from adenoma to carcinoma, indicating that these genes continued to play a role in inhibiting the process from normal to adenoma then to carcinoma. We also conducted enrichment analysis of 11 subsets of enterocyte cells by GSVA method. Cells in Entero0 and Entero5 groups had similar expression patterns, Co-enriched in EMT, TNFa signaling via NFKB, inflammatory response, UV response, hypoxia, apoptosis, p53 pathway, KRAS signaling, et al. and others. Entero1 was enriched in DNA repair, androgen response, MYC targets, and oxidative phosphorylation (Fig. 4F). We also showed the expression of several marker genes in different enterocyte cell subsets in the form of violin diagram (Fig. 4I).
As shown in Fig. 4E, the proportion of Entero0 and Entero5 cells increased significantly in the adenoma tissue, suggesting that these group of cells may play an important role in the evolution of tumor. The marker gene of Entero0 and Entero5 subgroups are shown in Additional file 4: Table S3, Additional file 1: Fig.S4B, S5A.
Gene expression and characteristics of T cells during the progression of normal to adenoma and to carcinoma
We identified 1555 T cells in all three stages (Fig. 5A). These T cells were divided into nine subtypes, named T0 to T8 (Fig. 5B). The T cells in three different pathological stages were analyzed in layers (Fig. 5C), and the marker gene heat map of all subpopulations of cells was prepared Additional file 1: Fig.S6A. The number of cells in each layer was counted (Fig. 5D). In addition, we homogenized the proportion of cells among different groups compared with the normal group (Additional file 3: Table S2, Fig. 5E). Compared with the normal group, the proportion of cells in T1, T2, T4, and T7, especially T4 and T7 groups, decreased significantly, suggesting that the number of cells in T4 and T7 groups decreased during carcinogenesis. Compared with the normal group, the proportion of cells inT0, T3, T5, and T8, especially T8 group, increased significantly, suggesting that the number of cells in the T8 group increased during carcinogenesis.
In Fig. 5F, we selected the top 2 gene in the marker gene of different subpopulations of cells, including CCR6, FOS, CD69, CCL5, CD40LG, GZMK, MIAT, PIM2, KCNQ1OT1, and HES1. Among them, CCR6, FOS, CD69, CCL5, CD40LG, GZMK, MIAT, PIM2 have lower expression in carcinoma tissues compared with normal tissues. However, KCNQ1OT1 and HES1 have higher expression in the carcinoma tissue compared with normal tissues. We selected the top 2 gene of each subgroup marker gene, prepared a bubble chart based on the positive expression cells and expression level (Fig. 5G), and selected the top gene of each subgroup marker gene by tSNE method for display (Fig. 5H).
In addition, based on the significant increase in the proportion of cells in the T8 group in cancer tissues, we further analyzed the marker gene of cells in the T8 group (Additional file 4: Table S3, Additional file 1: Fig.S6B). Among them, CTLA4 , TIGIT , and ICOS  are treated as immune checkpoint. GBP4 might be as potential novel immune checkpoint genes of CRC . TTN , NAMPT , GBP4 , CTSC , and PIM2  may be potential immunotherapy target. FOXP3 , CTLA4 , and PIM2  mainly play inhibitory role to T cells. ICOS , MAF  mainly activate or preserve the function of Treg cells. Moreover, the function of TBC1D4, LTB, C2CD4A, GBP4/5 in T cells needs to be clarified further.
Cell fate differentiation based on enterocyte cells
Basing on the trajectory plots of enterocyte cells differentiated by sample, we can observe bifurcation (Fig. 6A). Pseudotime trajectory analysis reveals 9 states of enterocyte cells (Fig. 6B). 5 distinct cell clusters are present in enterocyte cells. Trajectory plots differentiated by cluster reveal the distribution of each cluster in pseudotime trajectory (Fig. 6C), and data showed that colon cancer cells were derived from enterocyte progenitor cells. By pseudo temporal trajectory analysis of the single-cell transcriptomes, we can observe the evolution of cells. (Fig. 6D). The top100 differentially expressed gene heatmap by pseudotime which might be associated with cancer lineage is shown (Fig. 6E). We found lots of them might play a key role in the transformation of enterocyte progenitor cells to cancer cells such as MALAT1 , B2M , EEF1A1 , RPL5 , B3GNT7 , RHOB  and ACTB  might play a key role in the transformation of enterocyte progenitor cells to cancer cells. Transcription factor regulated these cancer-related genes were presented in Fig. 6F. In addition, cell fate differentiation based on all the cells are shown in Additional file 1: Fig.S7.
The expression and clinical significance of the new discovered genes in colon cancer
We collected the resected carcinoma tissues from CRC patients and conducted immunohistochemical (IHC) staining analysis to examine the expression of some representative genes including NEK8, CHRM3, ANO7, B3GNT6, NEURL1, ODC1 and KCNMA1 in colon normal tissue, adenoma tissue and carcinoma tissues. The immunohistochemistry images below showed that all of these proteins were highly expressed in carcinoma tissues compared with the corresponding normal and adenoma tissues (Fig. 7A–G).
In this study, we used high-throughput scRNA-seq analysis and drew a complete atlas of transitional fate cells, characterized the transcriptome characteristics of cell clusters from normal colon to adenoma, then to carcinoma, and clarified the possible evolution trajectory of cells in the multi-step process. In the evolution process of colon cancer, we mainly focused on the expression transformation of epithelial cells, intestinal cells, and T cells related to tumorigenesis since how they change from normal colon to adenoma to carcinoma phenotype is a long-standing problem.
The constituent ratio of C2 goblet cells was significantly higher in cancer tissues than in normal tissues, suggesting that it may be a key group of carcinogenic cells. DISEASE enrichment results clearly showed that these cells were significantly enriched in a variety of tumors, including gastrointestinal tumors. KEGG analysis showed that C2 goblet cells were specially significantly enriched in the function of mucin type O-glycan biosynthesis (Additional file 1: Fig.S1B). MUC2 , ST6GALNAC1 , SERPINA1 , and TPM1  play key roles in C2 goblet cells. In C2 goblet cells, during the process from normal colon to adenoma and from adenoma to carcinoma, several continuously highly expressed genes were found, including DEFA6 , SOX9 [15, 16], ID1 , L1TD1 , CXCL3 , and ODC1 . No report is available about ODC1, and colon cancer. Further studies should explore the association among ODC1, and colon cancer.
By analyzing the subsets of epithelial cell, the cells in Epi1 group highly increased in cancer tissue, suggesting that Epi1 group cells may play a key carcinogenic role in epithelial cell population. GO enrichment analysis of marker gene of Epi1 group cells found that, IGKC, LCN2, PI3, JCHAIN, IGHA1, PLA2G2A, CD55, and IGHA2 are related to humoral immune response. LCN2, PI3, JCHAIN, IGHA1, PLA2G2A, and IGHA2 are related to antimicrobial humoral response. IGKC, LCN2, PI3, JCHAIN, IGHA1, PLA2G2A, IGHA2, and NOS2 are related to defense response to bacterium. IGKC, HBA2, HBA1, JCHAIN, IGHA1, IGHA2, and ANXA2 are related to receptor-mediated endocytosis (Additional file 1: Fig.S3C). KEGG analysis showed that LCN2, MUC5B are related to IL-17 signaling pathway (Additional file 1: Fig.S3D). These enrichments suggesting that antimicrobial and IL-17 signaling pathway are very important in Epi1 cells, and IGKC, LCN2, PI3, JCHAIN, IGHA1, PLA2G2A, CD55, IGHA2, NOS2, ANXA2, and MUC5B are the key oncogenes in Epi1 cells.
By analyzing the subsets of enterocyte cells, we found that most of these subsets are involved in the process of carcinogenesis. We think Entero0 and Entero5 subsets were the more important cell groups of enterocyte cells, since the cell percentages were significantly increased in Entero0 and Entero5 in carcinoma tissues compare to normal colon. Observing the top 20 marker genes of Entero0, GO analysis showed that LCN2, PI3, NOS2, PLA2G2A, CCL20 are related with defense response to bacterium. LCN2, PI3, CXCL1, PLA2G2A are related with humoral immune response and antimicrobial humoral response (Additional file 1: Fig.S4C). KEGG analysis showed that these genes enriched in IL-17 signaling pathway, pancreatic secretion, and amoebiasis (Additional file 1: Fig.S4D). These findings suggested that antimicrobial and IL-17 signaling pathway are the most important signaling in Entero0, and LCN2, PI3, NOS2, PLA2G2A, CCL20, and CXCL1 were the key oncogenes. Observing the Top 20 marker genes of Entero5, the GO analysis showed that DEFA6, LCN2, WFDC2, and SPNS2 are related to humoral immune response (Additional file 1: Fig.S5B). KEGG analysis showed that genes including ST6GALNAC1 and B3GNT6 are enriched in mucin type O-glycan biosynthesis, CLCA1 and KCNMA1 enriched in renin secretion and pancreatic secretion (Additional file 1: Fig.S5C). These findings suggested that immune response and mucin type O-glycan biosynthesis are the very important signaling in Entero5. And DEFA6, LCN2, WFDC2, SPNS2, ST6GALNAC1, B3GNT6, CLCA1, and KCNMA1 may play key roles in colon carcinogenesis. Moreover, the association between KCNMA1, B3GNT6, and colon cancer needs to be explored further. we performed copy number variation (CNV) analysis and identified cancer cells harboring chromosomal abnormalities. These cancer cells were grouped together with the enterocyte progenitor, goblet, and enteroendocrine cells at low resolution (Fig. 1D).
As a whole, the number of T cell subsets decreased significantly from normal to adenoma to carcinoma, suggesting that the decline of T cell function is an important factor in the development of tumor. While observing the cell number composition ratio of T cell subsets, we found the composition ratio of cell number were varied from normal colon to adenoma to carcinoma. Compared with the cell proportion in normal colon, T1, T2, T4, and T7 cell sets were significantly decreased in carcinoma tissues, especially T4 and T7, while the cell proportion of T0, T3, T5 and T8 cell sets increased significantly, especially T8. Observing the marker gene of T8 group cells (Additional file 4: Table S3, Additional file 1: Fig.S6B), GO enrichment analysis showed that FOXP3, CTLA4, TIGIT, TNFRSF1B, and ICOS are enriched in T cell activation and regulation of T cell activation, FOXP3, BIRC3, CTSC, TNFRSF1B, and C2CD4A are enriched in regulation of inflammatory response (Additional file 1: Fig.S6C). KEGG analysis showed that BIRC3, NAMPT, GBP4, and GBP5 are enriched in NOD-like receptor signaling pathway. LTB, TNFRSF1B, CXCR6, and CCL20 are enriched in cytokine–cytokine receptor interaction (Additional file 1: Fig.S6D).
In conclusion, we performed single-cell transcriptomic analysis on the same patient who carried normal colon tissue, adenoma tissue, and carcinoma tissue at the same time. Due to the same microenvironment, we perfectly revealed the evolution process of time-dependent colon cancer. By analyzing various cell types at different pathological stages, we drew the detailed evolution map, and some key expression characteristics were identified. Our results confirmed that there were dominant clones of similar cells in the process of carcinogenesis, and the dominant cloned cells in this study were C2 goblet cell group. In addition, there were also dominant clones of similar cells in epithelial cells and enterocyte cells, Epi1, Entero0, and Entero5 groups. After comprehensive analysis of the enriched genes of the dominant cloned cells in this study, we found that mucin type O-glycan biosynthesis, antibacterial, and IL-17 signaling pathway are important pathways of colon carcinogenesis, especially mucin type O-glycan biosynthesis, which deserves attention in further study. Mucins are the main components of mucus, which is secreted by goblet cells and forms a protective homeostatic barrier between the resident microbiota and the underlying immune cells in the colon, mucin type O-glycan and goblet cells have made great contributions to maintaining the homeostasis of colonic mucosa [66, 67]. It is suggested that if we focus on the function of goblet cells and mucin-type O-glycanbiosynthesis, there may be important discoveries in the prevention and treatment of colon cancer. At the same time, some genes including AC007952.4, NEK8, CHRM3, ANO7, B3GNT6, NEURL1, ODC1, and KCNMA1 may play critical role in colon carcinogenesis, which need to be studied further.
Recio-Boiles A, Cagir B. Colon Cancer. Florida: StatPearls Treasure; 2021.
Adamski S, Rutkowski J. Adenomatous polyps of the colon and rectum as precancerous lesions. Gastroenterologia. 1956;86:370–2.
Corley DA, Levin TR, Doubeni CA. Adenoma detection rate and risk of colorectal cancer and death. N Engl J Med. 2014;370:2541.
Kaminski MF, Wieszczy P, Rupinski M, Wojciechowska U, Didkowska J, Kraszewska E, Kobiela J, Franczyk R, Rupinska M, Kocot B, et al. Increased rate of adenoma detection associates with reduced risk of colorectal cancer and death. Gastroenterology. 2017;153:98–105.
Potter SS. Single-cell RNA sequencing for the study of development, physiology and disease. Nat Rev Nephrol. 2018;14:479–92.
Zhang L, Li Z, Skrzypczynska KM, Fang Q, Zhang W, O’Brien SA, He Y, Wang L, Zhang Q, Kim A, et al. Single-Cell analyses inform mechanisms of myeloid-targeted therapies in colon cancer. Cell. 2020;181(442–459): e429.
Li H, Courtois ET, Sengupta D, Tan Y, Chen KH, Goh JJL, Kong SL, Chua C, Hon LK, Tan WS, et al. Reference component analysis of single-cell transcriptomes elucidates cellular heterogeneity in human colorectal tumors. Nat Genet. 2017;49:708–18.
Sun R, Liu Z, Han L, Yang Y, Wu F, Jiang Q, Zhang H, Ma R, Miao J, He K, et al. miR-22 and miR-214 targeting BCL9L inhibit proliferation, metastasis, and epithelial-mesenchymal transition by down-regulating Wnt signaling in colon cancer. FASEB J. 2019;33:5411–24.
Sun R, Liu Z, Tong D, Yang Y, Guo B, Wang X, Zhao L, Huang C. miR-491-5p, mediated by Foxi1, functions as a tumor suppressor by targeting Wnt3a/beta-catenin signaling in the development of gastric cancer. Cell Death Dis. 2017;8: e2714.
Cecchini MJ, Walsh JC, Parfitt J, Chakrabarti S, Correa RJ, MacKenzie MJ, Driman DK. CDX2 and Muc2 immunohistochemistry as prognostic markers in stage II colon cancer. Hum Pathol. 2019;90:70–9.
Kvorjak M, Ahmed Y, Miller ML, Sriram R, Coronnello C, Hashash JG, Hartman DJ, Telmer CA, Miskov-Zivanov N, Finn OJ, et al. Cross-talk between colon cells and macrophages increases ST6GALNAC1 and MUC1-sTn expression in ulcerative colitis and colitis-associated colon cancer. Cancer Immunol Res. 2020;8:167–78.
Kwon CH, Park HJ, Choi JH, Lee JR, Kim HK, Jo HJ, Kim HS, Oh N, Song GA, Park DY. Snail and serpinA1 promote tumor progression and predict prognosis in colorectal cancer. Oncotarget. 2015;6:20312–26.
Mlakar V, Berginc G, Volavsek M, Stor Z, Rems M, Glavac D. Presence of activating KRAS mutations correlates significantly with expression of tumour suppressor genes DCN and TPM1 in colorectal cancer. BMC Cancer. 2009;9:282.
Jeong D, Kim H, Kim D, Ban S, Oh S, Ji S, Kang D, Lee H, Ahn TS, Kim HJ, et al. Defensin alpha 6 (DEFA6) is a prognostic marker in colorectal cancer. Cancer Biomark. 2019;24:485–95.
Lizarraga-Verdugo E, Carmona TG, Ramos-Payan R, Avendano-Felix M, Bermudez M, Parra-Niebla M, Lopez-Camarillo C, Fernandez-Figueroa E, Lino-Silva L, Saavedra HA, et al. SOX9 is associated with advanced T-stages of clinical stage II colon cancer in young Mexican patients. Oncol Lett. 2021;22:497.
Marcker Espersen ML, Linnemann D, Christensen IJ, Alamili M, Troelsen JT, Hogdall E. SOX9 expression predicts relapse of stage II colon cancer patients. Hum Pathol. 2016;52:38–46.
O’Brien CA, Kreso A, Ryan P, Hermans KG, Gibson L, Wang Y, Tsatsanis A, Gallinger S, Dick JE. ID1 and ID3 regulate the self-renewal capacity of human colon cancer-initiating cells through p21. Cancer Cell. 2012;21:777–92.
Chakroborty D, Emani MR, Klen R, Bockelman C, Hagstrom J, Haglund C, Ristimaki A, Lahesmaa R, Elo LL. L1TD1 - a prognostic marker for colon cancer. BMC Cancer. 2019;19:727.
Zhao QQ, Jiang C, Gao Q, Zhang YY, Wang G, Chen XP, Wu SB, Tang J. Gene expression and methylation profiles identified CXCL3 and CXCL8 as key genes for diagnosis and prognosis of colon adenocarcinoma. J Cell Physiol. 2020;235:4902–12.
Symes AJ, Eilertsen M, Millar M, Nariculam J, Freeman A, Notara M, Feneley MR, Patel HR, Masters JR, Ahmed A. Quantitative analysis of BTF3, HINT1, NDRG1 and ODC1 protein over-expression in human prostate cancer tissue. PLoS ONE. 2013;8: e84295.
Merli AM, Vieujean S, Massot C, Bletard N, Quesada Calvo F, Baiwir D, Mazzucchelli G, Servais L, Wera O, Oury C, et al. Solute carrier family 12 member 2 as a proteomic and histological biomarker of dysplasia and neoplasia in ulcerative colitis. J Crohns Colitis. 2020. https://doi.org/10.1093/ecco-jcc/jjaa168.
Yajima S, Ishii M, Matsushita H, Aoyagi K, Yoshimatsu K, Kaneko H, Yamamoto N, Teramoto T, Yoshida T, Matsumura Y, et al. Expression profiling of fecal colonocytes for RNA-based screening of colorectal cancer. Int J Oncol. 2007;31:1029–37.
Fijneman RJ, Bade LK, Peham JR, van de Wiel MA, van Hinsbergh VW, Meijer GA, O’Sullivan MG, Cormier RT. Pla2g2a attenuates colon tumorigenesis in azoxymethane-treated C57BL/6 mice; expression studies reveal Pla2g2a target genes and pathways. Cell Oncol. 2009;31:345–56.
Yotsumoto F, Yagi H, Suzuki SO, Oki E, Tsujioka H, Hachisuga T, Sonoda K, Kawarabayashi T, Mekada E, Miyamoto S. Validation of HB-EGF and amphiregulin as targets for human cancer therapy. Biochem Biophys Res Commun. 2008;365:555–61.
Wu Q, Yang ZF, Wang KJ, Feng XY, Lv ZJ, Li Y, Jian ZX. AQP8 inhibits colorectal cancer growth and metastasis by down-regulating PI3K/AKT signaling and PCDH7 expression. Am J Cancer Res. 2018;8:266–79.
Li L, Dang Q, Xie H, Yang Z, He D, Liang L, Song W, Yeh S, Chang C. Infiltrating mast cells enhance prostate cancer invasion via altering LncRNA-HOTAIR/PRC2-androgen receptor (AR)-MMP9 signals and increased stem/progenitor cell population. Oncotarget. 2015;6:14179–90.
Koslowski M, Sahin U, Dhaene K, Huber C, Tureci O. MS4A12 is a colon-selective store-operated calcium channel promoting malignant cell processes. Cancer Res. 2008;68:3458–66.
Spisak S, Kalmar A, Galamb O, Wichmann B, Sipos F, Peterfia B, Csabai I, Kovalszky I, Semsey S, Tulassay Z, et al. Genome-wide screening of genes regulated by DNA methylation in colon cancer development. PLoS ONE. 2012;7: e46215.
Xu WX, Zhang J, Hua YT, Yang SJ, Wang DD, Tang JH. An integrative pan-cancer analysis revealing LCN2 as an oncogenic immune protein in tumor microenvironment. Front Oncol. 2020;10: 605097.
Wilkinson TS, Roghanian A, Simpson AJ, Sallenave JM. WAP domain proteins as modulators of mucosal immunity. Biochem Soc Trans. 2011;39:1409–15.
Pandey JP, Namboodiri AM, Armeson KE, Iwasaki M, Kasuga Y, Hamada GS, Tsugane S. IGHG, IGKC, and FCGR genes and endogenous antibody responses to GARP in patients with breast cancer and matched controls. Hum Immunol. 2018;79:632–7.
Wang Y, Shan Q, Hou G, Zhang J, Bai J, Lv X, Xie Y, Zhu H, Su S, Li Y, et al. Discovery of potential colorectal cancer serum biomarkers through quantitative proteomics on the colonic tissue interstitial fluids from the AOM-DSS mouse model. J Proteomics. 2016;132:31–40.
Zhang X, Han J, Feng L, Zhi L, Jiang D, Yu B, Zhang Z, Gao B, Zhang C, Li M, et al. DUOX2 promotes the progression of colorectal cancer cells by regulating the AKT pathway and interacting with RPL3. Carcinogenesis. 2021;42:105–17.
Liu Y, Xia T, Jin C, Gu D, Yu J, Shi W, Zhang KE, Zhang L, Ye J, Li L. FOXP3 and CEACAM6 expression and T cell infiltration in the occurrence and development of colon cancer. Oncol Lett. 2016;11:3693–701.
Shang Y, Chai N, Gu Y, Ding L, Yang Y, Zhou J, Ren G, Hao X, Fan D, Wu K, et al. Systematic immunohistochemical analysis of the expression of CD46, CD55, and CD59 in colon cancer. Arch Pathol Lab Med. 2014;138:910–9.
Lahdaoui F, Messager M, Vincent A, Hec F, Gandon A, Warlaumont M, Renaud F, Leteurtre E, Piessen G, Jonckheere N, et al. Depletion of MUC5B mucin in gastrointestinal cancer cells alters their tumorigenic properties: implication of the Wnt/beta-catenin pathway. Biochem J. 2017;474:3733–46.
Tang Q, Chen J, Di Z, Yuan W, Zhou Z, Liu Z, Han S, Liu Y, Ying G, Shu X, et al. TM4SF1 promotes EMT and cancer stemness via the Wnt/beta-catenin/SOX2 pathway in colorectal cancer. J Exp Clin Cancer Res. 2020;39:232.
Rafa L, Dessein AF, Devisme L, Buob D, Truant S, Porchet N, Huet G, Buisine MP, Lesuffleur T. REG4 acts as a mitogenic, motility and pro-invasive factor for colon cancer cells. Int J Oncol. 2010;36:689–98.
Yusufu A, Shayimu P, Tuerdi R, Fang C, Wang F, Wang H. TFF3 and TFF1 expression levels are elevated in colorectal cancer and promote the malignant behavior of colon cancer by activating the EMT process. Int J Oncol. 2019;55:789–804.
Modi A, Dwivedi S, Roy D, Khokhar M, Purohit P, Vishnoi J, Pareek P, Sharma S, Sharma P, Misra S. Growth differentiation factor 15 and its role in carcinogenesis: an update. Growth Factors. 2019;37:190–207.
Rocha MR, Barcellos-de-Souza P, Sousa-Squiavinato ACM, Fernandes PV, de Oliveira IM, Boroni M, Morgado-Diaz JA. Annexin A2 overexpression associates with colorectal cancer invasiveness and TGF-ss induced epithelial mesenchymal transition via Src/ANXA2/STAT3. Sci Rep. 2018;8:11285.
Fijneman RJ, Cormier RT. The roles of sPLA2-IIA (Pla2g2a) in cancer of the small and large intestine. Front Biosci. 2008;13:4144–74.
Ambs S, Merriam WG, Bennett WP, Felley-Bosco E, Ogunfusika MO, Oser SM, Klein S, Shields PG, Billiar TR, Harris CC. Frequent nitric oxide synthase-2 expression in human colon adenomas: implication for tumor angiogenesis and colon cancer progression. Cancer Res. 1998;58:334–41.
Wagner J, Sim WH, Ellis JA, Ong EK, Catto-Smith AG, Cameron DJ, Bishop RF, Kirkwood CD. Interaction of Crohn’s disease susceptibility genes in an Australian paediatric cohort. PLoS ONE. 2010;5: e15376.
Wang XY, Chen SH, Zhang YN, Xu CF. Olfactomedin-4 in digestive diseases: a mini-review. World J Gastroenterol. 2018;24:1881–7.
Kim H, Kang HJ, You KT, Kim SH, Lee KY, Kim TI, Kim C, Song SY, Kim HJ, Lee C, et al. Suppression of human selenium-binding protein 1 is a late event in colorectal carcinogenesis and is associated with poor survival. Proteomics. 2006;6:3466–76.
Liu Z, Bai Y, Xie F, Miao F, Du F. Comprehensive analysis for identifying diagnostic and prognostic biomarkers in colon adenocarcinoma. DNA Cell Biol. 2020;39:599–614.
Zhang X, Zhang H, Shen B, Sun XF. Chromogranin-a expression as a novel biomarker for early diagnosis of colon cancer patients. Int J Mol Sci. 2019. https://doi.org/10.3390/ijms20122919.
Chen DS, Mellman I. Elements of cancer immunity and the cancer-immune set point. Nature. 2017;541:321–30.
Harjunpaa H, Guillerey C. TIGIT as an emerging immune checkpoint. Clin Exp Immunol. 2020;200:108–19.
Amatore F, Gorvel L, Olive D. Role of Inducible Co-Stimulator (ICOS) in cancer immunotherapy. Expert Opin Biol Ther. 2020;20:141–50.
Xu L, Pelosof L, Wang R, McFarland HI, Wu WW, Phue JN, Lee CT, Shen RF, Juhl H, Wu LH, et al. NGS evaluation of colorectal cancer reveals interferon gamma dependent expression of immune checkpoint genes and identification of novel IFNgamma Induced Genes. Front Immunol. 2020;11:224.
Jia Q, Wang J, He N, He J, Zhu B. Titin mutation associated with responsiveness to checkpoint blockades in solid tumors. JCI Insight. 2019. https://doi.org/10.1172/jci.insight.127901.
Lv H, Lv G, Chen C, Zong Q, Jiang G, Ye D, Cui X, He Y, Xiang W, Han Q, et al. NAD(+) metabolism maintains Inducible PD-L1 expression to drive tumor immune evasion. Cell Metab. 2021;33(110–127): e115.
Guay D, Beaulieu C, Percival MD. Therapeutic utility and medicinal chemistry of cathepsin C inhibitors. Curr Top Med Chem. 2010;10:708–16.
Daenthanasanmak A, Wu Y, Iamsawat S, Nguyen HD, Bastian D, Zhang M, Sofi MH, Chatterjee S, Hill EG, Mehrotra S, et al. PIM-2 protein kinase negatively regulates T cell responses in transplantation and tumor immunity. J Clin Invest. 2018;128:2787–801.
Kim CH. FOXP3 and its role in the immune system. Adv Exp Med Biol. 2009;665:17–29.
Neumann C, Blume J, Roy U, Teh PP, Vasanthakumar A, Beller A, Liao Y, Heinrich F, Arenzana TL, Hackney JA, et al. c-Maf-dependent Treg cell control of intestinal TH17 cells and IgA establishes host-microbiota homeostasis. Nat Immunol. 2019;20:471–81.
Zheng X, Ren J, Peng B, Ye J, Wu X, Zhao W, Li Y, Chen R, Gong X, Bai C, et al. MALAT1 overexpression promotes the growth of colon cancer by repressing beta-catenin degradation. Cell Signal. 2020;73: 109676.
Wang H, Liu B, Wei J. Beta2-microglobulin(B2M) in cancer immunotherapies: biological function, resistance and remedy. Cancer Lett. 2021;517:96–104.
Joung EK, Kim J, Yoon N, Maeng LS, Kim JH, Park S, Kang K, Kim JS, Ahn YH, Ko YH, et al. Expression of EEF1A1 is associated with prognosis of patients with colon adenocarcinoma. J Clin Med. 2019. https://doi.org/10.3390/jcm8111903.
Zhang H, Liu J, Dang Q, Wang X, Chen J, Lin X, Yang N, Du J, Shi H, Liu Y, et al. Ribosomal protein RPL5 regulates colon cancer cell proliferation and migration through MAPK/ERK signaling pathway. BMC Mol Cell Biol. 2022;23:48.
Lu CH, Wu WY, Lai YJ, Yang CM, Yu LC. Suppression of B3GNT7 gene expression in colon adenocarcinoma and its potential effect in the metastasis of colon cancer cells. Glycobiology. 2014;24:359–67.
Wei LJ, Li JA, Bai DM, Song Y. miR-223-RhoB signaling pathway regulates the proliferation and apoptosis of colon adenocarcinoma. Chem Biol Interact. 2018;289:9–14.
Guo C, Liu S, Wang J, Sun MZ, Greenaway FT. ACTB in cancer. Clin Chim Acta. 2013;417:39–44.
Bergstrom KS, Xia L. Mucin-type O-glycans and their roles in intestinal homeostasis. Glycobiology. 2013;23:1026–37.
Pelaseyed T, Bergstrom JH, Gustafsson JK, Ermund A, Birchenough GM, Schutte A, van der Post S, Svensson F, Rodriguez-Pineiro AM, Nystrom EE, et al. The mucus and mucins of the goblet cells and enterocytes provide the first defense line of the gastrointestinal tract and interact with the immune system. Immunol Rev. 2014;260:8–20.
This study was founded by National Natural Science Foundation of China (81802788), Natural Science Basic Research Program of Shaanxi (2023-JC-QN-0876), Key Research and Development Program of Shaanxi (2023-YBSF-447), Postdoctoral Science Foundation of China (2019T120918, 2017M620458), Postdoctoral Science Foundation of Shaanxi Province, China (2018BSHEDZZ91), the Fundamental Research Funds for the Central Universities (xzy012020056), Basic-Clinical Joint & Innovative Project of the First Affiliated Hospital of Xi’an Jiaotong University(2022041), Icubation Project of National Natural Science Foundation of Shaanxi Provincial tumor hospital (2021-ZLYY-GKJA-008).
The authors declare no potential conflicts of interest.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Yang Yang, Zhengtao Xiao, Dangxia Zhou and Chen Huang are the co-correspondence authors
. Enrichments analysis in cluster 2 goblet cells. A. Disease enrichment of the genes in cluster 2 goblet cells. B. KEGG enrichment of the genes in cluster 2. C. The Top 20 genes in adenoma tissue compare to normal tissue in cluster 2. D. The Top 20 genes in adenoma tissue compare to normal tissue in cluster 2. Figure.S2. Enrichments analysis in cluster 3 colon cancer cells. A. Disease enrichment of the genes in cluster 3 colon cancer cells. B. KEGG enrichment of the genes in cluster 3. C. The top 20 genes in adenoma tissue compare to normal tissue in cluster 3. D. The Top 20 genes in adenoma tissue compare to normal tissue in cluster 3. Figure S3. characteristics of epithelial cell subsets and Epi1. A. The maker genes of the four epithelial cell subsets. B. The top marker genes of the Epi1 subset. C. The most enriched signaling of Epi1 by GO enrichment analysis. D. The most enriched signaling of Epi1 by KEGG enrichment analysis. Figure.S4. characteristics of enterocyte cell subsets and Entero0. A. The maker genes of the eleven epithelial cell subsets. B. The top marker genes of the Entero0 subset. C. The most enriched signaling of Entero0 by GO enrichment analysis. D. The most enriched signaling of Entero0 by KEGG enrichment analysis. Figure.S5. characteristics of enterocyte cell subsets and Entero5. A The top marker genes of the Entero5 subset. B. The most enriched signaling of Entero5 by GO enrichment analysis. C. The most enriched signaling of Entero5 by KEGG enrichment analysis. Figure.S6. characteristics of T cell subsets and T8. A The maker genes of the nine T cell subsets. B. The top marker genes of the T8 subset. C. The most enriched signaling of T8 by GO enrichment analysis. D. The most enriched signaling of T8 by KEGG enrichment analysis. Figure.S7. Cell fate differentiation based on all the cells. A. Single cell trajectories differentiated by samples. B. Single cell trajectories differentiated by states. C. Single cell trajectories differentiated by clusters. D. Cell pseudo temporal trajectory analysis. E. Trajectory graph showed by each sample. F. The top50 different expressed gene heatmap by pseudotime. G. The top50 different expressed gene heatmap by branch.
The marker genes of the different cell types.
The cell number and proportion of all types of cells.
The Top20 genes of cell subsets.
About this article
Cite this article
Sun, R., Yang, Y., Lü, W. et al. Single-cell transcriptomic analysis of normal and pathological tissues from the same patient uncovers colon cancer progression. Cell Biosci 13, 62 (2023). https://doi.org/10.1186/s13578-023-01002-w