Identifying Comprehensive Genomic Alterations and Potential Neoantigens for Cervical Cancer Immunotherapy in a Cohort of Chinese Squamous Cell Carcinoma of the Cervix

Meng Wu; Jialu Zhou; Zhe Zhang; Yuanguang Meng

doi:10.3967/bes2024.064

Objective Genomic alterations and potential neoantigens for cervical cancer immunotherapy were identified in a cohort of Chinese patients with cervical squamous cell carcinoma (CSCC). Methods Whole-exome sequencing was used to identify genomic alterations and potential neoantigens for CSCC immunotherapy. RNA Sequencing was performed to analyze neoantigen expression. Results Systematic bioinformatics analysis showed that C>T/G>A transitions/transversions were dominant in CSCCs. Missense mutations were the most frequent types of somatic mutation in the coding sequence regions. Mutational signature analysis detected signature 2, signature 6, and signature 7 in CSCC samples. PIK3CA, FBXW7, and BICRA were identified as potential driver genes, with BICRA as a newly reported gene. Genomic variation profiling identified 4,960 potential neoantigens, of which 114 were listed in two neoantigen-related databases. Conclusion The present findings contribute to our understanding of the genomic characteristics of CSCC and provide a foundation for the development of new biotechnology methods for individualized immunotherapy in CSCC.

HTML

INTRODUCTION

Cervical cancer (CC) is one of the most common gynecological malignancies; approximately 604,127 new cases were reported worldwide in 2020, of which nearly 85% occurred in low- and middle-income countries^[1,2]. In China, the mortality rate of CC in 2022 was as high as 55.07%^[3]. Cervical squamous cell carcinoma (CSCC) accounts for approximately 70% of all CCs worldwide, which is significantly higher than the rate of cervical adenocarcinoma^[4]. Although surgery is an effective treatment for early-stage CC, 20%–30% of CC patients in advanced stages experience recurrence or distant metastasis after chemotherapy administered concurrently with radiation therapy, and the 5-year survival rate is only 5%–15%^[5]. A comprehensive description of the genomic and molecular characteristics of 228 primary CCs (including 144 CSCC cases) was reported in The Cancer Genome Atlas (TCGA) research network, although only 18 Asians were included^[6]. CSCC remains a disease with high morbidity and mortality in China, and the number of sequencing samples available for evaluating genomic alterations in CSCC is small. Therefore, it is necessary to establish a comprehensive genetic profile of the Chinese population with a larger sample size through next-generation sequencing.

Most CSCCs are caused by high-risk subtypes of human papillomavirus (HPV), and the disease could therefore be prevented through well-established screening and vaccination programs^[7]. However, the prevalence and mortality of CSCC have remained relatively high because of low vaccination rates. In addition to increasing the rate of vaccination, the development of new treatments for CSCC is critical. In recent years, immunotherapy has emerged as a potentially effective therapeutic approach. Immune checkpoint inhibitors (ICIs) have been successfully applied to the treatment of different types of cancer, such as gastric, lung, and head and neck cancer, which has significantly prolonged the survival of patients^[8-10]. Pembrolizumab was the first Food and Drug Administration (FDA)-approved first-line immunotherapy for CC; however, it showed positive results in a minority of CC patients^[11]. In addition to ICIs, several immunotherapeutic approaches for advanced cervical cancer, including HPV therapeutic vaccines and adoptive cellular therapy, are under study, and promising efficacy data are emerging^[12,13].

The genetic instability of tumor cells often leads to numerous somatic mutations, and the expression of non-synonymous mutations generates tumor-specific antigens called neoantigens^[14]. Neoantigens are presented by human leukocyte antigen (HLA) class I/II molecules and can activate CD8+ and CD4+ T cells, resulting in the induction of immune responses^[15]. Because neoantigens are not expressed in normal tissues and are highly immunogenic, they have emerged as novel targets for tumor immunotherapy. Extensive analysis of the genomic variations in CC identified a close association with mutated genes such as PIK3CA, FBXW7, EP300, MLL3, CASP8, and FADD^[¹⁶^]. However, research aimed at identifying CSCC neoantigens through the analysis of genetic mutations is lacking.

In this study, we explored the genomic characteristics of CSCC using whole-exome sequencing (WES) data and identified potential neoantigens by WES and RNA sequencing (RNA-seq). The present findings improve our understanding of genetic alterations in CSCC and identify relevant neoantigens, which may provide effective immunotherapeutic targets for the treatment of CSCC.

MATERIALS AND METHODS

Patient Material

Written informed consent was obtained from each individual before enrollment in the study. Primary tumor tissues and peripheral blood were collected from patients diagnosed with CSCC at the Obstetrics and Gynecology Department of the Chinese Liberation Army General Hospital between January 1, 2021 and May 1, 2022. Sample collection, HPV typing, and pathological examination were performed according to the guidelines or requirements of the patients. Data on clinical characteristics were collected from medical records.

The study included 60 samples from 30 CSCC patients. Twenty-nine surgically resected tumor tissues (4–6 g per sample) were snap-frozen in liquid nitrogen and stored at –80 °C, and one was paraffin-embedded. Additionally, 30 blood samples (3–5 mL per sample) were collected as controls and stored in anticoagulant tubes containing ethylenediaminetetraacetic acid. The blood and anticoagulant were thoroughly mixed and stored at –20 °C. All slides were diagnosed by two experienced pathologists using hematoxylin and eosin (H&E) staining.

DNA Extraction and Whole-Exome Sequencing

Tumor and matched normal DNA were extracted using the TIANamp Genomic DNA Kit (DP304, TIANGEN, Beijing, China) using fresh-frozen tumor tissues and paired blood samples. The QIAamp DNA FFPE Tissue Kit (56404, Qiagen, Hilden, Germany) was used to extract genomic DNA (gDNA) from formalin-fixed paraffin-embedded (FFPE) tissues. To ensure the quality of the gDNA, two methods were employed: (1) Agarose gel electrophoresis to analyze the degree of DNA degradation and contamination; and (2) Qubit® 3.0 Fluorometer (Invitrogen, USA) to quantify the DNA concentration accurately. Finally, DNA samples with a gDNA concentration ≥ 20 ng/µL and a minimum of 0.4 µg gDNA per sample were used for library construction.

WES was performed using 0.4 µg of gDNA from each sample. Library construction and capture experiments were performed using the Agilent SureSelect Human All Exon V6 Kit (Cat. No. 5190–8864, Agilent Technologies, Santa Clara, CA, USA) following the manufacturer’s instructions. During this process, index codes were added to each sample. The gDNA was fragmented into pieces measuring approximately 180–280 bp using a hydrodynamic shearing system (Covaris, Massachusetts, USA). After end repair, phosphorylation, and A-tailing, adapter oligonucleotides were added to create libraries. High-fidelity polymerase was used to amplify DNA fragments with ligated adaptor molecules on both ends to ensure sufficient library volume. The libraries were then hybridized with a solution of biotin-labeled probes, and exons were captured using streptomycin magnetic beads. In preparation for sequencing, PCR was used to add index tags to the captured libraries. After purification and quantification, the index-encoded samples were clustered using the HiSeq PE Cluster Kit V2.5 (Illumina). The DNA libraries were sequenced on the Illumina HiSeq X-TEN platform (San Diego, CA, USA), generating 150 bp paired-end reads after cluster generation.

RNA Extraction and RNA Sequencing

To create the library, a minimum of 1 µg of total RNA was necessary. RNA was extracted from fresh tumor tissues using the Qiagen RNeasy Mini kit (74106, Qiagen, Germany) and RNA-Seq libraries were prepared using the NEBNext® UltraTM RNA Library Prep Kit (E7530L, Illumina, USA). The library was quantified using the Qubit2.0 Fluorometer detection kit (Q32866, Invitrogen, USA) and diluted to 1.5 ng/µL. The Agilent 2100 bioanalyzer (2100, Agilent, USA) was used to detect the insert size of the library and accurately determine the effective concentration (higher than 2 nmol/L) to ensure quality. Once the library passed inspection, it was categorized according to the effective concentration and target off-machine data volume. Finally, Illumina sequencing was performed, which generated 150 bp paired-end reads.

Sequencing Data Analysis and Variation Identification

Raw data were preprocessed by Fastp (v.0.23.4, https://github.com/OpenGene/fastp) to obtain clean data using the following steps: (1) adapter trimming; (2) removal of reads with > 10% N bases; (3) removal of reads in which > 50% of the length contained low-quality bases (quality threshold < 5); and (4) sliding window trimming, in which the bases with an average quality below the cutoff value (default was 20) in the sliding window (default was 4 bp) were cut. BWA^[17], Picard (http://broadinstitute.github.io/picard/), and GATK tools^[18] were used for read alignment, variant calling, and identification of single-nucleotide variants (SNVs) and small insertions and deletions (InDels). Default parameters were used in all software programs for identifying mutations in matched normal-tumor samples. To further annotate candidate somatic mutations, we used Funcotator (FUNCtional annOTATOR)^[19] and generated mutation annotation format (MAF) files that included position, function, and sequencing data supporting the mutation status.

Somatic copy number alterations (CNAs) were evaluated using the CNVkit^[20] pipeline (v.0.9.10). The default log₂ threshold was applied to detect copy number gains or losses in target regions. Heatmaps of copy number alterations were obtained by loading the resulting files with segmented copy numbers into Integrative Genomics Viewer (IGV, v.2.15.9)^[21] for visualization. The GISTIC (Genomic Identification of Significant Targets in Cancer) 2.0 pipeline (v.2.0.23)^[22] was then applied to detect the significantly amplified and deleted regions with somatic CNAs with FDR (false discovery rate) < 0.20. A confidence level of 0.90 was set to determine significance. The GISTIC2.0 output files were visualized by the R package ggplot2.

Synonymous and nonsynonymous somatic SNVs were analyzed using the R package maftools to identify the type of point mutations in each tumor sample^[23]. The mutational signature contribution of each tumor sample was estimated using the R package deconstructSigs^[24], which accurately reconstructed the mutational profiles of tumor samples by identifying linear combinations of pre-defined features. This tool established the correspondence between the 96 mutation spectrum and the 30 mutational characteristics of the Catalog of Somatic Mutations in Cancer (COSMIC) database. The calculated weights were assigned to the mutational signatures, in which a higher weight indicates a more significant contribution.

Two computational methods, OncodriveCLUSTL^[25] and OncodriveFML^[26], were used to detect potential driver genes. The OncodriveCLUSTL algorithm uses a sequence-based clustering technique to identify linear clustering bias in the observed somatic mutations. OncodriveFML is a tool that detects genes under positive selection by analyzing the functional impact bias of the observed somatic mutations. Default values were used for all software parameters, and driver genes were identified according to the following criterion: genes with an FDR < 0.25 in both OncodriveCLUSTL and OncodriveFML were considered as driver genes. MutSigCV^[27] was used to perform convolution tests to identify significantly mutated genes (SMGs). This software comprehensively analyzes somatic SNVs and InDels to obtain SMGs whose mutation rate is significantly higher than the background mutation rate. Genes with FDR < 0.20 were considered as SMGs.

RNA-seq raw reads that passed the Illumina RTA quality filter were first preprocessed with Trim Galore (v.0.6.10) to remove adapter sequences and for base quality control. Then, STAR software (v.2.7.10b) was used to align the remaining RNA reads to the NCBI human reference genome (GRCh38). Finally, the number of reads aligning to each gene in the mapping results was calculated as Fragments Per Kilobase per Million (FPKM) values using RSEM software (v.1.3.3).

HLA Typing and Neoantigen Detection

HLAscan tool (v.2.1.2)^[28] was used to determine HLA types across the patients’ whole-exome sequences by aligning reads to HLA sequences from the international ImMunoGeneTics project/human leukocyte antigen (IMGT/HLA) database. Neoantigen analysis was performed using the NeoPredPipe pipeline^[29], which integrates ANNOVAR and netMHCpan to process neoantigens predicted from multi-region Variant Call Format (VCF) files. ANNOVAR correctly annotates variants from VCF files to identify non-synonymous variants, generating peptide sequences based on variant bases. Before executing netMHCpan, HLA haplotypes were cross-referenced with available HLA haplotypes, and epitopes of 8–11 mer length (known to be likely for peptides presented by human MHC class I molecules) were specified to make predictions. NetMHCpan 4.1^[30] was used to detect the binding affinity strength of each mutant peptide to sample-specific HLA alleles to identify exome-derived neoantigens. Finally, the results were filtered according to half-maximal inhibitory concentration (IC50) and rank value (IC50 < 500 nmol/L and %Rank < 2%). A neoantigen was considered expressed if a mutated gene Tumor-FPKM ≥ 0.5 when RNA-seq was available.

Visualization and Statistical Analysis

All graphical analyses were performed in the R statistical environment (v. 4.2.2). P-value calculation methods and multiple testing corrections are reported in the text. Because of the limited sample size, all analyses were conducted at a two-sided significance level of 0.2 (exceptional cases stated otherwise). Linear correlations were assessed using Pearson’s correlation coefficient.

DISCUSSION

There is limited data on genomic alteration profiles and neoantigens of CSCC in Chinese patients. In this study, we used WES to analyze the somatic mutational landscape in a cohort of patients with CSCC (30 samples). Analysis of somatic non-synonymous mutations was used to identify potential neoantigens that can serve as new targets for CSCC immunotherapy. RNA-seq data was used to examine the expression of candidate neoantigens.

A total of 6,232 somatic mutations were identified in 30 CSCC samples, with an average of 207.73 mutations per sample, which is slightly lower than the average of 225.65 mutations per sample in TCGA database for CC^[6]. Analysis of nonsilent mutations showed a mean mutation burden of 2.73/Mb, which was slightly higher than TCGA mutational burden of 2.53/Mb (excluding hypermutated tumors)^[6]. Chung^[34] et al. reported that cervical adenocarcinoma has a lower mutational burden than CSCC, which may explain the difference in mutational burden between this study and TCGA database. CNA analysis in CSCC identified 6,766 CNAs (225.87 per sample), which is considerably higher than the number reported in TCGA database for CC^[6]. GISTIC2.0 analysis revealed 19 amplification peaks and 45 deletion peaks. Furthermore, 15 (50%) patients had high-level copy number amplification at 3q27.1, which was consistent with findings in lung squamous cell carcinoma and esophageal squamous cell carcinoma^[47,48]. ALG3, which is one of the genes covered by this region, helps tumor cells generate high mannose N-linked glycans. Aberrant expression of high-mannose N-linked glycans is associated with cancer progression^[49,50,51]. ALG3 is significantly overexpressed in radioresistant breast cancer tissues and promotes radioresistance and cancer stemness by inducing the glycosylation of TGF-β receptor II (TGFBR2)^[50]. Although ALG3 is an effective therapeutic target in breast cancer patients with high ALG3 levels^[50], whether it promotes the development of CSCC remains to be determined. HTR3C, another gene in the 3q27.1 region, is a biomarker for predicting lung cancer prognosis^[52]. A risk model that includes HTR3E together with 13 other central immune-related genes (CBLC, TNF, PSMC4, TRAV30, PDIA3, FGF8, PDGFRA, ESRRA, SBDS, CRHR1, LTA, NR2F1, TNFRSF18) was used to predict the prognosis of endometrial carcinoma^[53]. MIR1224 acts as a tumor suppressor in the occurrence and development of cancers and can be used as a tumor biomarker for early diagnosis and prognosis prediction^[54]. UGT2B28 (located at 4q13.2) is a predictor of progression in prostate cancer and can be therapeutically targeted by using a combination of AR/EGFR inhibitors^[55].

Analysis of CSCC samples identified three mutational signatures corresponding to signatures 2, 6, and 7 in the COSMIC database (Figure 4C). These are slightly different from those found in the CESC dataset in TCGA database, specifically signature 2 (72.5%) and 6 (13.2%). The APOBEC family of proteins specifically catalyze the conversion of cytosine in the genome to uracil, which is related to base excision repair and DNA replication mechanisms^[56]. It can be activated by HPV virus infection and is involved in the immune response^[56]. APOBEC, which is closely related to cervical carcinogenesis, is the source of signature 2 and signature 13 in human cancers^[6,57]. These studies suggest that APOBEC causes CSCC mutations under the control of HPV. Additionally, signature 6 is a novel early warning biomarker for CC associated with the deficiency of DNA mismatch repair.

This study identified three genes (PIK3CA, FBXW7, and BICRA) predicted to act as driver genes that could potentially promote tumor formation and development. BICRA, a component of the SWI/SNF chromatin remodeling complex, was identified as a novel driver gene of CSCC. However, we did not find published evidence supporting that this gene is associated with cancer risk or development. We observed eight missense mutations in PIK3CA (26.67%), which is consistent with results reported previously (26% in Cancer Genome Atlas Research Network^[6], 16.7% in Huang et al.^[16]). The results further indicated no significant difference in the PIK3CA mutation rate between cervical adenocarcinoma and CSCC. We identified two SMGs that may be associated with CSCC: RAMP2 and FOSL2. RAMP2 downregulation may promote distant metastasis of cancers and is associated with a low survival rate in oral squamous cell carcinoma^[58,59], whereas FOSL2 is closely related to the occurrence of ovarian cancer, lung cancer, and breast cancer^[60,61,62]. The results indicate that these two genes may play crucial roles in the occurrence and development of tumors. However, further studies are needed to validate these findings and to develop these factors as potential biomarkers for CSCC.

Antigen presentation plays a crucial role in the human immune response to cancer. Immunotherapies for cancer are often based on targeting antigens presented by major histocompatibility complex/HLA molecules^[63]. Significant advances have been made in immunotherapy strategies for the treatment of solid tumors (such as breast cancer, prostate cancer, and non-small cell lung cancer). However, the currently available approaches are not sufficient to cure CC. Neoantigens are optimal targets for immunotherapy and are up-and-coming therapeutic options. We identified 4960 neoantigens in this study and found that the number of neoantigens was positively correlated with the number of somatic non-synonymous mutations, whereas it showed no obvious correlation with clinical stage. HDAC6 and DPP10 are two potential neoantigen genes that are detected in the early stage of CSCC. HDAC6 is a unique HDAC family member that regulates the Ras/MAPK/ERK, PI3K/Akt, and Wnt signaling pathways, which are associated with cellular proliferation and are activated in most tumors^[64]. Liu et al. suggested that DPP10 inhibits colon cancer stem cell proliferation by regulating microRNAs such as miR-127-3p^[65]. A previous study demonstrated that DPP10 methylation levels are significantly correlated with cervical neoplasia progression^[66]. We found that almost every neoantigen was present in one sample, further highlighting the difficulty in ubiquitous neoantigen identification in CSCC. We used RNA-seq data to examine the expression of neoantigens and found that five potential neoantigen genes (including PIK3CA, MUC16, USP28, CIC, and CDKL5) were expressed. Further extended clinical studies are needed to determine whether these genes are of value for CSCC immunotherapy. The validation with two available neoantigen-related public databases (TSNAdb and CTdatabase) identified 114 neoantigens involving 27 genes that may serve as candidate targets for neoantigen vaccines.

CONCLUSION

In this study, the comprehensive genomic characteristics of CSCC were determined using WES data. WES and RNA-seq data were used to narrow the scope of neoantigens for individualized immunotherapy in CSCC. Further experimental verification is needed to obtain effective neoantigens.

Reference (66)

[1]	Sung H, Ferlay J, Siegel RL, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin, 2021; 71, 209−49.
[2]	Singh D, Vignat J, Lorenzoni V, et al. Global estimates of incidence and mortality of cervical cancer in 2020: a baseline analysis of the WHO global cervical cancer elimination initiative. Lancet Glob Health, 2023; 11, e197−206.
[3]	Xia CF, Dong XS, Li H, et al. Cancer statistics in China and United States, 2022: profiles, trends, and determinants. Chin Med J (Engl), 2022; 135, 584−90.
[4]	Cohen PA, Jhingran A, Oaknin A, et al. Cervical cancer. Lancet, 2019; 393, 169−82.
[5]	Wu X, Peng L, Zhang YO, et al. Identification of key genes and pathways in cervical cancer by bioinformatics analysis. Int J Med Sci, 2019; 16, 800−12.
[6]	The Cancer Genome Atlas Research Network. Integrated genomic and molecular characterization of cervical cancer. Nature, 2017; 543, 378−84.
[7]	Lei JY, Ploner A, Elfström KM, et al. HPV vaccination and the risk of invasive cervical cancer. N Engl J Med, 2020; 383, 1340−8.
[8]	Jin X, Liu ZR, Yang DX, et al. Recent progress and future perspectives of immunotherapy in advanced gastric cancer. Front Immunol, 2022; 13, 948647.
[9]	Fournel L, Wu ZR, Stadler N, et al. Cisplatin increases PD-L1 expression and optimizes immune check-point blockade in non-small cell lung cancer. Cancer Lett, 2019; 464, 5−14.
[10]	Solis RN, Silverman DA, Birkeland AC. Current trends in precision medicine and next-generation sequencing in head and neck cancer. Curr Treat Options Oncol, 2022; 23, 254−67.
[11]	Monk BJ, Enomoto T, Kast WM, et al. Integration of immunotherapy into treatment of cervical cancer: recent data and ongoing trials. Cancer Treat Rev, 2022; 106, 102385.
[12]	Wang RJ, Pan W, Jin L, et al. Human papillomavirus vaccine against cervical cancer: opportunity and challenge. Cancer Lett, 2020; 471, 88−102.
[13]	Ferrall L, Lin KY, Roden RBS, et al. Cervical cancer immunotherapy: facts and hopes. Clin Cancer Res, 2021; 27, 4953−73.
[14]	Peng M, Mo YZ, Wang YA, et al. Neoantigen vaccine: an emerging tumor immunotherapy. Mol Cancer, 2019; 18, 128.
[15]	Lang F, Schrörs B, Löwer M, et al. Identification of neoantigens for individualized therapeutic cancer vaccines. Nat Rev Drug Discov, 2022; 21, 261−82.
[16]	Huang J, Qian ZY, Gong YH, et al. Comprehensive genomic variation profiling of cervical intraepithelial neoplasia and cervical cancer identifies potential targets for cervical cancer early warning. J Med Genet, 2019; 56, 186−94.
[17]	Li H. (2013) Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv: 1303.3997v2 [q-bio. GN].
[18]	McKenna A, Hanna M, Banks E, et al. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res, 2010; 20, 1297−303.
[19]	McKenna A, Hanna M, Banks E, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res, 2010; 20(9), 1297-1303.
[20]	Talevich E, Shain AH, Botton T, et al. CNVkit: genome-wide copy number detection and visualization from targeted DNA sequencing. PLoS Comput Biol, 2016; 12, e1004873.
[21]	Robinson JT, Thorvaldsdóttir H, Wenger AM, et al. Variant review with the integrative genomics viewer. Cancer Res, 2017; 77, e31−4.
[22]	Mermel CH, Schumacher SE, Hill B, et al. GISTIC2.0 facilitates sensitive and confident localization of the targets of focal somatic copy-number alteration in human cancers. Genome Biol, 2011; 12, R41.
[23]	Mayakonda A, Lin DC, Assenov Y, et al. Maftools: efficient and comprehensive analysis of somatic variants in cancer. Genome Res, 2018; 28, 1747−56.
[24]	Rosenthal R, McGranahan N, Herrero J, et al. deconstructSigs: delineating mutational processes in single tumors distinguishes DNA repair deficiencies and patterns of carcinoma evolution. Genome Biol, 2016; 17, 31.
[25]	Arnedo-Pac C, Mularoni L, Muiños F, et al. OncodriveCLUSTL: a sequence-based clustering method to identify cancer drivers. Bioinformatics, 2019; 35, 4788−90.
[26]	Mularoni L, Sabarinathan R, Deu-Pons J, et al. OncodriveFML: a general framework to identify coding and non-coding regions with cancer driver mutations. Genome Biol, 2016; 17, 128.
[27]	Lawrence MS, Stojanov P, Polak P, et al. Mutational heterogeneity in cancer and the search for new cancer-associated genes. Nature, 2013; 499, 214−8.
[28]	Ka S, Lee S, Hong J, et al. HLAscan: genotyping of the HLA region using next-generation sequencing data. BMC Bioinformatics, 2017; 18, 258.
[29]	Schenck RO, Lakatos E, Gatenbee C, et al. NeoPredPipe: high-throughput neoantigen prediction and recognition potential pipeline. BMC Bioinformatics, 2019; 20, 264.
[30]	Reynisson B, Alvarez B, Paul S, et al. NetMHCpan-4.1 and NetMHCIIpan-4.0: improved predictions of MHC antigen presentation by concurrent motif deconvolution and integration of MS MHC eluted ligand data. Nucleic Acids Res, 2020; 48, W449−54.
[31]	Ojesina AI, Lichtenstein L, Freeman SS, et al. Landscape of genomic alterations in cervical carcinomas. Nature, 2014; 506, 371−5.
[32]	Niyazi M, Han LL, Husaiyin S, et al. Analysis of somatic mutations and key driving factors of cervical cancer progression. Open Med (Wars), 2023; 18, 20230759.
[33]	Xu YX, Luo H, Hu QC, et al. Identification of potential driver genes based on multi-genomic data in cervical cancer. Front Genet, 2021; 12, 598304.
[34]	Chung TKH, Van Hummelen P, Chan PKS, et al. Genomic aberrations in cervical adenocarcinomas in Hong Kong Chinese women. Int J Cancer, 2015; 137, 776−83.
[35]	Bao CH, An N, Xie H, et al. Identifying potential neoantigens for cervical cancer immunotherapy using comprehensive genomic variation profiling of cervical intraepithelial neoplasia and cervical cancer. Front Oncol, 2021; 11, 672386.
[36]	Alexandrov LB, Nik-Zainal S, Wedge DC, et al. Signatures of mutational processes in human cancer. Nature, 2013; 500, 415−21.
[37]	Tate JG, Bamford S, Jubb HC, et al. COSMIC: the catalogue of somatic mutations in cancer. Nucleic Acids Res, 2019; 47, D941−7.
[38]	Chen YP, Zhang Y, Lv JW, et al. Genomic analysis of tumor microenvironment immune types across 14 solid cancer types: immunotherapeutic implications. Theranostics, 2017; 7, 3585−94.
[39]	Shen H, Guo M, Wang L, et al. MUC16 facilitates cervical cancer progression via JAK2/STAT3 phosphorylation-mediated cyclooxygenase-2 expression. Genes Genomics, 2020; 42, 127−33.
[40]	Bhattacharya S, Dunn P, Thomas CG, et al. ImmPort, toward repurposing of open access immunological assay data for translational and clinical research. Sci Data, 2018; 5, 180015.
[41]	Lin M, Zhang XL, You R, et al. Neoantigen landscape in metastatic nasopharyngeal carcinoma. Theranostics, 2021; 11, 6427−44.
[42]	Wu JC, Zhao WY, Zhou BB, et al. TSNAdb: a database for tumor-specific neoantigens from immunogenomics data analysis. Genomics Proteomics Bioinformatics, 2018; 16, 276−82.
[43]	Almeida LG, Sakabe NJ, deOliveira AR, et al. CTdatabase: a knowledge-base of high-throughput and curated data on cancer-testis antigens. Nucleic Acids Res, 2009; 37, D816−9.
[44]	Zhou Y, Zhang YT, Lian XC, et al. Therapeutic target database update 2022: facilitating drug discovery with enriched comparative data of targeted agents. Nucleic Acids Res, 2022; 50, D1398−407.
[45]	Southan C, Sharman JL, Benson HE, et al. The IUPHAR/BPS guide to PHARMACOLOGY in 2016: towards curated quantitative interactions between 1300 protein targets and 6000 ligands. Nucleic Acids Res, 2016; 44, D1054−68.
[46]	Mullard A. 2017 FDA drug approvals. Nat Rev Drug Discov, 2018; 17, 81−5.
[47]	Wang YZ, Wu WB, Zhu M, et al. Integrating expression-related SNPs into genome-wide gene- and pathway-based analyses identified novel lung cancer susceptibility genes. Int J Cancer, 2018; 142, 1602−10.
[48]	Chen ZM, Yao NH, Zhang S, et al. Identification of critical radioresistance genes in esophageal squamous cell carcinoma by whole-exome sequencing. Ann Transl Med, 2020; 8, 998.
[49]	Cui XY, Pei XS, Wang H, et al. ALG3 promotes peritoneal metastasis of ovarian cancer through increasing interaction of α1, 3-mannosylated uPAR and ADAM8. Cells, 2022; 11, 3141.
[50]	Sun XQ, He ZY, Guo L, et al. ALG3 contributes to stemness and radioresistance through regulating glycosylation of TGF-β receptor II in breast cancer. J Exp Clin Cancer Res, 2021; 40, 149.
[51]	Ke SB, Qiu H, Chen JM, et al. ALG3 contributes to the malignancy of non-small cell lung cancer and is negatively regulated by MiR-98-5p. Pathol Res Pract, 2020; 216, 152761.
[52]	Chen JR, Huang MS, Lee YC, et al. Potential clinical value of 5-hydroxytryptamine receptor 3C as a prognostic biomarker for lung cancer. J Oncol, 2021; 2021, 1901191.
[53]	Zhou HY, Zhang CF, Li HR, et al. A novel risk score system of immune genes associated with prognosis in endometrial cancer. Cancer Cell Int, 2020; 20, 240.
[54]	Ma MW, Li J, Zhang ZM, et al. The role and mechanism of microRNA-1224 in human cancer. Front Oncol, 2022; 12, 858892.
[55]	Lacombe L, Hovington H, Brisson H, et al. UGT2B28 accelerates prostate cancer progression through stabilization of the endocytic adaptor protein HIP1 regulating AR and EGFR pathways. Cancer Lett, 2023; 553, 215994.
[56]	Henderson S, Chakravarthy A, Su XP, et al. APOBEC-mediated cytosine deamination links PIK3CA helical domain mutations to human papillomavirus-driven tumor development. Cell Rep, 2014; 7, 1833−41.
[57]	Roberts SA, Lawrence MS, Klimczak LJ, et al. An APOBEC cytidine deaminase mutagenesis pattern is widespread in human cancers. Nat Genet, 2013; 45, 970−6.
[58]	Tanaka M, Koyama T, Sakurai T, et al. The endothelial adrenomedullin-RAMP2 system regulates vascular integrity and suppresses tumour metastasis. Cardiovasc Res, 2016; 111, 398−409.
[59]	de Paula Souza DPS, Dos Reis Pereira Queiroz L, de Souza MG, et al. Identification of potential biomarkers and survival analysis for oral squamous cell carcinoma: a transcriptomic study. Oral Dis, 2023; 29, 2658−66.
[60]	Li J, Zhou L, Jiang HY, et al. Inhibition of FOSL2 aggravates the apoptosis of ovarian cancer cells by promoting the formation of inflammasomes. Genes Genomics, 2022; 44, 29−38.
[61]	Xu P, Wang L, Xie X, et al. Hsa_circ_0001869 promotes NSCLC progression via sponging miR-638 and enhancing FOSL2 expression. Aging (Albany NY), 2020; 12, 23836−48.
[62]	Wan XY, Guan SD, Hou YX, et al. FOSL2 promotes VEGF-independent angiogenesis by transcriptionnally activating Wnt5a in breast cancer-associated fibroblasts. Theranostics, 2021; 11, 4975−91.
[63]	Neefjes J, Jongsma MLM, Paul P, et al. Towards a systems understanding of MHC class I and MHC class II antigen presentation. Nat Rev Immunol, 2011; 11, 823−36.
[64]	Kaur S, Rajoria P, Chopra M. HDAC6: a unique HDAC family member as a cancer target. Cell Oncol (Dordr), 2022; 45, 779−829.
[65]	Liu G, Zhao H, Song Q, et al. Long non-coding RNA DPP10-AS1 exerts anti-tumor effects on colon cancer via the upregulation of ADCY1 by regulating microRNA-127-3p. Aging (Albany NY), 2021; 13, 9748−65.
[66]	El-Zein M, Cheishvili D, Gotlieb W, et al. Genome-wide DNA methylation profiling identifies two novel genes in cervical neoplasia. Int J Cancer, 2020; 147, 1264−74.

Gene	Indels	SNVs	Tot Muts^*	Sample affect	Sample percent (%)	FDR CT^‡			Present in CGC	Reported in previous research
Gene	Indels	SNVs	Tot Muts^*	Sample affect	Sample percent (%)	Predicted by OncodriveCLUSTL	Predicted by OncodriveFML	Predicted by MutSigCV	Present in CGC	Reported in previous research
K3CAPI	0	8	8	8	26.67	0.048	0.181	0.836	Yes	Yes
BICRA	0	3	3	3	10.00	0.181	0.064	1	No	No
FBXW7	1	4	5	5	16.67	0.181	0.165	1	Yes	Yes
Note. ^*Tot Muts denotes the total mutations occurred in certain genes. ^‡FDR CT denotes corrected P value. FDR, false discovery rate; SNVs, single-nucleotide variants; InDels, insertions and deletions; CGC, Cancer Gene Census.

Sample-ID	Number of nonsynomous mutations	Number of neoantigens	HLA-A*	HLA-B*	HLA-C*	Sequencing strategies	Stage
SCCP01T	66	52	02:07/30:01	－	01:02/12:02	WES/RNA-seq	IIIC1
SCCP02T	72	86	24:02/33:03	58:01	01:02/03:02	WES	IIA2
SCCP03T	220	258	02:01/03:01	35:08/44:02	05:03	WES/RNA-seq	IIA
SCCP04T	67	97	03:01/31:01	51:02	12:02/15:02	WES/RNA-seq	IIB
SCCP05T	96	119	02:01/11:01	51:01/51:02	08:01/15:02	WES/RNA-seq	IB2
SCCP06T	182	311	02:01/02:06	51:01	03:03/15:02	WES/RNA-seq	IB2
SCCP07T	264	163	03:01	35:01	04:01	WES/RNA-seq	IB3
SCCP08T	50	76	11:01/30:01	13:02	03:04/06:02	WES/RNA-seq	IB2
SCCP09T	193	333	11:01/24:02	13:01/15:01	03:03/03:04	WES/RNA-seq	IIA
SCCP10T	98	151	02:01/02:03	13:01/48:01	03:04/08:03	WES/RNA-seq	IB2
SCCP11T	102	115	02:07/31:01	40:01	01:02/03:04	WES/RNA-seq	IIIA
SCCP12T	77	124	02:01/30:01	15:02/44:03	08:01	WES/RNA-seq	IB1
SCCP13T	106	128	02:01/02:07	40:01/54:01	01:02/03:04	WES/RNA-seq	IB3
SCCP14T	79	109	02:06/11:01	40:01/40:06	01:02/08:01	WES/RNA-seq	IIA1
SCCP15T	71	51	01:01/11:01	37:01	06:02/07:02	WES/RNA-seq	IIA1
SCCP16T	218	290	02:01/02:06	15:11/35:01	03:03	WES	IB2
SCCP17T	188	271	02:06/24:02	15:11/51:01	03:03/14:02	WES/RNA-seq	IIIC1
SCCP18T	99	128	02:03/03:01	27:07/40:01	07:02/15:02	WES/RNA-seq	IB3
SCCP19T	361	487	01:01/02:06	07:02/51:01	07:02/14:02	WES/RNA-seq	IIA1
SCCP20T	72	67	03:01	07:02/37:01	06:02/07:02	WES/RNA-seq	IIIC1
SCCP21T	79	121	11:01/24:02	15:02/27:07	08:01/15:02	WES/RNA-seq	IB2
SCCP22T	68	53	02:01/03:01	13:02/	03:03/06:02	WES/RNA-seq	IIB
SCCP23T	72	90	01:01/11:01	35:03/37:01	06:02/12:03	WES/RNA-seq	IIIC1
SCCP24T	83	66	02:01	15:11	03:03/08:01	WES/RNA-seq	IIIC1
SCCP25T	378	324	01:01/30:01	13:02/54:01	01:02/06:02	WES/RNA-seq	IIA1
SCCP26T	180	118	02:07/33:03	37:01/46:01	01:02/06:02	WES/RNA-seq	IIA1
SCCP27T	121	186	11:01/30:01	13:02/14:01	06:02/08:02	WES/RNA-seq	IB2
SCCP28T	204	392	11:01/11:12	15:02/35:03	08:01/12:03	WES/RNA-seq	IIA2
SCCP29T	167	150	11:01	13:01/13:02	03:04/06:02	WES/RNA-seq	IIA
SCCP30T	58	44	02:01/32:01	13:01	03:04/12:02	WES	IB1
Note. HLA, human leukocyte antigen. *List separator.

Sample	Protein	Mutation AA	HLA types	Identity	Length (AA)	%Rank	Affinity (nmol/L)	Drugs
SCCP14T	CADPS	R737W	HLA-A02:06	YLRDLLEWA	9	0.502	25.53	−
SCCP14T	CADPS	R737W	HLA-B40:01	LEWAENGAM	9	0.436	52.67	−
SCCP14T	CADPS	R737W	HLA-B40:01	LEWAENGAMI	10	1.173	237.85	−
SCCP24T	CADPS	R737W	HLA-A02:01	YLRDLLEWA	9	0.407	24.09	−
SCCP13T	DENND5B	D339H	HLA-A02:01	FLHAPVPYL	9	0.007	3.86	−
SCCP13T	DENND5B	D339H	HLA-A02:01	SLLHFLHAPV	10	0.526	5.35	−
SCCP13T	DENND5B	D339H	HLA-A02:01	FLHAPVPYLM	10	0.469	13.82	−
SCCP13T	DENND5B	D339H	HLA-C03:04	FLHAPVPYL	9	0.133	34.66	−
SCCP13T	DENND5B	D339H	HLA-A02:01	HFLHAPVPYL	10	1.087	44.79	−
SCCP13T	DENND5B	D339H	HLA-B54:01	LPASLLHFLHA	11	0.194	52.16	−
SCCP13T	DENND5B	D339H	HLA-A02:07	FLHAPVPYL	9	0.014	256.91	−
SCCP13T	DENND5B	D339H	HLA-A02:01	LHFLHAPVPYL	11	1.597	272.86	−
SCCP13T	DENND5B	D339H	HLA-C01:02	HAPVPYLMGL	10	0.187	380.05	−
SCCP13T	DENND5B	D339H	HLA-C01:02	FLHAPVPYL	9	0.129	391.67	−
SCCP13T	DENND5B	D339H	HLA-A02:01	SLLHFLHAP	9	1.536	453.38	−
SCCP07T	MAPK1	E322K	HLA-C04:01	YYDPSDKPI	9	0.017	389.35	BVD-523, ASTX029, HH2710
SCCP01T	MAPK1	R135K	HLA-A30:01	KGLKYIHSA	9	1.615	482.21	BVD-523, ASTX029, HH2711
SCCP06T	MUC16	A4577T	HLA-A02:01	SMGDTLASI	9	0.265	30.46	Oregovomab, Abagovomab
SCCP06T	MUC16	A4577T	HLA-A02:06	SMGDTLASI	9	0.523	58.48	Oregovomab, Abagovomab
SCCP06T	MUC16	A4577T	HLA-A02:01	SMGDTLASISI	11	1.565	324.65	Oregovomab, Abagovomab
SCCP06T	MUC16	A4577T	HLA-C15:02	SSMGDTLASI	10	1.661	353.86	Oregovomab, Abagovomab
SCCP07T	PIK3CA	E542K	HLA-A03:01	AISTRDPLSK	10	0.14	53.93	Alpelisib, BAY 80-6946
SCCP07T	PIK3CA	E542K	HLA-A03:01	KAISTRDPLSK	11	0.299	469.89	Alpelisib, BAY 80-6947
SCCP09T	PIK3CA	E542K	HLA-A11:01	AISTRDPLSK	10	0.312	99.32	Alpelisib, BAY 80-6948
SCCP09T	PIK3CA	E542K	HLA-A11:01	KAISTRDPLSK	11	0.273	256.36	Alpelisib, BAY 80-6949
SCCP09T	PIK3CA	E542K	HLA-A11:01	ISTRDPLSK	9	0.629	333.21	Alpelisib, BAY 80-6950
SCCP21T	PIK3CA	E542K	HLA-A11:01	AISTRDPLSK	10	0.312	99.32	Alpelisib, BAY 80-6951
SCCP21T	PIK3CA	E542K	HLA-A11:01	KAISTRDPLSK	11	0.273	256.36	Alpelisib, BAY 80-6952
SCCP21T	PIK3CA	E542K	HLA-A11:01	ISTRDPLSK	9	0.629	333.21	Alpelisib, BAY 80-6953
SCCP21T	PIK3CA	E542K	HLA-C15:02	STRDPLSKI	9	0.135	453.03	Alpelisib, BAY 80-6954
SCCP01T	PIK3CA	E545K	HLA-A30:01	STRDPLSEITK	11	0.01	38.96	Alpelisib, BAY 80-6955
SCCP02T	PIK3CA	E545K	HLA-B58:01	ITKQEKDFLW	10	0.049	13.03	Alpelisib, BAY 80-6956
SCCP02T	PIK3CA	E545K	HLA-B58:01	EITKQEKDFLW	11	1.03	391.32	Alpelisib, BAY 80-6957
SCCP14T	PIK3CA	E545K	HLA-A11:01	STRDPLSEITK	11	0.053	91.72	Alpelisib, BAY 80-6958
SCCP28T	PIK3CA	E545K	HLA-A11:12	STRDPLSEITK	11	0.053	91.72	Alpelisib, BAY 80-6959
SCCP28T	PIK3CA	E545K	HLA-A11:01	STRDPLSEITK	11	0.053	91.72	Alpelisib, BAY 80-6960
SCCP29T	PIK3CA	E545K	HLA-A11:01	STRDPLSEITK	11	0.053	91.72	Alpelisib, BAY 80-6961
SCCP03T	SLC26A3	V340I	HLA-C05:03	VGDCFDIAM	9	0.841	391.96	−
SCCP19T	VCPIP1	D434N	HLA-A02:06	GIHPSLVANV	10	0.749	171.9	−
SCCP19T	VCPIP1	D434N	HLA-A01:01	VANVHQYFY	9	0.173	244.29	−
SCCP19T	VCPIP1	D434N	HLA-C14:02	LVANVHQYF	9	1.247	361.85	−
SCCP19T	VCPIP1	D434N	HLA-A01:01	LVANVHQYFY	10	0.643	448.15	−
SCCP28T	ZBED4	E664K	HLA-B15:02	KMIALDLQPY	10	1.429	89.83	−
SCCP28T	ZBED4	E664K	HLA-C12:03	IAKMIALDL	9	0.66	132.87	−
SCCP28T	ZBED4	E664K	HLA-A11:12	VAKKITSLIAK	11	1.798	386.39	−
SCCP28T	ZBED4	E664K	HLA-A11:01	VAKKITSLIAK	11	1.798	386.39	−
*Note.* AA, amino acids. HLA, human leukocyte antigen.

Identifying Comprehensive Genomic Alterations and Potential Neoantigens for Cervical Cancer Immunotherapy in a Cohort of Chinese Squamous Cell Carcinoma of the Cervix

doi: 10.3967/bes2024.064

Abstract

References

Proportional views

通讯作者: 陈斌, bchen63@163.com

Article Metrics

Proportional views

Related

Identifying Comprehensive Genomic Alterations and Potential Neoantigens for Cervical Cancer Immunotherapy in a Cohort of Chinese Squamous Cell Carcinoma of the Cervix

doi: 10.3967/bes2024.064

Corresponding author: Zhe Zhang, PhD, Tel: 86-18810999596, E-mail: zhangzhe301@126.com; Yuanguang Meng, Professor, PhD, Tel: 86-13501093681, E-mail: meng6512@vip.sina.com