Article Text
Abstract
Background There is an urgent need for a vaccine with efficacy against SARS-CoV-2. We hypothesize that peptide vaccines containing epitope regions optimized for concurrent B cell, CD4+ T cell, and CD8+ T cell stimulation would drive both humoral and cellular immunity with high specificity, potentially avoiding undesired effects such as antibody-dependent enhancement (ADE) (figure 1). Leveraging methods initially developed for prediction of tumor-specific antigen targets, we combine computational prediction of T cell epitopes, recently published B cell epitope mapping studies, and epitope accessibility to select candidate peptide vaccines for SARS-CoV-2 (figure 2).
Methods SARS-CoV-2 HLA-I and HLA-II ligands were predicted using multiple MHC binding prediction software. T cell vaccine candidates were further refined by predicted immunogenicity, viral source protein abundance, sequence conservation, coverage of high frequency HLA alleles, and co-localization of CD4+/CD8+ T cell epitopes. B cell epitope regions were chosen from linear epitope mapping studies of convalescent patient serum, filtering to select regions with surface accessibility, high sequence conservation, spatial localization near functional domains of the spike glycoprotein, and avoidance of glycosylation sites. Using murine compatible T/B cell epitopes, vaccine studies were performed with downstream ELISA/ELISpot to monitor immunogenicity.
Results We observed distribution of HLA-I (n = 2486) and -II (n = 3138) ligands evenly across the SARS-CoV-2 proteome, with significant overlap between predicted human and murine ligands (figure 3). Applying a multivariable immunogenicity model trained from IEDB viral tetramer data (AUC 0.7 and 0.9 for HLA-I and -II, respectively), alongside filters for entropy and protein expression resulted in 292 CD8+ and 616 CD4+ epitopes (figure 4). From an initial pool of 58 B cell epitope candidates, three epitope regions were identified (figure 5). Combining B cell and T cell analyses, alongside manufacturability heuristic, we propose a set of SARS-CoV-2 vaccine peptides for use in subsequent murine studies and clinical trials (figure 6). Preliminary murine studies demonstrate evidence of T and B cell activation (figure 7).
Summary of combination CD4+/CD8+ T cell and B cell SARS-CoV-2 peptide vaccine. Humoral immunity (blue dashed box) is targeted through B cell and HLA-II epitopes, aimed at viral neutralization while avoiding non-neutralizing and ADE promoting targets. Cellular immunity (red dashed box) is targeted through HLA-I and HLA-II epitopes, aimed to clear virally infected cells
Summary of B cell and CD4+/CD8+ epitope prediction workflows. Pathways are colored by B cell (blue), human T cell (black), and murine T cell (red) epitope prediction workflows. Color bars represent proportions of epitopes derived from internal proteins (ORF), nucleocapsid phosphoprotein, and surface-exposed proteins (spike, membrane, envelope)
Landscape of SARS-CoV-2 MHC ligands. (A&B) Selection criteria for (A) HLA-I and (B) HLA-II SARS-CoV-2 HLA ligand candidates. Scatterplot (bottom) shows predicted (x-axis) versus IEDB (y-axis) binding affinity, with horizontal line representing 500 nM IEDB binding affinity and vertical line representing corresponding predicted binding affinity for 90% specificity in binding prediction. Histogram (top) shows all predicted SARS-CoV-2 HLA ligand candidates. (C) Landscape of predicted HLA ligands, showing nested HLA ligands comprising HLA-I and -II ligands with complete overlap (top), and LOESS fitted curve (span = 0.1) for HLA-I/II ligands by location along the SARS-CoV2 proteome (bottom). Red track represents SARS epitopes identified in literature review with sequence identity in SARS-CoV-2. Predicted HLA ligands with conserved sequences to this literature set are represented in the lollipop plot with a red stick. (D) Summary of total number of predicted HLA-I/II ligands and nested HLA ligands. (E) Summary of nested HLA ligand coverage by protein, with raw counts (left) or counts normalized by protein length (right). (F) Summary of murine/human MHC ligand overlap. (G) Distribution of population frequencies among predicted HLA-I, -II, and nested HLA ligands
Prediction of SARS-CoV-2 T cell epitopes. (Top) Summary of predicted (left) and IEDB-defined (right) SARS-CoV-2 HLA ligands, showing proportions of each derivative protein. (Middle) Funnel plot representing counts of HLA-I (red text), HLA-II (blue text), and nested HLA (violet text) ligands along with proportions of HLA-I (top bar) and HLA-II (bottom bar) alleles at each filtering step. (Bottom) Summary of CD8+ (red, top), CD4+ (blue, bottom), and nested T cell epitopes (middle) after filtering criteria in S, M, and N proteins. Y-axis and size represent the population frequency of each CD8+ and CD4+ epitopes by circles. Middle track of diamonds represents overlaps between CD8+ and CD4+ epitopes, showing the overlap with greatest population frequency (size) for each region of overlap. Color of diamonds represents the proportion of overlap between CD4+ and CD8+ epitope sequences.
Selection of SARS-CoV-2 B cell epitope regions. (A) SARS-CoV-2 linear B cell epitopes curated from epitope mapping studies. X-axis represents amino acid position along the SARS-CoV-2 spike protein, with labeled start sites. (B) Schematic for filtering criteria of B cell epitope candidates. (C) Spike protein amino acid sequence, with overlay of selection features prior to filtering. Polymorphic residues are red, glycosites are blue, accessible regions highlighted in yellow. The receptor binding domain (RBD), fusion peptide (FP), and HR1/HR2 regions are outlined. (D) Spike protein functional regions (RBD, FP, HR1/2) amino acid sequences, with residues colored by how many times they occur in identified epitopes. Selected accessible sub-sequences of known antibody epitopes highlighted in purple outline. (E) S protein trimer crystal structure with glycosylation, with final linear epitope regions highlighted by color
T cell and B cell vaccine candidates. (A) 27mer vaccine peptide sets selecting for best CD4+, CD8+, CD4+/CD8+, and B cell epitopes with HLA-I, HLA-II, and total population coverage. (B) Unified list of all selected 27mer vaccine peptides. Vaccine peptides containing predicted ligands for murine MHC alleles (H2-b and H2-d haplotypes) are indicated in their respective columns
Immunogenicity of murine-compatible peptide vaccines. (A) ELISA result: peptides derived from three B cell vaccine candidate regions were coated on peptide capture plates, either in combination by overlapping core epitopes (1+2 and 3+4) or alone (5). (B) ELISpot results: splenocytes from animals vaccinated against predicted B cell epitopes (1–5) or measles peptide control (M; adapted from Obeid et al. 1995). Each point represents the average of technical triplicates, background subtracted against no-peptide control. (A&B) Colors represent adjuvant used for vaccination. P-values shown above each graph represent pair-wise Mann-Whitney u-test
Conclusions A peptide vaccine targeting B cells, CD4+ T cells, and CD8+ T cells in parallel may prove an important part of a multifaceted response to the COVID-19 pandemic. Adapting methods for predicting tumor-specific antigens, we presented a set of peptide candidates with high overlap for T and B cell epitopes and broad haplotype population coverage, with validation of immunogenicity in murine vaccine studies.
Acknowledgements The authors appreciate funding support from University of North Carolina University Cancer Research Fund (AR and BGV), the Susan G. Komen Foundation (BGV), the V Foundation for Cancer Research (BGV), and the National Institutes of Health (CCS, 1F30CA225136). We would like to thank members of the #DownWithTheCrown Slack channel for helpful discussion and feedback.
This is an open access article distributed in accordance with the Creative Commons Attribution 4.0 Unported (CC BY 4.0) license, which permits others to copy, redistribute, remix, transform and build upon this work for any purpose, provided the original work is properly cited, a link to the licence is given, and indication of whether changes were made. See: https://creativecommons.org/licenses/by/4.0/.