HLA-A*02:01 binding "ALNTPKDHI" at 1.58Å resolution
Data provenance
Information sections
- Publication
- Peptide details
- Peptide neighbours
- Binding cleft pockets
- Chain sequences
- Downloadable data
- Data license
- Footnotes
Complex type
HLA-A*02:01
ALNTPKDHI
Species
Locus / Allele group
The presentation of SARS-CoV-2 peptides by the common HLA-A∗02:01 molecule.
CD8+ T cells are crucial for anti-viral immunity; however, understanding T cell responses requires the identification of epitopes presented by human leukocyte antigens (HLA). To date, few SARS-CoV-2-specific CD8+ T cell epitopes have been described. Internal viral proteins are typically more conserved than surface proteins and are often the target of CD8+ T cells. Therefore, we have characterized eight peptides derived from the internal SARS-CoV-2 nucleocapsid protein predicted to bind HLA-A∗02:01, the most common HLA molecule in the global population. We determined not all peptides could form a complex with HLA-A∗02:01, and the six crystal structures determined revealed that some peptides adopted a mobile conformation. We therefore provide a molecular understanding of SARS-CoV-2 CD8+ T cell epitopes. Furthermore, we show that there is limited pre-existing CD8+ T cell response toward these epitopes in unexposed individuals. Together, these data show that SARS-CoV-2 nucleocapsid might not contain potent epitopes restricted to HLA-A∗02:01.
Structure deposition and release
Data provenance
Publication data retrieved from the PDBe REST API [8] and the PMCe REST API [9].
Other structures from this publication



Data provenance
MHC:peptide complexes are visualised using PyMol. The peptide is superimposed on a consistent cutaway slice of the MHC binding cleft (displayed as a grey mesh) which best indicates the binding pockets for the P1/P5/PC positions (side view - pockets A, E, F) and for the P2/P3/PC-2 positions (top view - pockets B, C, D). In some cases peptides will use a different pocket for a specific peptide position (atypical anchoring). On some structures the peptide may appear to sterically clash with a pocket. This is an artefact of picking a standardised slice of the cleft and overlaying the peptide.
Peptide neighbours
P1
ALA
PHE33
TYR159
TYR7
TYR171
GLU63
TRP167
TYR59
LYS66
MET5
|
P2
LEU
GLU63
HIS70
MET45
PHE9
LYS66
TYR159
TYR7
VAL67
TYR99
|
P3
ASN
LEU156
TYR159
TYR99
HIS70
LYS66
GLN155
|
P4
THR
LYS66
GLN155
HIS70
|
P5
PRO
TYR99
HIS70
HIS114
GLN155
ARG97
LEU156
|
P6
LYS
GLN155
TRP147
VAL152
|
P7
ASP
TRP147
VAL152
HIS114
TYR116
ARG97
THR73
ASP77
|
P8
HIS
THR73
THR143
ASP77
TRP147
LYS146
|
P9
ILE
LEU81
TYR123
LYS146
TYR116
THR143
THR80
ASP77
TYR84
TRP147
|
Colour key
Data provenance
Neighbours are calculated by finding residues with atoms within 5Å of each other using the Biopython NeighborSearch module. The list of neighbours is then sorted and filtered to include only neighbours between the peptide and the MHC Class I alpha chain.
Colours selected to match the YRB scheme. [https://www.frontiersin.org/articles/10.3389/fmolb.2015.00056/full]
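The neighbour calculation described above can be sketched in a few lines. This is a minimal, standard-library illustration of the 5Å cutoff idea, not the site's actual pipeline: it uses a brute-force distance check in place of Biopython's NeighborSearch, and the function name `find_neighbours`, the atom record layout, and the coordinates are all hypothetical.

```python
from collections import defaultdict
from math import dist

CUTOFF = 5.0  # distance cutoff in Å, as stated in the provenance note


def find_neighbours(peptide_atoms, mhc_atoms, cutoff=CUTOFF):
    """Map each peptide position to the sorted MHC residues near it.

    Both arguments are lists of (label, (x, y, z)) tuples, one per atom.
    This record layout is an assumption made for the example.
    """
    neighbours = defaultdict(set)
    for pep_label, pep_xyz in peptide_atoms:
        for mhc_label, mhc_xyz in mhc_atoms:
            # A residue counts as a neighbour if any of its atoms is in range.
            if dist(pep_xyz, mhc_xyz) <= cutoff:
                neighbours[pep_label].add(mhc_label)
    return {position: sorted(residues) for position, residues in neighbours.items()}


# Invented coordinates, for illustration only (not taken from the structure).
peptide = [("P1", (0.0, 0.0, 0.0)), ("P2", (10.0, 0.0, 0.0))]
alpha_chain = [
    ("TYR7", (3.0, 0.0, 0.0)),
    ("GLU63", (0.0, 4.0, 0.0)),
    ("GLU63", (11.0, 0.0, 0.0)),
    ("LEU81", (30.0, 0.0, 0.0)),
]

print(find_neighbours(peptide, alpha_chain))
```

Passing only peptide atoms and alpha chain atoms mirrors the filtering step: contacts within the peptide, or with beta 2 microglobulin, never enter the result. A spatial index such as a k-d tree would replace the nested loop on real structures.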


A Pocket
TYR159
THR163
TRP167
TYR171
MET5
TYR59
GLU63
LYS66
TYR7
|
B Pocket
ALA24
VAL34
MET45
GLU63
LYS66
VAL67
TYR7
HIS70
PHE9
TYR99
|
C Pocket
HIS70
THR73
HIS74
PHE9
ARG97
|
D Pocket
HIS114
GLN155
LEU156
TYR159
LEU160
TYR99
|
E Pocket
HIS114
TRP147
VAL152
LEU156
ARG97
|
F Pocket
TYR116
TYR123
THR143
LYS146
TRP147
ASP77
THR80
LEU81
TYR84
VAL95
|
Colour key
Data provenance
1. Beta 2 microglobulin
Beta 2 microglobulin
|
10 20 30 40 50 60
IQRTPKIQVYSRHPAENGKSNFLNCYVSGFHPSDIEVDLLKNGERIEKVEHSDLSFSKDW
70 80 90
SFYLLYYTEFTPTEKDEYACRVNHVTLSQPKIVKWDRDM
|
2. Class I alpha
HLA-A*02:01
IPD-IMGT/HLA
[ipd-imgt:HLA35266] |
10 20 30 40 50 60
GSHSMRYFFTSVSRPGRGEPRFIAVGYVDDTQFVRFDSDAASQRMEPRAPWIEQEGPEYW
70 80 90 100 110 120
DGETRKVKAHSQTHRVDLGTLRGYYNQSEAGSHTVQRMYGCDVGSDWRFLRGYHQYAYDG
130 140 150 160 170 180
KDYIALKEDLRSWTAADMAAQTTKHKWEAAHVAEQLRAYLEGTCVEWLRRYLENGKETLQ
190 200 210 220 230 240
RTDAPKTHMTHHAVSDHEATLRCWALSFYPAEITLTWQRDGEDQTQDTELVETRPAGDGT
250 260 270
FQKWVAVVVPSGQEQRYTCHVQHEGLPKPLTLRWEPSS
|
3. Peptide
|
ALNTPKDHI
|
Data provenance
Sequences are retrieved via the UniProt method of the RCSB REST API. Sequences are then compared to those derived from the PDB file and matched against sequences retrieved from the IPD-IMGT/HLA database for human sequences, or the IPD-MHC database for other species. Mouse sequences are matched against FASTA files from UniProt. Sequences for the mature extracellular protein (signal peptide and cytoplasmic tail removed) are compared to identical-length sequences from the data sources mentioned above using either exact matching or Levenshtein-distance-based matching.
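The matching step described above (exact match first, then Levenshtein distance against identical-length references) might look roughly like the following sketch. The function names, the `max_distance` threshold, and the toy reference names and sequences are assumptions made for illustration; the source does not state which threshold or tie-breaking rule is used.

```python
def levenshtein(a, b):
    """Edit distance between two strings (insertions, deletions, substitutions)."""
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]
        for j, char_b in enumerate(b, start=1):
            current.append(min(
                previous[j] + 1,                       # deletion
                current[j - 1] + 1,                    # insertion
                previous[j - 1] + (char_a != char_b),  # substitution or match
            ))
        previous = current
    return previous[-1]


def best_match(query, references, max_distance=2):
    """Return (name, distance) for the closest reference, or None.

    `references` maps a name to a sequence. `max_distance` is a hypothetical
    cutoff chosen for this example.
    """
    # Exact matching takes priority.
    for name, sequence in references.items():
        if sequence == query:
            return name, 0
    # Otherwise compare only against references of identical length.
    scored = [
        (levenshtein(query, sequence), name)
        for name, sequence in references.items()
        if len(sequence) == len(query)
    ]
    if not scored:
        return None
    distance, name = min(scored)
    return (name, distance) if distance <= max_distance else None


# Toy references with made-up names, not real database records.
toy_references = {"ref_x": "GSHSMRYY", "ref_y": "AAAAAAAA"}
print(best_match("GSHSMRYF", toy_references))
```

Restricting the fuzzy comparison to identical-length sequences keeps the distance a count of substitutions in practice, which is easier to interpret for near-identical alleles.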
Downloadable data
Components
Data license
Footnotes
- Protein Data Bank Europe - Coordinate Server
- 1HHK - HLA-A*02:01 binding LLFGYPVYV at 2.5Å resolution - PDB entry for 1HHK
- Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. - PyMol CEALIGN Method - Publication
- PyMol - PyMol.org/pymol
- Levenshtein distance - Wikipedia entry
- Protein Data Bank Europe REST API - Molecules endpoint
- 3Dmol.js: molecular visualization with WebGL - 3DMol.js - Publication
- Protein Data Bank Europe REST API - Publication endpoint
- PubMed Central Europe REST API - Articles endpoint

This work is licensed under a Creative Commons Attribution 4.0 International License.