← Back
Fetching drawings from USPTO…
The present invention relates to an endogenous tumor-derived circular ribonucleic acid (circ RNA) as well as one or more proteins expressed from said tumor-derived circ RNA. The invention further relates said tumor-derived circ RNA and the protein(s) expressed thereof for use vaccines in the prophylaxis and/or treatment of cancer.
SEQUENCE LISTING
The Sequence Listing submitted herewith, entitled “20181227_sequence-listing.txt”, created Jul. 2, 2020, and having a size of 218,587 bytes, is incorporated herein by reference.
TECHNICAL FIELD
The present invention relates to a tumor-derived circular ribonucleic acid (circRNA) as well as one or more proteins expressed from said tumor-derived circRNA for use as vaccine. The invention further relates said tumor-derived circRNA and the protein(s) expressed thereof for use as vaccine in the prophylaxis and/or treatment of cancer.
BACKGROUND OF THE INVENTION
Cancer is a generic term for a large group of diseases characterized by the growth of abnormal cells beyond their usual boundaries that can then invade adjoining parts of the body and/or spread to other organs [1]. Other common terms used are malignant tumors and neoplasms. Cancer can affect almost any part of the body and has many anatomic and molecular subtypes that each require specific management strategies.
Cancer is the second leading cause of death globally and accounted for 8.8 million deaths in 2015. Lung, prostate, colorectal, stomach and liver cancer are the most common types of cancer in men, while breast, colorectal, lung, cervix and stomach cancer are the most common among women. Cancer can be treated by surgery, chemotherapy, radiation therapy, hormonal therapy, targeted therapy (including immunotherapy such as monoclonal antibody therapy) and synthetic lethality. US$1.16 trillion is the estimated total annual economic cost of cancer in 2010. Hence, there is a need for more effective and cost-efficient pharmaceuticals for use in the prophylaxis and treatment of cancer.
The adaptive immune response, mediated by antibody-producing B cells and cytotoxic T cells, protects us from disease by the killing of invading pathogens and performs immune surveillance to eliminate tumors prior to metastasis. Antibody-based and T-cell based therapy has been used with success to treat a number of malignancies. However, it would be advantageous, both for patients and for public health, to develop prophylaxis, such as vaccines, against different cancers, and to activate the immune system against the cancer.
To avoid killing of “self” and induction of severe autoimmunity, cells of the adaptive immune system undergo a stringent selection process during their development, where self-reactive cells are eliminated. With few exceptions (mutations changing the structure or gene fusions), cancer cells express proteins that will be considered “self” to the immune system. Effectively, this means that the frequency of circulating B cells and T cells that have a potential to recognize cancer cells are exceedingly low. This is true even in the form of fusion proteins, and mutations, where both proteins retain much of their original structure. However, even though small changes induced by mutations they have been used to elicit an immune response against the tumor [2,3].
Messenger RNA (mRNA) based vaccines for the treatment of tumors are described in US2008171711. The use of mRNA and peptides as vaccines is safer than using DNA-based vaccines and gene therapeutics since there is no risk of RNA and peptides being integrated into the genome. However, the instability of mRNA or of RNA for example due to RNA-degrading enzymes is a problem in using RNA in pharmaceutical compositions.
Hence, there is a need for vaccine which are not degraded by RNA-degrading enzymes.
Personalized RNA and peptide vaccines for the prophylaxis of cancer have been reported in the art [2,3]. However, the reported personalized peptide vaccines were not fully effective, the time to develop treatment is time consuming, and the treatment had to be combined with other immunotherapies [2]. Moreover, one of the patients had a late relapse when a personalized RNA vaccine was used [3]. Hence, there is a need for more effective and rapid vaccine development for the treatment of cancer. The use of circular RNAs (circRNA or cRNA) could potentially overcome these obstacles.
Recently, it was shown that circular RNAs (circRNA or cRNA) can form during RNA processing (e.g. back splicing) within cancer cells as well as normal cells [4-6]. During the formation of the circular RNA, the back-splicing process will introduce a unique fusion of two RNA nucleotides stretches. The function of circRNA is not well understood, but it has been shown that protein peptides can be generated from (open reading frames) ORFs [7,8].
Depending on the circRNA, proteins can be expressed from the RNA template, and the ORFs may overlap with the ‘true mRNA’ sequence and produce parts of the ‘true protein’, there is also a chance that the circRNA allow new ORFs to be expressed that is ‘out of frame’ and therefore contain a new composition of amino acids that will be recognized as “non-self”, and potentially reactive to the immune system. Another possible outcome of the circulation is that the ORF will cover the back-splicing site (BSS) and translate a protein that is ‘in-frame’ until the fusion site and then becomes ‘out of frame’ when it passes the BSS.
US2017298347 relates to fusion-circular RNAs (f-circRNAs) for diagnosis of cancer. Additionally, US2017298347 discloses a f-circRNA inhibiting agent which binds to f-circRNAs and said inhibiting agent is used for treating cancer. However, said inhibiting agent is neither a circRNA nor a protein derived from circRNA (paragraph 145; claims 15 and 26). Consequently, US2017298347 is silent about circRNAs and peptide products thereof being used for the treatment of cancer.
CN107384909 relates to the use of circRNA in a screening of gastric cancer. However, CN107384909 is altogether silent about using circRNA as a vaccine.
CN106480033 relates to a kit for detecting the circRNA-005365 gene. However, CN106480033 is altogether silent about using circRNA as a vaccine.
US20160194368 relates to synthetically produced circRNA. However, the circRNA are neither tumor-derived nor endogenous. More importantly, circRNA of US20160194368 does not comprise at least one open reading frame (ORF) not expressed in non-tumorous healthy cells.
OBJECT OF THE INVENTION
The object of the invention is to provide a vaccine for the prophylaxis and treatment of cancer.
The object of the invention is to provide an RNA-based vaccine for the prophylaxis and treatment of cancer.
The object of the invention is to provide an RNA-based vaccine for the prophylaxis and treatment of cancer wherein the RNA is not degraded by RNA-degrading enzymes and nucleases.
The object of the invention is to provide a peptide-based vaccine for the prophylaxis and treatment of cancer.
The object of the invention is to overcome the disadvantages of DNA-based vaccines and immunotherapies.
The object of the invention is to overcome the disadvantages of mRNA-based vaccines and immunotherapies.
SUMMARY OF THE INVENTION
The objects of the present invention are solved by the subject-matter disclosed in the claims.
The present invention relates to a plurality of interrelated products listed below and therefore discloses a single inventive concept:
DNA
RNA
protein (RNA product)
vaccine (DNA, RNA or protein for use in prophylaxis or treatment of cancer),
methods of preparing said RNA and protein, and
diagnostics (protein for the use in screening of disease)
Tumor-derived circRNA
In the preferred embodiment of the invention, the objects of the invention are solved by an endogenous tumor-derived circRNA for use as a vaccine in the prophylaxis and/or treatment of cancer. Preferably said circRNA comprises at least one open reading frame (ORF) not expressed in non-tumorous cells.
In one embodiment of the invention, the tumor-derived circRNA comprises a nucleotide sequence selected from SEQ ID No: 1 to 299 or homologous nucleotide sequences thereof having at least 80% homology.
In a further embodiment of the invention, the tumor-derived circRNA is for use as vaccine in the prophylaxis and/or treatment of prostate cancer, breast cancer, colon cancer, lung cancer and/or pancreatic cancer.
In a further embodiment of the invention, the tumor-derived circRNA is for use as a prophylactic vaccine.
In a further embodiment of the invention, the tumor-derived circRNA is for use as a therapeutic vaccine.
In a further embodiment of the invention, the tumor-derived circRNA is administered (i) to the lymphatic system or proximity of the lymphatic system, preferably to at least one lymph node or proximity of lymph node, or (ii) ex-vivo for immune cell stimulation and cell-based therapy.
The invention also relates to a method of prophylaxis and/or treatment of cancer by using a tumor-derived circRNA. The circRNA may comprise a nucleotide sequence selected from SEQ ID No: 1 to 299 or homologous nucleotide sequences thereof having at least 80% homology.
The invention also relates to a method of preparing a tumor-derived circRNA. The circRNA may comprise a nucleotide sequence selected from SEQ ID No: 1 to 299 or homologous nucleotide sequences thereof having at least 80% homology, comprising the steps of:
a. Extracting total RNA content from tumor cells, and
a. Enriching circRNA by enzyme digestion of linear RNA and
b. Return circRNA to antigen presenting cells in vivo or ex vivo.
The invention also relates to a vaccine comprising a tumor-derived circRNA. The circRNA may comprise a nucleotide sequence selected from SEQ ID No: 1 to 299 or homologous nucleotide sequences thereof having at least 80% homology.
Proteins from Tumor-Derived circRNA
In the preferred embodiment of the invention, the objects of the invention are solved by a protein derived from a tumor-derived circRNA.
In a further embodiment of the invention, the protein is derived from a tumor-derived circRNA comprising a nucleotide sequence selected from SEQ ID No: 1 to 299 or homologous nucleotide sequences thereof having at least 80% homology, wherein said protein comprise an amino acid sequence selected from SEQ ID No: 300 to 598 or homologous amino acid sequences thereof having at least 80% homology.
In a further embodiment of the invention, the protein comprise an amino acid sequence selected from SEQ ID No: 301 to 598 or homologous amino acid sequences thereof having at least 80% homology.
In a further embodiment of the invention, the protein comprises an amino acid selected from SEQ ID No: 301 to 598.
In a further embodiment of the invention, the protein is for use in the prophylaxis and/or treatment of cancer.
In a further embodiment of the invention, the protein is for use in the prophylaxis and/or treatment of prostate cancer, breast cancer, colon cancer, lung cancer and pancreatic cancer.
In a further embodiment of the invention, the protein is for use as a prophylactic vaccine.
In a further embodiment of the invention, the protein is for use as a therapeutic vaccine.
In a further embodiment of the invention, the protein is administered (i) to the lymphatic system or proximity of the lymphatic system, preferably to at least one lymph node or proximity of lymph node, or (ii) ex-vivo for immune cell stimulation and cell-based therapy.
The invention also relates to a method of prophylaxis and/or treatment of cancer by using a protein derived from tumor-derived circRNA. The protein may comprise a protein sequence selected from SEQ ID No: 300 to 598 or homologous nucleotide sequences thereof having at least 80% homology.
The invention also relates to a method of preparing the protein, comprising the steps of:
c. Extracting total RNA content from tumor cells,
d. Enriching circRNA by enzyme digestion of linear RNA,
e. Identifying proteins:
i. RNA sequencing,
ii. protein mass-spectrometry,
iii. In vitro translation, and/or
iv. Prediction of MHC class I/II-binding
f. identification of personalized/general potential of the protein as immunogen by:
v. In vitro stimulation of T-cells from patients with neoantigens, and/or
vi. Determination of B cell-responses to neoantigen
The invention also relates to vaccine comprising the protein derived from a tumor-derived circRNA.
The invention also relates to a pharmaceutical composition comprising the protein derived from a tumor-derived circRNA.
DNA Expressing (or Encoding) RNA and/or Proteins
The invention also relates to DNA which encodes for a circRNA discussed above.
The invention is also related to DNA which encodes for a protein discussed above.
The DNA may have a DNA sequence disclosed in Table 1.
In a further embodiment, said DNA is administered (i) to the lymphatic system or proximity of the lymphatic system, preferably to at least one lymph node or proximity of lymph node, or (ii) ex-vivo for immune cell stimulation and cell-based therapy.
The invention also relates to a vaccine vector or genetic vector which comprises DNA as discussed above.
The invention also relates to a vaccine vector or genetic vector which comprises DNA or RNA that express a protein (described above) corresponding to a protein encoded by a tumor-derived circRNA
BRIEF DESCRIPTION OF FIGURES
FIG. 1A illustrates RNA splicing and FIG. 1B illustrates circRNA splicing.
FIG. 2 illustrates the translation of circRNA into proteins.
FIG. 3 illustrates the process for manufacturing circRNA-based vaccines.
FIG. 4 illustrates the process for manufacturing vaccines comprising proteins derived from circRNA.
FIGS. 5 and 6 show experimental results from circRNA vaccination of rats. There is increased proliferation of CD8+ T cells in vaccinated rats compared to a control rat group.
FIG. 7 show experimental results from circRNA vaccination of rats. There is no difference in proliferation of CD4+ T cells between vaccinated rats and a control rat group.
FIG. 8 show experimental results from circRNA vaccination of rats. In vaccinated rats there is an increased tumor necrosis.
FIGS. 9a-9c show experimental results from circRNA vaccination of mice. There is increased proliferation of CD4+ T cells and B cells in vaccinated mice compared to a control mice group, FIG. 9a and FIG. 9c, respectively. There is a minimal effect on proliferation of CD8+ T-cells, FIG. 9b.
DETAILED DESCRIPTION
Splicing is the editing of the nascent precursor messenger RNA (pre-mRNA) transcript into a mature mRNA. As illustrated in FIG. 1A, after splicing (i.e. normal RNA splicing), introns are removed and exons are joined together (ligated). The splicing of the tumor-derived circRNA of the present invention differs from normal RNA splicing in that, during the formation of the circular RNA, the back-splicing process will introduce a unique fusion of two RNA nucleotides stretches as illustrated in FIG. 1B.
The tumor-derived circRNA of the present invention can be translated into proteins by at least the three types of processes as illustrated in FIG. 2. Both the circRNA and the resulting proteins can be used for use in the prophylaxis and treatment of cancer.
In the Type-I process illustrated in FIG. 2A, the ORF is out of frame and the RNA is translated into a (neo-antigen) protein from an alternative read frame compared to the normal gene product.
The type-II process which is illustrated in FIG. 2B differs from the Type-I process in that, the Type-II process the translation of the (neo-antigen) protein cross a back-splicing site (BSS), and therefore, in the type-II process novel (neo-antigen) proteins can be produced by out-of-frame (from the normal gene product) translation of the ORF both before and/or after the BSS. In the type-II process the fusion site at the BSS can as well generate neo-antigen peptides.
The type III process which is illustrated in FIG. 2C involves a rolling circle amplified protein. The circRNA has an ORF that only contains a start codon (AUG) and produce a peptide that is in-frame with itself after passing the start codon and protein can continue to be translated in a rolling circle amplification manner. This leads to a rolling circle amplification of the generated peptide and potentiate the neo-antigen production, which leads to an enhanced immune response to the antigen.
Delivering RNA to subjects and induction of an adaptive immune response against said RNA is known in the art. This approach has previously been used in eliciting an immune response to RNA. Hence, full-length mRNAs or short oligonucleotide repeats against mutated regions have been selected and amplified in prior art techniques.
The present invention differs from prior art techniques in that tumor tissue is used and total RNA is extracted from the tumor cells. Within the total pool of tumor derived RNA molecules there is a fraction that consist of circularized RNA molecules (due to back-splicing events). The tumor-derived circRNA pool contains tumor specific circRNA with unique open-reading frames (ORFs) not normally expressed. ORFs that can be translated into peptides as novel proteins and hence be presented as novel neo-antigens that can stimulate immune responses toward the neo-antigens.
The strength of the present invention is that it is totally new protein sequence that compose the epitopes that are inducing the immune response, compared to mutations that only change a single or a few amino acids or full proteins over expressed in cancer that already have had a negative selection against their epitopes. Another aspect is that circRNAs are very diverse in their composition and that cancer induced changes have big impact on the repertoire of expressed circRNAs, thus a wide range of neo-antigens can be present at the same time within a tumor. Moreover, circRNAs are nuclease resistant by their circular structure and much more resistant against degradation than linear RNA molecules once injected in the patient enhancing the delivering potential.
The circRNA extracted from the patient will contain the multiplicity of unique tumor-derived circRNAs that have the potential to via antigen presenting cells present novel neo-antigens for the immune system.
In the present invention, tumor-derived circRNA from the patient will be prepared and divided into vaccination batches and booster batches, and administered either by injection in-vivo at sites of immune presentation (like adjacent to lymph nodes, within lymph nodes) or ex-vivo for immune cell stimulation and cell based therapy.
In the present invention, tumor-derived circRNA are isolated and sequenced from subjects (i.e. patients) with well-defined tumor types. The invention is not limited to a certain cancer type and is therefore applicable for all types of cancer. Subsequently, a database with identified and potential unique open reading frames within circRNA are built that are not present within normal cell transcription within the patients. This will:
1. Allow for the identification of potential neo-antigens within (back-spliced) circRNA that are common for a certain tumor type, and
2. Allow for the identification of potential neo-antigens within (back-spliced) circRNA that are unique for a single patient
Importantly, results from (1) will identify circRNA neo-antigens that are suitable for prophylactic vaccination (i.e. to vaccinate healthy individuals against a particular cancer) and therapeutic vaccination (i.e. patients with corresponding cancer). Results from (2) will identify circRNA neo-antigens that can be rapidly introduced into a delivery vector for personalized therapeutic vaccination against pre-existing tumors in a cancer patient.
The pharmaceutical compositions and vaccines of the present invention may comprise pharmaceutically acceptable excipients used in the art. Preservatives, adjuvants, stabilizers and buffers are examples excipients which may be used but the present invention is not restricted to these examples.
Preservatives are used to prevent growth of bacteria or fungi that may be introduced into the vaccine during its use, for example by repeated puncture of a multi-dose vaccine vial with a needle.
Adjuvants helping stimulate a stronger immune response of vaccinated individuals.
Aluminum salts may be incorporated into a vaccine formulation as an adjuvant. Examples of aluminum salts are aluminum hydroxide, aluminum phosphate, alum (potassium aluminum sulfate), or mixed aluminum salts. Organic compounds such as squalene or oil-based compounds may also be used as adjuvant.
Stabilizers keep vaccines potent during transportation and storage. They help protect the vaccine from adverse conditions such as the freeze-drying process, for those vaccines that are freeze dried. Some examples of stabilizers which may be added to vaccines include: sugars such as sucrose and lactose, amino acids such as glycine or the monosodium salt of glutamic acid and proteins such as human serum albumin or gelatin.
In the present invention, the term “protein” is defined as a molecule which comprises one or more chains of amino acid residues. The terms peptide, oligopeptide and polypeptide are in the present invention also included in the definition of protein.
In the present invention, the expression “tumor-derived circRNA” is defined as “tumor-isolated circRNA” (or alternatively “circRNA isolated from tumor”).
In the present invention, the expression “protein derived from a tumor-derived circRNA” is defined as “protein encoded by tumor-derived circRNA” (or alternatively “protein expressed from tumor-derived circRNA”.
EXAMPLES
Example 1—Tumor-derived circRNA
The method described in Example 1 is briefly illustrated in FIG. 3.
Tumor tissue is harvested from the patient and the total RNA content in the tumor cells are extracted with RNA isolation methodology used in the art. The circular RNA fraction is further enriched by for example enzyme digestion of linear RNA (by for example using RNAse R).
RNA quality control and content may be monitored by a combination of Bioanalyser (Agilent) and Qubit (Thermofisher) measurements or by similar instruments. circRNA is aliquoted and may be stored at −80° C. (or at other suitable temperatures) in different batches (e.g. vaccination and booster) ready to be administered to the patient after reconstituting the frozen batches with physiological buffer (such as Hanks Buffers Saline Solution/NaCl).
The endogenous tumor-derived circRNA prepared according to Example 1 comprises a nucleotide sequence selected from SEQ ID No: 1 to 299 (which are disclosed in Table 1 and the sequence listing) or homologous nucleotide sequences thereof having at least 80% homology. However, the present invention is not restricted to these specific sequences, and therefore, tumor-derived circRNA having other RNA sequences may be used for prophylaxis and/or treatment of disease.
The resulting vaccine is a therapeutic vaccine which is personalized for the patient from whom the tumor tissue has been harvested.
The personalized vaccine described in Example 1 is advantageous since it can be produced in 1-2 days. As a contrast, it takes several months, if not longer, to produce the vaccines described in the prior art.
The usability of the method described has been demonstrated in a pilot experiment where personalized therapeutic vaccination led to an increased immune activation against prostate cancer cells in two animal models (the Dunning Rat prostate cancer model and the TRAMP-C1 prostate cancer mouse model).
Example 2—Proteins from Tumor-Derived Circular RNA
The method described in Example 2 is briefly illustrated in FIG. 4.
Tumor tissue is harvested from the patient and the total RNA content in the tumor cells are extracted with RNA isolation methodology used in the art. The circular RNA fraction is further enriched by for example enzyme digestion of linear RNA (by for example using RNAse R).
RNA quality control and content may be monitored by a combination of Bioanalyser (Agilent) and Qubit (Thermofisher) measurements or by similar instruments. circRNA is aliquoted and may be stored at −80° C. (or at other suitable temperatures) in different batches.
The neoantigen proteins which can be expressed by circRNA are identified by using one or more of the following methods:
RNA sequencing
protein mass-spectrometry
In vitro translation
Prediction of MHC class I/II-binding
The next step in the method involves the identification of personalized/general potential of the neoantigen protein as immunogen by using one or more of the following methods:
In vitro stimulation of T-cells from patients with neo-antigens and detection of specific T cell responses by standard immunological assays, such as immune cell proliferation of cytokine production by Flow cytometry or ELISpot Determination of B cell-responses to neoantigen by standard immunological assays (such as ELISA/ELISpot)
The proteins derived from the tumor-derived circRNA isolated according to the method described in Example 2 comprise a protein sequence selected from SEQ ID No: 300 to 598 (which are disclosed in Table 1 and the sequence listing). However, the present invention is not restricted to these specific sequences, and therefore, proteins derived from tumor-derived circRNA having other protein sequences may be used for prophylaxis and/or treatment of disease.
The proteins can be extracted from tumor cells. Moreover, the proteins can also be prepared by liquid-phase synthesis [11,13], solid-phase synthesis [11,13] or by an expression vector [12,13], or by any other method known in the art.
The proteins can also be synthesized by the patient's cell by genetic vaccination or gene therapy with DNA or RNA vectors coding for neoantigen sequences corresponding to a tumor-derived circRNA sequence.
EXPERIMENTS
Rat Experiment:
1. Total RNA was isolated from MAT-Ly-Lu (Dunning R-3327 prostate cancer animal model) cells.
2. Linear RNA was digested with RNAse R (Epicenter) to enrich for circRNA (1U RNAse R/ug total RNA). The digest total RNA was cleaned-up (Agencourt RNAClean XP), and yielded 12 vaccine doses of 30 ug circRNA per dose.
3. Vaccine was prepared for the four Copenhagen rats: One dose (30 ug) circRNA was mixed with physiological NaCl solution to a volume of 100 ul, and the prepared “vaccine” was injected intradermal, close to the inguinal lymph nodes in four Copenhagen rats. Resulting in a dose of 80 ug circRNA/kg animal.
4. To four rats in a control group, 100 ul physiological NaCl solution was injected in the same way as in the vaccination group.
5. Time plan: Vaccination took place during three weeks before injecting 10.000 MAT-Ly-Lu cells within prostate. In brief, at day 0 the first dose was administered, and the second and third was injected at day 7 and day 14. At day 21, ten thousand MAT-Ly-Lu cells was injected into the prostate, and at day 35 the animal was sacrificed.
6. Sample collected: Spleen and lymph nodes where collected, as well as prostate tumor and lung.
7. Analysis: Immune activation was measured by labelling splenocytes/lymphocytes with CFSE and exposing them to mitomycin C-treated MAT-Ly-Lu cells for 5 days. Cells were then labelled with anti-CD8 and anti-CD4 and subjected to flow cytometry. Activation and subsequent proliferation of CD4+ and CD8+ T cells were measured as a decrease in CFSE-intensity (the CFSE is diluted in cells upon cell division). The necrotic tumor areas within the tumor was also investigated and measured between the control group and the vaccinated group.
8. Results: The main mediators of tumor induced immune response are the CD8+ T-cells, the so called cytotoxic T-cells. These cells are well-known to mediate tumor cell elimination of cells expressing neoantigen epitopes on their MHC-I complexes. Interestingly there was a more than threefold induction of proliferation in 50% of the animals (2/4) of the CD8+ T-cells after in vitro exposure to MAT-Ly-Lu cells (2:1 spleenocyte/tumor cell ratio), indicative that two rats have acquired an adaptive immune response against the tumor (see FIGS. 5 and 6). In line with an increased CD8+ T-cell activity there was also a trend toward an increased tumor cell death and necrotic area in vaccinated animals (see FIG. 8). There was no difference in induced CD4 activity between vaccinated rats and a control rat group (see FIG. 7).
9. Summary: We could see an increased mobilization of the adaptive immune system toward the tumor cells after administration of a circRNA vaccination program.
Mouse Experiment:
1. Total RNA was isolated from TRAMP-C1 (prostate cancer mouse cell line with C57BL/6 background) cells.
2. Linear RNA was digested with RNAse R (Epicenter) to enrich for circRNA (1U RNAse R/ug total RNA). The digest total RNA was cleaned-up (Agencourt RNAClean XP), and yielded RNA for eight vaccine doses of 10 ug circRNA per dose.
3. Vaccine was prepared for the four C57BL/6 mice: One dose (10 ug) circRNA was prepared as following, 5 ug circRNA was prepared with 5 ug protamine, incubated 30 min RT for complex binding and then another 5 ug of circRNA was added to the tube in a volume of 100 ul in ringers salt solution. The prepared “vaccine” was injected intradermal, on the flanks close to the groins. Resulting in a dose of ˜300 ug circRNA/kg animal.
4. To four mice in a control group, 100 ul ringers salt solution containing 5 ug of protamine was injected in the same way as in the vaccination group.
5. Time plan: Animals were vaccinated day 0 and day 14, and then sacrificed on day 21.
6. Sample collected: Spleens where collected.
7. Analysis: Single cell suspension of CFSE-labelled splenocytes were co-incubated with lysate from TRAMP-C1 cells for 4-5 days. Proliferation/activation of B and T cells was measured as reduced CFSE-intensity on B220+ B cells, CD8+ T cells or CD4+ T cells by flow cytometry.
8. Results: The tumor induced immune response were a pronounced effect seen among the CD4+ T-cell (FIG. 9a) and the B-cells (FIG. 9b), with minimal effect on the CD8+ T-cells (FIG. 9c). In this experiment the mice had not been exposed to the tumor cells in question before analysis, contrary to in the rat experiment (discussed above) where the rats had known the tumor cells for two weeks and thereby triggering a potential immune response.
9. Summary: We could see an increased mobilization of the adaptive immune system toward the tumor cells after administration of a circRNA vaccination program.
TABLE 1
SEQ
KLK3 neo-
ID NO
antigen
cRNA/protein
DNA sequence
RNA sequence
Protein sequence
1
1/300
TTGCTAGGAAAAGAAATCAGCAGACACAGGTGTA
UUGCUAGGAAAAGAAAUCAGCAGACACAG
LLGKEISRHRCRPECFLNGVILSSLCPGEY
GACCAGAGTGTTTCTTAAATGGTGTAATTTTGTCC
GUGUAGACCAGAGUGUUUCUUAAAUGGU
WPCLETYHSISLRTQIGWGVCVICGVQR
TCTCTGTGTCCTGGGGAATACTGGCCATGCCTGGA
GUAAUUUUGUCCUCUCUGUGUCCUGGGG
GACATATCACTCAATTTCTCTGAGGACACAGATAG
AAUACUGGCCAUGCCUGGAGACAUAUCAC
GATGGGGTGTCTGTGTTATTTGTGGGGTACAGAG
UCAAUUUCUCUGAGGACACAGAUAGGAUG
ATGA
GGGUGUCUGUGUUAUUUGUGGGGUACAG
AGAUGA
2
2/301
ATGTGCCCTGCCCGAAAGGCCTTCCCTGTACACCA
AUGUGCCCUGCCCGAAAGGCCUUCCCUGU
MCPARKAFPVHQGGALPEVDQGHHRG
AGGTGGTGCATTACCGGAAGTGGATCAAGGACAC
ACACCAAGGUGGUGCAUUACCGGAAGUGG
QPLSTPINPLL
CATCGTGGCCAACCCCTGAGCACCCCTATCAACCC
AUCAAGGACACCAUCGUGGCCAACCCCUGA
CCTATTGTAG
GCACCCCUAUCAACCCCCUAUUGUAG
3
3/302
GTGTCTGTGTTATTTGTGGGGTACAGAGATGAAA
GUGUCUGUGUUAUUUGUGGGGUACAGAG
VSVLFVGYRDERGVGSTLREWRVTCAGH
GAGGGGTGGGATCCACACTGAGAGAGTGGAGAG
AUGAAAGAGGGGUGGGAUCCACACUGAGA
CP
TGACATGTGCTGGACACTGTCCATGA
GAGUGGAGAGUGACAUGUGCUGGACACU
GUCCAUGA
4
4/303
TTGTCCTTAGGTGTGAGGTCCAGGGTTGCTAGGA
UUGUCCUUAGGUGUGAGGUCCAGGGUUG
LSLGVRSRVARKRNQQTQV
AAAGAAATCAGCAGACACAGGTGTAG
CUAGGAAAAGAAAUCAGCAGACACAGGUG
UAG
5
5/304
GTGTTTCTTAAATGGTGTAATTTTGTCCTCTCTGTG
GUGUUUCUUAAAUGGUGUAAUUUUGUCC
VFLKWCNFVLSVSWGILAMPGDISLNFS
TCCTGGGGAATACTGGCCATGCCTGGAGACATAT
UCUCUGUGUCCUGGGGAAUACUGGCCAUG
EDTDRMGCLCYLWGTEMKEGWDPH
CACTCAATTTCTCTGAGGACACAGATAGGATGGG
CCUGGAGACAUAUCACUCAAUUUCUCUGA
GTGTCTGTGTTATTTGTGGGGTACAGAGATGAAA
GGACACAGAUAGGAUGGGGUGUCUGUGU
GAGGGGTGGGATCCACACTGA
UAUUUGUGGGGUACAGAGAUGAAAGAGG
GGUGGGAUCCACACUGA
6
6/305
GTGCTGGACACTGTCCATGAAGCACTGAGCAGAA
GUGCUGGACACUGUCCAUGAAGCACUGAG
VLDTVHEALSRSWRHNAPDTHSKDGAE
GCTGGAGGCACAACGCACCAGACACTCACAGCAA
CAGAAGCUGGAGGCACAACGCACCAGACAC
NITHSVLEALGSLEKAVSQGGRVFLWHG
GGATGGAGCTGAAAACATAACCCACTCTGTCCTG
UCACAGCAAGGAUGGAGCUGAAAACAUAA
MGMK
GAGGCACTGGGAAGCCTAGAGAAGGCTGTGAGC
CCCACUCUGUCCUGGAGGCACUGGGAAGC
CAAGGAGGGAGGGTCTTCCTTTGGCATGGGATGG
CUAGAGAAGGCUGUGAGCCAAGGAGGGAG
GGATGAAGTAA
GGUCUUCCUUUGGCAUGGGAUGGGGAUG
AAGUAA
7 (BSS)
7/306
TTGAGCCCCAAGCTTACCACCTGCACCCGGAGAGC
UUGAGCCCCAAGCUUACCACCUGCACCCGG
LSPKLTTCTRRAVSPCGSRLSSSPCP
TGTGTCACCATGTGGGTCCCGGTTGTCTTCCTCAC
AGAGCUGUGUCACCAUGUGGGUCCCGGUU
CCTGTCCGTGA
GUCUUCCUCACCCUGUCCGUGA
8 (BSS)
8/307
GTGCTGCACCCCTCATCCTGTCTCGGATTGTGGGA
GUGCUGCACCCCUCAUCCUGUCUCGGAUU
VLHPSSCLGLWEAGSARSIPNPGRCLWP
GGCTGGGAGTGCGAGAAGCATTCCCAACCCTGGC
GUGGGAGGCUGGGAGUGCGAGAAGCAUU
LVAGQSAAVFWCTPSGSSQLPTASGSAA
AGGTGCTTGTGGCCTCTCGTGGCAGGGCAGTCTG
CCCAACCCUGGCAGGUGCUUGUGGCCUCU
PLILSRIVGGWECEKHSQPWQVLVASRG
CGGCGGTGTTCTGGTGCACCCCCAGTGGGTCCTC
CGUGGCAGGGCAGUCUGCGGCGGUGUUC
RAVCGGVLVHPQWVLTAAHCIR
ACAGCTGCCCACTGCATCAGGAAGTGCTGCACCCC
UGGUGCACCCCCAGUGGGUCCUCACAGCU
TCATCCTGTCTCGGATTGTGGGAGGCTGGGAGTG
GCCCACUGCAUCAGGAAGUGCUGCACCCCU
CGAGAAGCATTCCCAACCCTGGCAGGTGCTTGTG
CAUCCUGUCUCGGAUUGUGGGAGGCUGG
GCCTCTCGTGGCAGGGCAGTCTGCGGCGGTGTTC
GAGUGCGAGAAGCAUUCCCAACCCUGGCA
TGGTGCACCCCCAGTGGGTCCTCACAGCTGCCCAC
GGUGCUUGUGGCCUCUCGUGGCAGGGCA
TGCATCAGG
GUCUGCGGCGGUGUUCUGGUGCACCCCCA
GUGGGUCCUCACAGCUGCCCACUGCAUCA
GG
9 (BSS)
9/308
CTGGGAGTGCGAGAAGCATTCCCAACCCTGGCAG
CUGGGAGUGCGAGAAGCAUUCCCAACCCU
LGVREAFPTLAGACGLSWQGSLRRCSGA
GTGCTTGTGGCCTCTCGTGGCAGGGCAGTCTGCG
GGCAGGUGCUUGUGGCCUCUCGUGGCAG
PPVGPHSCPLHQEVLHPSSCLGLWEAGS
GCGGTGTTCTGGTGCACCCCCAGTGGGTCCTCACA
GGCAGUCUGCGGCGGUGUUCUGGUGCACC
ARSIPNPGRCLWPLVAGQSAAVFWCTPS
GCTGCCCACTGCATCAGGAAGTGCTGCACCCCTCA
CCCAGUGGGUCCUCACAGCUGCCCACUGCA
GSSQLPTASG
TCCTGTCTCGGATTGTGGGAGGCTGGGAGTGCGA
UCAGGAAGUGCUGCACCCCUCAUCCUGUC
GAAGCATTCCCAACCCTGGCAGGTGCTTGTGGCCT
UCGGAUUGUGGGAGGCUGGGAGUGCGAG
CTCGTGGCAGGGCAGTCTGCGGCGGTGTTCTGGT
AAGCAUUCCCAACCCUGGCAGGUGCUUGU
GCACCCCCAGTGGGTCCTCACAGCTGCCCACTGCA
GGCCUCUCGUGGCAGGGCAGUCUGCGGCG
TCAGGA
GUGUUCUGGUGCACCCCCAGUGGGUCCUC
ACAGCUGCCCACUGCAUCAGGA
10 (BSS)
10/309
GTGCTGGACGCTGGACAGGGGGCAAAAGCACCT
GUGCUGGACGCUGGACAGGGGGCAAAAGC
VLDAGQGAKAPARLDPKETSVCGPPCYF
GCTCGTCTTGACCCCAAAGAAACTTCAGTGTGTGGACC
UGCUCGUCUUGACCCCAAAGAAACUUC
Q
ACCTCCATGTTATTTCCAATGA
AGUGUGUGGACCUCCAUGUUAUUUCCAAU
GA
11 (BSS)
11/310
GTGTGCGCAAGTTCACCCTCAGAAGGTGACCAAG
GUGUGCGCAAGUUCACCCUCAGAAGGUGA
VCASSPSEGDQVHAVCWTLDRGQKHLL
TTCATGCTGTGTGCTGGACGCTGGACAGGGGGCA
CCAAGUUCAUGCUGUGUGCUGGACGCUGG
VLTPKKLQCVDLHVISNDVCAQVHPQKV
AAAGCACCTGCTCGTCTTGACCCCAAAGAAACTTC
ACAGGGGGCAAAAGCACCUGCUCGUCUUG
TKFMLCAGRWTGGKSTCS
AGTGTGTGGACCTCCATGTTATTTCCAATGACGTG
ACCCCAAAGAAACUUCAGUGUGUGGACCU
TGTGCGCAAGTTCACCCTCAGAAGGTGACCAAGT
CCAUGUUAUUUCCAAUGACGUGUGUGCGC
TCATGCTGTGTGCTGGACGCTGGACAGGGGGCAA
AAGUUCACCCUCAGAAGGUGACCAAGUUC
AAGCACCTGCTCG
AUGCUGUGUGCUGGACGCUGGACAGGGG
GCAAAAGCACCUGCUCG
12 (BSS)
12/311
ATGTGCCCTGCCCGAAAGGCCTTCCCTGTACACCA
AUGUGCCCUGCCCGAAAGGCCUUCCCUGU
MCPARKAFPVHQGGALPEVDQGHHRG
AGGTGGTGCATTACCGGAAGTGGATCAAGGACAC
ACACCAAGGUGGUGCAUUACCGGAAGUGG
QPLSTPINPLL
CATCGTGGCCAACCCCTGAGCACCCCTATCAACCC
AUCAAGGACACCAUCGUGGCCAACCCCUGA
CCTATTGTAG
GCACCCCUAUCAACCCCCUAUUGUAG
13 (BSS)
13/312
CTGATTCACTATGGGGGGAGGTGTATTGAAGTCC
CUGAUUCACUAUGGGGGGAGGUGUAUUG
LIHYGGRCIEVLQTTLRFDDFLVELTEIKSC
TCCAGACAACCCTCAGATTTGATGATTTCCTAGTA
AAGUCCUCCAGACAACCCUCAGAUUUGAU
YTVGDSGGPLVCNGVLQGITSWGSEPCA
GAACTCACAGAAATAAAGAGCTGTTATACTGTGG
GAUUUCCUAGUAGAACUCACAGAAAUAAA
LPERPSLYTKVVHYRKWIKDTIVANP
GTGATTCTGGGGGCCCACTTGTCTGTAATGGTGTG
GAGCUGUUAUACUGUGGGUGAUUCUGGG
CTTCAAGGTATCACGTCATGGGGCAGTGAACCAT
GGCCCACUUGUCUGUAAUGGUGUGCUUCA
GTGCCCTGCCCGAAAGGCCTTCCCTGTACACCAAG
AGGUAUCACGUCAUGGGGCAGUGAACCAU
GTGGTGCATTACCGGAAGTGGATCAAGGACACCA
GUGCCCUGCCCGAAAGGCCUUCCCUGUACA
TCGTGGCCAACCCCTGA
CCAAGGUGGUGCAUUACCGGAAGUGGAUC
AAGGACACCAUCGUGGCCAACCCCUGA
14
14/313
GTGATCTTGCTGGGTCGGCACAGCCTGTTTCATCC
GUGAUCUUGCUGGGUCGGCACAGCCUGUU
VILLGRHSLFHPEDTGQVFQVSHSFPHPL
(BSS + RCA)
TGAAGACACAGGCCAGGTATTTCAGGTCAGCCAC
UCAUCCUGAAGACACAGGCCAGGUAUUUC
YDMSLLKNRFLRPGDDSSHDLMLLRLSEP
AGCTTCCCACACCCGCTCTACGATATGAGCCTCCT
AGGUCAGCCACAGCUUCCCACACCCGCUCU
AELTDAVKVMDLPTQEPALGTTCYASG
GAAGAATCGATTCCTCAGGCCAGGTGATGACTCC
ACGAUAUGAGCCUCCUGAAGAAUCGAUUC
WGSIEPEECAAPLILSRIVGGWECEKHSQ
AGCCACGACCTCATGCTGCTCCGCCTGTCAGAGCC
CUCAGGCCAGGUGAUGACUCCAGCCACGAC
PWQVLVASRGRAVCGGVLVHPQWVLT
TGCCGAGCTCACGGATGCTGTGAAGGTCATGGAC
CUCAUGCUGCUCCGCCUGUCAGAGCCUGC
AAHCIR
CTGCCCACCCAGGAGCCAGCACTGGGGACCACCT
CGAGCUCACGGAUGCUGUGAAGGUCAUGG
GCTACGCCTCAGGCTGGGGCAGCATTGAACCAGA
ACCUGCCCACCCAGGAGCCAGCACUGGGGA
GGAGTGTGCTGCACCCCTCATCCTGTCTCGGATTG
CCACCUGCUACGCCUCAGGCUGGGGCAGCA
TGGGAGGCTGGGAGTGCGAGAAGCATTCCCAACC
UUGAACCAGAGGAGUGUGCUGCACCCCUC
CTGGCAGGTGCTTGTGGCCTCTCGTGGCAGGGCA
AUCCUGUCUCGGAUUGUGGGAGGCUGGG
GTCTGCGGCGGTGTTCTGGTGCACCCCCAGTGGG
AGUGCGAGAAGCAUUCCCAACCCUGGCAG
TCCTCACA
GUGCUUGUGGCCUCUCGUGGCAGGGCAG
UCUGCGGCGGUGUUCUGGUGCACCCCCAG
UGGGUCCUCACA
PCA3 neo-
antigen
DNA sequence
RNA sequence
Protein Sequence
1
15/314
CTGGCATCAGAAAAACAGAGGGGAGATTTGTGTG
CUGGCAUCAGAAAAACAGAGGGGAGAUUU
LASEKQRGDLCGCSRGRPGRSAWWEGP
GCTGCAGCCGAGGGAGACCAGGAAGATCTGCAT
GUGUGGCUGCAGCCGAGGGAGACCAGGAA
DDTEVRNKKGC
GGTGGGAAGGACCTGATGATACAGAGGTGAGAA
GAUCUGCAUGGUGGGAAGGACCUGAUGA
ATAAGAAAGGCTGCTGA
UACAGAGGUGAGAAAUAAGAAAGGCUGCU
GA
2
16/315
GTGCCTGGTCCCGCTTGTGAGGGAAGGACATTAG
GUGCCUGGUCCCGCUUGUGAGGGAAGGAC
VPGPACEGRTLENELMCSLKDGQENRSC
AAAATGAATTGATGTGTTCCTTAAAGGATGGGCA
AUUAGAAAAUGAAUUGAUGUGUUCCUUA
CGYLFERDYRFEMKSQSEHYQ
GGAAAACAGATCCTGTTGTGGATATTTATTTGAAC
AAGGAUGGGCAGGAAAACAGAUCCUGUUG
GGGATTACAGATTTGAAATGAAGTCACAAAGTGA
UGGAUAUUUAUUUGAACGGGAUUACAGA
GCATTACCAATGA
UUUGAAAUGAAGUCACAAAGUGAGCAUUA
CCAAUGA
3
17/316
ATGGAATACTGTGATGACATGAGGCAGCCAAGCT
AUGGAAUACUGUGAUGACAUGAGGCAGCC
MEYCDDMRQPSWGGDNHGAEGQDSG
GGGGAGGAGATAACCACGGGGCAGAGGGTCAGG
AAGCUGGGGAGGAGAUAACCACGGGGCAG
PAA
ATTCTGGCCCTGCTGCCTAA
AGGGUCAGGAUUCUGGCCCUGCUGCCUAA
4
18/317
CTGATCTCTACGGTTCCTTCTGGGCCCAACATTCTC
CUGAUCUCUACGGUUCCUUCUGGGCCCAA
LISTVPSGPNILHISSHTHF
CATATATCCAGCCACACTCATTTTTAA
CAUUCUCCAUAUAUCCAGCCACACUCAUUU
UUAA
5
19/318
GTGTTCTGGCCCAGGGGATCTGTGAACAGGCTGG
GUGUUCUGGCCCAGGGGAUCUGUGAACAG
VFWPRGSVNRLGSISRSFQGYTY
GAAGCATCTCAAGATCTTTCCAGGGTTATACTTAC
GCUGGGAAGCAUCUCAAGAUCUUUCCAGG
TAG
GUUAUACUUACUAG
6
20/319
GTGCCCATTCTCAAGACCTCAAAATGTCATTCCATT
GUGCCCAUUCUCAAGACCUCAAAAUGUCA
VPILKTSKCHSINITGLTFFFNLEEFNVTCS
AATATCACAGGATTAACTTTTTTTTTTAACCTGGAA
UUCCAUUAAUAUCACAGGAUUAACUUUUU
YGNLITYFVFQCKDD
GAATTCAATGTTACATGCAGCTATGGGAATTTAAT
UUUUUAACCUGGAAGAAUUCAAUGUUACA
TACATATTTTGTTTTCCAGTGCAAAGATGACTAA
UGCAGCUAUGGGAAUUUAAUUACAUAUU
UUGUUUUCCAGUGCAAAGAUGACUAA
7
21/320
TTGAACATGTCAGGCATACATTATTCCTTCTGCCTG
UUGAACAUGUCAGGCAUACAUUAUUCCUU
LNMSGIHYSFCLRSSSLSLKSRMM
AGAAGCTCTTCCTTGTCTCTTAAATCTAGAATGAT
CUGCCUGAGAAGCUCUUCCUUGUCUCUUA
GTAA
AAUCUAGAAUGAUGUAA
8
22/321
CTGGCAATAAAGAATTTACAAAGAGCTACTCAGG
CUGGCAAUAAAGAAUUUACAAAGAGCUAC
LAIKNLQRATQDQLLRALCVCVCV
ACCAGTTGTTAAGAGCTCTGTGTGTGTGTGTGTGT
UCAGGACCAGUUGUUAAGAGCUCUGUGU
GTGTGA
GUGUGUGUGUGUGUGUGA
9
23/322
GTGTACATGCCAAAGTGTGCCTCTCTCTCTTTGAC
GUGUACAUGCCAAAGUGUGCCUCUCUCUC
VYMPKCASLSLTHYFRLKNKHVFKWHYE
CCATTATTTCAGACTTAAAAACAAGCATGTTTTCAA
UUUGACCCAUUAUUUCAGACUUAAAAACA
LPMMYHHHISLFSSKCDNNVIC
ATGGCACTATGAGCTGCCAATGATGTATCACCACC
AGCAUGUUUUCAAAUGGCACUAUGAGCUG
ATATCTCATTATTCTCCAGTAAATGTGATAATAATG
CCAAUGAUGUAUCACCACCAUAUCUCAUU
TCATCTGTTAA
AUUCUCCAGUAAAUGUGAUAAUAAUGUCA
UCUGUUAA
10
24/323
TTGACTTCACAAAAGCAGCTGGAAATGGACAACC
UUGACUUCACAAAAGCAGCUGGAAAUGGA
LTSQKQLEMDNHNMHKSNSYHQLHTA
ACAATATGCATAAATCTAACTCCTACCATCAGCTA
CAACCACAAUAUGCAUAAAUCUAACUCCUA
CACACTGCTTGA
CCAUCAGCUACACACUGCUUGA
11
25/324
CTGAGCTGTCATCGTCCCCATCTCTGTGAGCCACA
CUGAGCUGUCAUCGUCCCCAUCUCUGUGA
LSCHRPHLCEPQPTAGPNACLRSLNQGN
ACCAACAGCAGGACCCAACGCATGTCTGAGATCC
GCCACAACCAACAGCAGGACCCAACGCAUG
QCHELNSPIMDASFWPSLALLLTHISF
TTAAATCAAGGAAACCAGTGTCATGAGTTGAATTC
UCUGAGAUCCUUAAAUCAAGGAAACCAGU
TCCTATTATGGATGCTAGCTTCTGGCCATCTCTGG
GUCAUGAGUUGAAUUCUCCUAUUAUGGA
CTCTCCTCTTGACACATATTAGCTTCTAG
UGCUAGCUUCUGGCCAUCUCUGGCUCUCC
UCUUGACACAUAUUAGCUUCUAG
12
26/325
TTGCTTCCACGACTTTTATCTTTTCTCCAACACATC
UUGCUUCCACGACUUUUAUCUUUUCUCCA
LLPRLLSFLQHIAYQSSLCSVALDFPTRIST
GCTTACCAATCCTCTCTCTGCTCTGTTGCTTTGGAC
ACACAUCGCUUACCAAUCCUCUCUCUGCUC
TLKSFLPSPPLT
TTCCCCACAAGAATTTCAACGACTCTCAAGTCTTTT
UGUUGCUUUGGACUUCCCCACAAGAAUUU
CTTCCATCCCCACCACTAACCTGA
CAACGACUCUCAAGUCUUUUCUUCCAUCCC
CACCACUAACCUGA
13
27/326
ATGCCTAGACCCTTATTTTTATTAATTTCCAATAGA
AUGCCUAGACCCUUAUUUUUAUUAAUUUC
MPRPLFLLISNRCCLWAILL
TGCTGCCTATGGGCTATATTGCTTTAG
CAAUAGAUGCUGCCUAUGGGCUAUAUUGC
UUUAG
14
28/327
ATGAACATTAGATATTTAAAGCTCAAGAGGTTCAA
AUGAACAUUAGAUAUUUAAAGCUCAAGAG
MNIRYLKLKRFKIQLIIFSFFHLPAPLPILLIA
AATCCAACTCATTATCTTCTCTTTCTTTCACCTCCCT
GUUCAAAAUCCAACUCAUUAUCUUCUCUU
LNSMVPNVAMQMRNPVAPCGTCMQD
GCTCCTCTCCCTATATTACTGATTGCACTGAACAGC
UCUUUCACCUCCCUGCUCCUCUCCCUAUAU
C
ATGGTCCCCAATGTAGCCATGCAAATGAGAAACC
UACUGAUUGCACUGAACAGCAUGGUCCCC
CAGTGGCTCCTTGTGGTACATGCATGCAAGACTGC
AAUGUAGCCAUGCAAAUGAGAAACCCAGU
TGA
GGCUCCUUGUGGUACAUGCAUGCAAGACU
GCUGA
15
29/328
GTGCCGAGAAGCTGGCATCAGAAAAACAGAGGG
GUGCCGAGAAGCUGGCAUCAGAAAAACAG
VPRSWHQKNRGEICVAAAEGDQEDLHG
GAGATTTGTGTGGCTGCAGCCGAGGGAGACCAG
AGGGGAGAUUUGUGUGGCUGCAGCCGAG
GKDLMIQR
GAAGATCTGCATGGTGGGAAGGACCTGATGATAC
GGAGACCAGGAAGAUCUGCAUGGUGGGAA
AGAGGTGA
GGACCUGAUGAUACAGAGGUGA
16
30/329
ATGTTTTTGCACATTTCCAGCCCCTTTAAATATCCA
AUGUUUUUGCACAUUUCCAGCCCCUUUAA
MFLHISSPFKYPHTQEAQKEAQRSLGEM
CACACACAGGAAGCACAAAAGGAAGCACAGAGA
AUAUCCACACACACAGGAAGCACAAAAGGA
PGRHLGSSMSLALCLVPLVREGH
TCCCTGGGAGAAATGCCCGGCCGCCATCTTGGGT
AGCACAGAGAUCCCUGGGAGAAAUGCCCG
CATCGATGAGCCTCGCCCTGTGCCTGGTCCCGCTT
GCCGCCAUCUUGGGUCAUCGAUGAGCCUC
GTGAGGGAAGGACATTAG
GCCCUGUGCCUGGUCCCGCUUGUGAGGGA
AGGACAUUAG
17
31/330
ATGGGCAGGAAAACAGATCCTGTTGTGGATATTT
AUGGGCAGGAAAACAGAUCCUGUUGUGGA
MGRKTDPVVDIYLNGITDLK
ATTTGAACGGGATTACAGATTTGAAATGA
UAUUUAUUUGAACGGGAUUACAGAUUUG
AAAUGA
18
32/331
GTGAGCATTACCAATGAGAGGAAAACAGACGAG
GUGAGCAUUACCAAUGAGAGGAAAACAGA
VSITNERKTDEKILMASQDMQQTKWNT
AAAATCTTGATGGCTTCACAAGACATGCAACAAAC
CGAGAAAAUCUUGAUGGCUUCACAAGACA
VMT
AAAATGGAATACTGTGATGACATGA
UGCAACAAACAAAAUGGAAUACUGUGAUG
ACAUGA
19
33/332
GTGAATTATCTAATCAACATCATCCTCAGTGTCTTT
GUGAAUUAUCUAAUCAACAUCAUCCUCAG
VNYLINIILSVFAHTEIHFPLLCPFSRPQNVI
GCCCATACTGAAATTCATTTCCCACTTTTGTGCCCA
UGUCUUUGCCCAUACUGAAAUUCAUUUCC
PLISQD
TTCTCAAGACCTCAAAATGTCATTCCATTAATATCA
CACUUUUGUGCCCAUUCUCAAGACCUCAA
CAGGATTAA
AAUGUCAUUCCAUUAAUAUCACAGGAUUA
A
20
34/333
TTGTTTTCCAGTGCAAAGATGACTAAGTCCTTTATC
UUGUUUUCCAGUGCAAAGAUGACUAAGUC
LFSSAKMTKSFIPPLCLIFFPV
CCTCCCCTTTGTTTGATTTTTTTTCCAGTATA
CUUUAUCCCUCCCCUUUGUUUGAUUUUU
UUUCCAGUAUA
21
35/334
ATGCAAGAGCCACAGAGGGAATGTTTATGGGGCA
AUGCAAGAGCCACAGAGGGAAUGUUUAUG
MQEPQRECLWGTFVSLGCEAKAGNLIVS
CGTTTGTAAGCCTGGGATGTGAAGCAAAGGCAGG
GGGCACGUUUGUAAGCCUGGGAUGUGAA
YIIYFISLSLSQYPTSFSQNSCSANPQR
GAACCTCATAGTATCTTATATAATATACTTCATTTC
GCAAAGGCAGGGAACCUCAUAGUAUCUUA
TCTATCTCTATCACAATATCCAACAAGCTTTTCACA
UAUAAUAUACUUCAUUUCUCUAUCUCUAU
GAATTCATGCAGTGCAAATCCCCAAAGGTAA
CACAAUAUCCAACAAGCUUUUCACAGAAU
UCAUGCAGUGCAAAUCCCCAAAGGUAA
22
36/335
GTGGGTTCTCTTAAGCAAAATACTTGCATTAGGTC
GUGGGUUCUCUUAAGCAAAAUACUUGCAU
VGSLKQNTCIRSQLGLCIRRFEKYSILSRSQ
TCAGCTGGGGCTGTGCATCAGGCGGTTTGAGAAA
UAGGUCUCAGCUGGGGCUGUGCAUCAGGC
NLNSLIF
TATTCAATTCTCAGCAGAAGCCAGAATTTGAATTC
GGUUUGAGAAAUAUUCAAUUCUCAGCAGA
CCTCATCTTTTAG
AGCCAGAAUUUGAAUUCCCUCAUCUUUUA
G
23
37/336
CTGTCCCTCTTTGTGTTCATGGATAGTCCAATAAAT
CUGUCCCUCUUUGUGUUCAUGGAUAGUCC
LSLFVFMDSPINNVIFELMLIGENIRTLSDI
AATGTTATCTTTGAACTGATGCTCATAGGAGAGAA
AAUAAAUAAUGUUAUCUUUGAACUGAUGC
NIRDSKKY
TATAAGAACTCTGAGTGATATCAACATTAGGGATT
UCAUAGGAGAGAAUAUAAGAACUCUGAGU
CAAAGAAATATTAG
GAUAUCAACAUUAGGGAUUCAAAGAAAUA
UUAG
24
38/337
CTGTTGCTTTGGACTTCCCCACAAGAATTTCAACG
CUGUUGCUUUGGACUUCCCCACAAGAAUU
LLLWTSPQEFQRLSSLFFHPHH
ACTCTCAAGTCTTTTCTTCCATCCCCACCACTAA
UCAACGACUCUCAAGUCUUUUCUUCCAUC
CCCACCACUAA
25
39/338
GTGGTACATGCATGCAAGACTGCTGAAGCCAGAA
GUGGUACAUGCAUGCAAGACUGCUGAAGC
VVHACKTAEARRMTDYASWVEGTTPGP
GGATGACTGATTACGCCTCATGGGTGGAGGGGAC
CAGAAGGAUGACUGAUUACGCCUCAUGGGS
CACTCCTGGGCCTTCGTGA
UGGAGGGGACCACUCCUGGGCCUUCGUGA
26
40/339
TTGTCAGGAGCAAGACCTGAGATGCTCCCTGCCTT
UUGUCAGGAGCAAGACCUGAGAUGCUCCC
LSGARPEMLPAFSVLCISPF
CAGTGTCCTCTGCATCTCCCCTTTCTAA
UGCCUUCAGUGUCCUCUGCAUCUCCCCUU
UCUAA
27
41/340
TTGGCATACTATATCAACTTTGATTCTTTGTTACAA
UUGGCAUACUAUAUCAACUUUGAUUCUU
LAYYINFDSLLQLFLLFYHQSGFYSLYYYYF
CTTTTCTTACTCTTTTATCACCAAAGTGGCTTTTATT
UGUUACAACUUUUCUUACUCUUUUAUCAC
LLLLYYVVIILFSIVSIYLI
CTCTTTATTATTATTATTTTCTTTTACTACTATATTA
CAAAGUGGCUUUUAUUCUCUUUAUUAUU
CGTTGTTATTATTTTGTTCTCTATAGTATCAATTTAT
AUUAUUUUCUUUUACUACUAUAUUACGU
TTGATTTAG
UGUUAUUAUUUUGUUCUCUAUAGUAUCA
AUUUAUUUGAUUUAG
28
42/341
GTGACATGTTTTTGCACATTTCCAGCCCCTTTAAAT
GUGACAUGUUUUUGCACAUUUCCAGCCCC
VTCFCTFPAPLNIHTHRKHKRKHRDPWE
ATCCACACACACAGGAAGCACAAAAGGAAGCACA
UUUAAAUAUCCACACACACAGGAAGCACAA
KCPAAILGHR
GAGATCCCTGGGAGAAATGCCCGGCCGCCATCTT
AAGGAAGCACAGAGAUCCCUGGGAGAAAU
GGGTCATCGATGA
GCCCGGCCGCCAUCUUGGGUCAUCGAUGA
29
43/342
GTGACCTTTCTACACTGTAGAATAACATTACTCATT
GUGACCUUUCUACACUGUAGAAUAACAUU
VTFLHCRITLLILFKDPSCCCLICS
TTGTTCAAAGACCCTTCGTGTTGCTGCCTAATATGT
ACUCAUUUUGUUCAAAGACCCUUCGUGUU
AGCTGA
GCUGCCUAAUAUGUAGCUGA
30
44/343
CTGTTTTTCCTAAGGAGTGTTCTGGCCCAGGGGAT
CUGUUUUUCCUAAGGAGUGUUCUGGCCCA
LFFLRSVLAQGICEQAGKHLKIFPGLYLLA
CTGTGAACAGGCTGGGAAGCATCTCAAGATCTTTC
GGGGAUCUGUGAACAGGCUGGGAAGCAUC
HSMIITE
CAGGGTTATACTTACTAGCACACAGCATGATCATT
UCAAGAUCUUUCCAGGGUUAUACUUACUA
ACGGAGTGA
GCACACAGCAUGAUCAUUACGGAGUGA
31
45/344
GTGTCTTTGCCCATACTGAAATTCATTTCCCACTTT
GUGUCUUUGCCCAUACUGAAAUUCAUUUC
VSLPILKFISHFCAHSQDLKMSFH
TGTGCCCATTCTCAAGACCTCAAAATGTCATTCCAT
CCACUUUUGUGCCCAUUCUCAAGACCUCA
TAA
AAAUGUCAUUCCAUUAA
32
46/345
ATGCTTAGCCTTGTACTGAGGCTGTATACAGCCAC
AUGCUUAGCCUUGUACUGAGGCUGUAUAC
MLSLVLRLYTATASPHPSSLICHHHQPLPC
AGCCTCTCCCCATCCCTCCAGCCTTATCTGTCATCA
AGCCACAGCCUCUCCCCAUCCCUCCAGCCU
T
CCATCAACCCCTCCCATGCACCTAA
UAUCUGUCAUCACCAUCAACCCCUCCCAUG
CACCUAA
33
47/346
ATGCAAAGAAGGGACACATATGAGATTCATCATC
AUGCAAAGAAGGGACACAUAUGAGAUUCA
MQRRDTYEIHHHMRQQILKV
ACATGAGACAGCAAATACTAAAAGTGTAA
UCAUCACAUGAGACAGCAAAUACUAAAAG
UGUAA
34
48/347
GTGCAAATCCCCAAAGGTAACCTTTATCCATTTCAT
GUGCAAAUCCCCAAAGGUAACCUUUAUCC
VQIPKGNLYPFHGECALEFWQIILVTYLNF
GGTGAGTGCGCTTTAGAATTTTGGCAAATCATACT
AUUUCAUGGUGAGUGCGCUUUAGAAUUU
EMCLSL
GGTCACTTATCTCAACTTTGAGATGTGTTTGTCCTT
UGGCAAAUCAUACUGGUCACUUAUCUCAA
GTAG
CUUUGAGAUGUGUUUGUCCUUGUAG
35
49/348
GTGTGTGTGTGTGTGTGTGTGAGTGTACATGCCA
GUGUGUGUGUGUGUGUGUGUGAGUGUA
VCVCVCVSVHAKVCLSLFDPLFQT
AAGTGTGCCTCTCTCTCTTTGACCCATTATTTCAGA
CAUGCCAAAGUGUGCCUCUCUCUCUUUGA
CTTAA
CCCAUUAUUUCAGACUUAA
36
50/349
TTGACATATATTGTTAGAAGCACCTCGCATTTGTG
UUGACAUAUAUUGUUAGAAGCACCUCGCA
LTYIVRSTSHLWVLLSKILALGLSWGCASG
GGTTCTCTTAAGCAAAATACTTGCATTAGGTCTCA
UUUGUGGGUUCUCUUAAGCAAAAUACUU
GLRNIQFSAEARI
GCTGGGGCTGTGCATCAGGCGGTTTGAGAAATAT
GCAUUAGGUCUCAGCUGGGGCUGUGCAUC
TCAATTCTCAGCAGAAGCCAGAATTTGA
AGGCGGUUUGAGAAAUAUUCAAUUCUCAG
CAGAAGCCAGAAUUUGA
37
51/350
TTGGAGAGGATTCAGACAGCTCAGGTGCTTTCACT
UUGGAGAGGAUUCAGACAGCUCAGGUGCU
LERIQTAQVLSLMSLNFCPSLCSWIVQ
AATGTCTCTGAACTTCTGTCCCTCTTTGTGTTCATG
UUCACUAAUGUCUCUGAACUUCUGUCCCU
GATAGTCCAATAA
CUUUGUGUUCAUGGAUAGUCCAAUAA
38
52/351
GTGATATCAACATTAGGGATTCAAAGAAATATTAG
GUGAUAUCAACAUUAGGGAUUCAAAGAAA
VISTLGIQRNIRFKLTLVKRNQDTKNSELS
ATTTAAGCTCACACTGGTCAAAAGGAACCAAGAT
UAUUAGAUUUAAGCUCACACUGGUCAAAA
SSPSL
ACAAAGAACTCTGAGCTGTCATCGTCCCCATCTCT
GGAACCAAGAUACAAAGAACUCUGAGCUG
GTGA
UCAUCGUCCCCAUCUCUGUGA
39
53/352
CTGCTCTGTTGCTTTGGACTTCCCCACAAGAATTTC
CUGCUCUGUUGCUUUGGACUUCCCCACAA
LLCCFGLPHKNFNDSQVFSSIPTTNLNA
AACGACTCTCAAGTCTTTTCTTCCATCCCCACCACT
GAAUUUCAACGACUCUCAAGUCUUUUCUU
AACCTGAATGCCTAG
CCAUCCCCACCACUAACCUGAAUGCCUAG
40
54/353
CTGATTACGCCTCATGGGTGGAGGGGACCACTCC
CUGAUUACGCCUCAUGGGUGGAGGGGACC
ITPHGWRGPLLGLRDCQEQDLRCSLPSVS
TGGGCCTTCGTGATTGTCAGGAGCAAGACCTGAG
ACUCCUGGGCCUUCGUGAUUGUCAGGAGC
SASPLSNEDP
ATGCTCCCTGCCTTCAGTGTCCTCTGCATCTCCCCT
AAGACCUGAGAUGCUCCCUGCCUUCAGUG
TTCTAATGAAGATCCATAG
UCCUCUGCAUCUCCCCUUUCUAAUGAAGA
UCCAUAG
41
55/354
TTGCTACATTTGAGAATTCCAATTAGGAACTCACA
UUGCUACAUUUGAGAAUUCCAAUUAGGAA
LLHLRIPIRNSHVLSALSIF
TGTTTTATCTGCCCTATCAATTTTTTAA
CUCACAUGUUUUAUCUGCCCUAUCAAUUU
UUUAA
42
56/355
GTGTCTTGGCATACTATATCAACTTTGATTCTTTGT
GUGUCUUGGCAUACUAUAUCAACUUUGA
VSWHTISTLILCYNFSYSFITKVAFILFIIIIFF
TACAACTTTTCTTACTCTTTTATCACCAAAGTGGCT
UUCUUUGUUACAACUUUUCUUACUCUUU
YYYITLLLFCSL
TTTATTCTCTTTATTATTATTATTTTCTTTTACTACTA
UAUCACCAAAGUGGCUUUUAUUCUCUUUA
TATTACGTTGTTATTATTTTGTTCTCTATAG
UUAUUAUUAUUUUCUUUUACUACUAUAU
UACGUUGUUAUUAUUUUGUUCUCUAUAG
43
57/356
TTGCTGACTTTTAAAATAAGTGATTCGGGGGGTG
UUGCUGACUUUUAAAAUAAGUGAUUCGG
LLTFKISDSGGGRTGEGEH
GGAGAACAGGGGAGGGAGAGCATTAG
GGGGUGGGAGAACAGGGGAGGGAGAGCA
UUAG
44
58/357
GTGCAGCAAACCACTATGGCACACGTATACCTGTG
GUGCAGCAAACCACUAUGGCACACGUAUA
VQQTTMAHVYLCNKPTHSAHVSQNVK
TAACAAACCTACACATTCTGCACATGTATCCCAGA
CCUGUGUAACAAACCUACACAUUCUGCACA
ACGTAAAGTAA
UGUAUCCCAGAACGUAAAGUAA
45 (BSS)
59/358
GTGCAGCAAAGAAAGACTACAGACATCTCAATGG
GUGCAGCAAAGAAAGACUACAGACAUCUC
VQQRKTTDISMAGELQHIYLVFQ
CAGGGGAATTACAACACATATACTTAGTGTTTCAA
AAUGGCAGGGGAAUUACAACACAUAUACU
TGA
UAGUGUUUCAAUGA
46 (BSS)
60/359
GTGAGTCTCCTCAGTGACACAGGGCTGGATCACC
GUGAGUCUCCUCAGUGACACAGGGCUGGA
VSLLSDTGLDHHRRHFLSTQCSKERLQTS
ATCGACGGCACTTTCTGAGTACTCAGTGCAGCAAA
UCACCAUCGACGGCACUUUCUGAGUACUC
QWQGNYNTYT
GAAAGACTACAGACATCTCAATGGCAGGGGAATT
AGUGCAGCAAAGAAAGACUACAGACAUCU
ACAACACATATACTTAG
CAAUGGCAGGGGAAUUACAACACAUAUAC
UUAG
47 (BSS)
61/360
GTGACATGTTTTTGCACATTTCCAGCCCCTTTAAAT
GUGACAUGUUUUUGCACAUUUCCAGCCCC
VTCFCTFPAPLNIHTHRKHKRKHRGEK
ATCCACACACACAGGAAGCACAAAAGGAAGCACA
UUUAAAUAUCCACACACACAGGAAGCACAA
GAGGTGAGAAATAA
AAGGAAGCACAGAGGUGAGAAAUAA
48 (BSS)
62/361
GTGCAGCAAAGAAAGACTACAGACATCTCAATGG
GUGCAGCAAAGAAAGACUACAGACAUCUC
VQQRKTTDISMAGELQHIYLVFQ
CAGGGGAATTACAACACATATACTTAGTGTTTCAA
AAUGGCAGGGGAAUUACAACACAUAUACU
TGA
UAGUGUUUCAAUGA
49 (BSS)
63/362
ATGTTTTTGCACATTTCCAGCCCCTTTAAATATCCA
AUGUUUUUGCACAUUUCCAGCCCCUUUAA
MFLHISSPFKYPHTQEAQKEAQRNYNTY
CACACACAGGAAGCACAAAAGGAAGCACAGAGG
AUAUCCACACACACAGGAAGCACAAAAGGA
T
AATTACAACACATATACTTAG
AGCACAGAGGAAUUACAACACAUAUACUU
AG
50 (BSS)
64/363
GTGACATGTTTTTGCACATTTCCAGCCCCTTTAAAT
GUGACAUGUUUUUGCACAUUUCCAGCCCC
VTCFCTFPAPLNIHTHRKHKRKHRGITTHI
ATCCACACACACAGGAAGCACAAAAGGAAGCACA
UUUAAAUAUCCACACACACAGGAAGCACAA
LSVSMNTKINK
GAGGAATTACAACACATATACTTAGTGTTTCAATG
AAGGAAGCACAGAGGAAUUACAACACAUA
AACACCAAGATAAATAAGTGA
UACUUAGUGUUUCAAUGAACACCAAGAUA
AAUAAGUGA
AR neo-
antigen
DNA sequence
RNA sequence
Protein sequence
1
65/364
CTGCCTTCCCCACCCCGAGTGCGGAGCCAGAGAT
CUGCCUUCCCCACCCCGAGUGCGGAGCCAG
LPSPPRVRSQRSKDEKAVRSSVAKKQNK
CAAAAGATGAAAAGGCAGTCAGGTCTTCAGTAGC
AGAUCAAAAGAUGAAAAGGCAGUCAGGUC
QKQKSRNKRKR
CAAAAAACAAAACAAACAAAAACAAAAAAGCCGA
UUCAGUAGCCAAAAAACAAAACAAACAAAA
AATAAAAGAAAAAGATAA
ACAAAAAAGCCGAAAUAAAAGAAAAAGAUA
A
2
66/365
TTGCACCTACTTCAGTGGACACTGAATTTGGAAGG
UUGCACCUACUUCAGUGGACACUGAAUUU
LHLLQWTLNLEGGGFCFFLLRSGHLLNLP
TGGAGGATTTTGTTTTTTTCTTTTAAGATCTGGGCA
GGAAGGUGGAGGAUUUUGUUUUUUUCUU
FKY
TCTTTTGAATCTACCCTTCAAGTATTAA
UUAAGAUCUGGGCAUCUUUUGAAUCUACC
CUUCAAGUAUUAA
3
67/366
GTGTCTTCTTCTGCACGAGACTTTGAGGCTGTCAG
GUGUCUUCUUCUGCACGAGACUUUGAGGC
VSSSARDFEAVRALFAWLLPQVSFSGASR
AGCGCTTTTTGCGTGGTTGCTCCCGCAAGTTTCCTT
UGUCAGAGCGCUUUUUGCGUGGUUGCUC
RWAASCSDYRIITAC
CTCTGGAGCTTCCCGCAGGTGGGCAGCTAGCTGC
CCGCAAGUUUCCUUCUCUGGAGCUUCCCG
AGCGACTACCGCATCATCACAGCCTGTTGA
CAGGUGGGCAGCUAGCUGCAGCGACUACC
GCAUCAUCACAGCCUGUUGA
4
68/367
CTGAGCAAGAGAAGGGGAGGCGGGGTAAGGGA
CUGAGCAAGAGAAGGGGAGGCGGGGUAAG
LSKRRGGGVREVGGRFSQAQGWKCS
AGTAGGTGGAAGATTCAGCCAAGCTCAAGGATGG
GGAAGUAGGUGGAAGAUUCAGCCAAGCUC
AAGTGCAGTTAG
AAGGAUGGAAGUGCAGUUAG
5
69/368
GTGAGGATGGTTCTCCCCAAGCCCATCGTAGAGG
GUGAGGAUGGUUCUCCCCAAGCCCAUCGU
VRMVLPKPIVEAPQATWSWMRNSNLH
CCCCACAGGCTACCTGGTCCTGGATGAGGAACAG
AGAGGCCCCACAGGCUACCUGGUCCUGGA
SRSRPWSATPREVASQSLEPPWPPARGC
CAACCTTCACAGCCGCAGTCGGCCCTGGAGTGCC
UGAGGAACAGCAACCUUCACAGCCGCAGUC
RSSCQHLRTRMTQLPHPRCPCWAPLSPA
ACCCCGAGAGAGGTTGCGTCCCAGAGCCTGGAGC
GGCCCUGGAGUGCCACCCCGAGAGAGGUU
CGCCGTGGCCGCCAGCAAGGGGCTGCCGCAGCA
GCGUCCCAGAGCCUGGAGCCGCCGUGGCC
GCTGCCAGCACCTCCGGACGAGGATGACTCAGCT
GCCAGCAAGGGGCUGCCGCAGCAGCUGCCA
GCCCCATCCACGTTGTCCCTGCTGGGCCCCACTTT
GCACCUCCGGACGAGGAUGACUCAGCUGC
CCCCGGCTTAA
CCCAUCCACGUUGUCCCUGCUGGGCCCCAC
UUUCCCCGGCUUAA
6
70/369
CTGACAACGCCAAGGAGTTGTGTAAGGCAGTGTC
CUGACAACGCCAAGGAGUUGUGUAAGGCA
LTTPRSCVRQCRCPWAWVWRRWSI
GGTGTCCATGGGCCTGGGTGTGGAGGCGTTGGA
GUGUCGGUGUCCAUGGGCCUGGGUGUGG
GCATCTGA
AGGCGUUGGAGCAUCUGA
7
71/370
GTGTCAAAAGCGAAATGGGCCCCTGGATGGATAG
GUGUCAAAAGCGAAAUGGGCCCCUGGAUG
VSKAKWAPGWIATPDLTGTCVWRLPGT
CTACTCCGGACCTTACGGGGACATGCGTTTGGAG
GAUAGCUACUCCGGACCUUACGGGGACAU
MFCPLTITFHPRRPA
ACTGCCAGGGACCATGTTTTGCCCATTGACTATTA
GCGUUUGGAGACUGCCAGGGACCAUGUUU
CTTTCCACCCCAGAAGACCTGCCTGA
UGCCCAUUGACUAUUACUUUCCACCCCAG
AAGACCUGCCUGA
8
72/371
GTGGAGATGAAGCTTCTGGGTGTCACTATGGAGC
GUGGAGAUGAAGCUUCUGGGUGUCACUA
VEMKLLGVTMELSHVEAARSSSKEPLKG
TCTCACATGTGGAAGCTGCAAGGTCTTCTTCAAAA
UGGAGCUCUCACAUGUGGAAGCUGCAAGG
NRSTCAPAEMIALLINSEGKIVHLVVFGN
GAGCCGCTGAAGGGAAACAGAAGTACCTGTGCGC
UCUUCUUCAAAAGAGCCGCUGAAGGGAAA
VMKQG
CAGCAGAAATGATTGCACTATTGATAAATTCCGAA
CAGAAGUACCUGUGCGCCAGCAGAAAUGA
GGAAAAATTGTCCATCTTGTCGTCTTCGGAAATGT
UUGCACUAUUGAUAAAUUCCGAAGGAAAA
TATGAAGCAGGGATGA
AUUGUCCAUCUUGUCGUCUUCGGAAAUG
UUAUGAAGCAGGGAUGA
9
73/372
TTGCATGCAAAAGAAAAAATCCCACATCCTGCTCA
UUGCAUGCAAAAGAAAAAAUCCCACAUCCU
LHAKEKIPHPAQDASTSSPSSWTPCSLLR
AGACGCTTCTACCAGCTCACCAAGCTCCTGGACTC
GCUCAAGACGCUUCUACCAGCUCACCAAGC
ESCISSLLTC
CGTGCAGCCTATTGCGAGAGAGCTGCATCAGTTC
UCCUGGACUCCGUGCAGCCUAUUGCGAGA
ACTTTTGACCTGCTAA
GAGCUGCAUCAGUUCACUUUUGACCUGCU
AA
10
74/373
CTGTGCAAGTGCCCAAGATCCTTTCTGGGAAAGTC
CUGUGCAAGUGCCCAAGAUCCUUUCUGGGLC
KCPRSFLGKSSPSISTPSEALETLFPHPS
AAGCCCATCTATTTCCACACCCAGTGAAGCATTGG
AAAGUCAAGCCCAUCUAUUUCCACACCCAG
SCPLSDVFCLL
AAACCCTATTTCCCCACCCCAGCTCATGCCCCCTTT
UGAAGCAUUGGAAACCCUAUUUCCCCACCC
CAGATGTCTTCTGCCTGTTATAA
CAGCUCAUGCCCCCUUUCAGAUGUCUUCU
GCCUGUUAUAA
11
75/374
ATGGTGTTGTATGCCTTTAAATCTGTGATGATCCT
AUGGUGUUGUAUGCCUUUAAAUCUGUGA
MVLYAFKSVMILIWPSVKLCLFTALLCAS
CATATGGCCCAGTGTCAAGTTGTGCTTGTTTACAG
UGAUCCUCAUAUGGCCCAGUGUCAAGUUG
HTNVYLSYATGSLES
CACTACTCTGTGCCAGCCACACAAACGTTTACTTA
UGCUUGUUUACAGCACUACUCUGUGCCAG
TCTTATGCCACGGGAAGTTTAGAGAGCTAA
CCACACAAACGUUUACUUAUCUUAUGCCA
CGGGAAGUUUAGAGAGCUAA
12
76/375
CTGGGGAAATCAAAACAAAAACAAGCAAACAAAA
CUGGGGAAAUCAAAACAAAAACAAGCAAAC
LGKSKQKQANKKKKQKQNKK
AAAAAAAGCAAAAACAAAACAAAAAATAA
AAAAAAAAAAAGCAAAAACAAAACAAAAAA
UAA
13
77/376
GTGGAGCAATTCATTATACTGAAAATGTGCTTGTT
GUGGAGCAAUUCAUUAUACUGAAAAUGU
VEQFIILKMCLLLKICLHVNASPPNPFLSHS
GTTGAAAATTTGTCTGCATGTTAATGCCTCACCCC
GCUUGUUGUUGAAAAUUUGUCUGCAUGU
LPPTSD
CAAACCCTTTTCTCTCTCACTCTCTGCCTCCAACTTC
UAAUGCCUCACCCCCAAACCCUUUUCUCUC
AGATTGA
UCACUCUCUGCCUCCAACUUCAGAUUGA
14
78/377
ATGTTCTCTTCAGCCAAAACTTGGCGACTTCCACA
AUGUUCUCUUCAGCCAAAACUUGGCGACU
MFSSAKTWRLPQKSLTTEKKESRDLTLCK
GAAAAGTCTGACCACTGAGAAGAAGGAGAGCAG
UCCACAGAAAAGUCUGACCACUGAGAAGA
APFGSRSAFSCVSQGGAGARGEENDSLA
AGATTTAACCCTTTGTAAGGCCCCATTTGGATCCA
AGGAGAGCAGAGAUUUAACCCUUUGUAAG
VLLLRTLTE
GGTCTGCTTTCTCATGTGTGAGTCAGGGAGGAGC
GCCCCAUUUGGAUCCAGGUCUGCUUUCUC
TGGAGCCAGAGGAGAAGAAAATGATAGCTTGGCT
AUGUGUGAGUCAGGGAGGAGCUGGAGCC
GTTCTCCTGCTTAGGACACTGACTGAATAG
AGAGGAGAAGAAAAUGAUAGCUUGGCUG
UUCUCCUGCUUAGGACACUGACUGAAUAG
15
79/378
CTGCCAAACTCCGTGAAGCCACAAGCACCTTATGT
CUGCCAAACUCCGUGAAGCCACAAGCACCU
LPNSVKPQAPYVLPSVFCGPEFHHTAFQP
CCTCCCTTCAGTGTTTTGTGGGCCTGAATTTCATCA
UAUGUCCUCCCUUCAGUGUUUUGUGGGCC
WSSSLFASFGHVHRFSVKSPHHQEG
CACTGCATTTCAGCCATGGTCATCAAGCCTGTTTG
UGAAUUUCAUCACACUGCAUUUCAGCCAU
CTTCTTTTGGGCATGTTCACAGATTCTCTGTTAAGA
GGUCAUCAAGCCUGUUUGCUUCUUUUGG
GCCCCCACCACCAAGAAGGTTAG
GCAUGUUCACAGAUUCUCUGUUAAGAGCC
CCCACCACCAAGAAGGUUAG
16
80/379
GTGGGAAGAATGAAGGCACTAGAACCAGAAACC
GUGGGAAGAAUGAAGGCACUAGAACCAGA
VGRMKALEPETLQMLFLSPSISTCRSHEK
CTGCAAATGCTCTTCTTGTCACCCAGCATATCCACC
AACCCUGCAAAUGCUCUUCUUGUCACCCA
REGTKRRL
TGCAGAAGTCATGAGAAGAGAGAAGGAACAAAG
GCAUAUCCACCUGCAGAAGUCAUGAGAAG
AGGAGACTCTGA
AGAGAAGGAACAAAGAGGAGACUCUGA
17
81/380
CTGACATTGCCCATACTCACTCAGATTCCCCACCTT
CUGACAUUGCCCAUACUCACUCAGAUUCCC
LTLPILTQIPHLCCCLLVRGRPNH
TGTTGCTGCCTCTTAGTCAGAGGGAGGCCAAACC
CACCUUUGUUGCUGCCUCUUAGUCAGAGG
ATTGA
GAGGCCAAACCAUUGA
18
82/381
ATGAGTAATATGCCAATCCAAGACTGCTGGAGAA
AUGAGUAAUAUGCCAAUCCAAGACUGCUG
MSNMPIQDCWRKLKLTGSLFGVG
AACTAAAGCTGACAGGTTCCCTTTTTGGGGTGGG
GAGAAAACUAAAGCUGACAGGUUCCCUUU
ATAG
UUGGGGUGGGAUAG
19
83/382
ATGTACAGGATCACTTTTAGCTGTTTTAAACAGAA
AUGUACAGGAUCACUUUUAGCUGUUUUA
MYRITFSCFKQKKISTTLFSYTRLHFNRSFT
AAAAATATCCACCACTCTTTTCAGTTACACTAGGTT
AACAGAAAAAAAUAUCCACCACUCUUUUCA
SVLE
ACATTTTAATAGGTCCTTTACATCTGTTTTGGAATG
GUUACACUAGGUUACAUUUUAAUAGGUCC
A
UUUACAUCUGUUUUGGAAUGA
20
84/383
GTGATACACAGATTGAATTATATCATTTTCATATCT
GUGAUACACAGAUUGAAUUAUAUCAUUU
VIHRLNYIIFISLLVNTRSSPLHFSIKFFIFM
CTCCTTGTAAATACTAGAAGCTCTCCTTTACATTTC
UCAUAUCUCUCCUUGUAAAUACUAGAAGC
GFPIVTLVFMNICFSFAKAKNQ
TCTATCAAATTTTTCATCTTTATGGGTTTCCCAATT
UCUCCUUUACAUUUCUCUAUCAAAUUUUU
GTGACTCTTGTCTTCATGAATATATGTTTTTCATTT
CAUCUUUAUGGGUUUCCCAAUUGUGACUC
GCAAAAGCCAAAAATCAGTGA
UUGUCUUCAUGAAUAUAUGUUUUUCAUU
UGCAAAAGCCAAAAAUCAGUGA
21
85/384
ATGACAAAACTAGGGAAAAATAGCCTACACAAGC
AUGACAAAACUAGGGAAAAAUAGCCUACA
MTKLGKNSLHKPLGLLFLCLGLSEQRRF
CTTTAGGCCTACTCTTTCTGTGCTTGGGTTTGAGTG
CAAGCCUUUAGGCCUACUCUUUCUGUGCU
AACAAAGGAGATTTTAG
UGGGUUUGAGUGAACAAAGGAGAUUUUA
G
22
86/385
CTGCTCTACAAACAGAGTTGGTATGGTTGGTATAC
CUGCUCUACAAACAGAGUUGGUAUGGUUG
LLYKQSWYGWYTVLTCEGLATQTHLAGE
TGTACTCACCTGTGAGGGACTGGCCACTCAGACCC
GUAUACUGUACUCACCUGUGAGGGACUGG
LEDEDHSLEKSQGPSPNKLAVLDVDEE
ACTTAGCTGGTGAGCTAGAAGATGAGGATCACTC
CCACUCAGACCCACUUAGCUGGUGAGCUA
ACTGGAAAAGTCACAAGGACCATCTCCAAACAAG
GAAGAUGAGGAUCACUCACUGGAAAAGUC
TTGGCAGTGCTCGATGTGGACGAAGAGTGA
ACAAGGACCAUCUCCAAACAAGUUGGCAG
UGCUCGAUGUGGACGAAGAGUGA
23
87/386
CTGTGCTGGGCAGCAGACAGCTGCCAGGATCACG
CUGUGCUGGGCAGCAGACAGCUGCCAGGA
LCWAADSCQDHELCSQRKESCGSFSSRSL
AACTCTGTAGTCAAAGAAAAGAGTCGTGTGGCAG
UCACGAACUCUGUAGUCAAAGAAAAGAGU
GSSPRPSL
TTTCAGCTCTCGTTCATTGGGCAGCTCGCCTAGGC
CGUGUGGCAGUUUCAGCUCUCGUUCAUU
CCAGCCTCTGA
GGGCAGCUCGCCUAGGCCCAGCCUCUGA
24
88/387
CTGAGAAAGGGATATTTTGAAGGACTGTCATATA
CUGAGAAAGGGAUAUUUUGAAGGACUGU
LRKGYFEGLSYIFEKRKSVIHIFLCMFTGTK
TCTTTGAAAAAAGAAAATCTGTAATACATATATTTT
CAUAUAUCUUUGAAAAAAGAAAAUCUGUA
KYRELHSVLWVVAEVIVQVEK
TATGTATGTTCACTGGCACTAAAAAATATAGAGAG
AUACAUAUAUUUUUAUGUAUGUUCACUG
CTTCATTCTGTCCTTTGGGTAGTTGCTGAGGTAATT
GCACUAAAAAAUAUAGAGAGCUUCAUUCU
GTCCAGGTTGAAAAATAA
GUCCUUUGGGUAGUUGCUGAGGUAAUUG
UCCAGGUUGAAAAAUAA
25
89/388
CTGCCCCTTCCCAGCCCTGCACCAAAGCTGCATTT
CUGCCCCUUCCCAGCCCUGCACCAAAGCUG
LPLPSPAPKLHFRRLSPDSPVTTRAWPLH
CAGGAGACTCTCTCCAGACAGCCCAGTAACTACCC
CAUUUCAGGAGACUCUCUCCAGACAGCCCA
SPGKIRG
GAGCATGGCCCCTGCATAGCCCTGGAAAAATAAG
GUAACUACCCGAGCAUGGCCCCUGCAUAGC
AGGCTGA
CCUGGAAAAAUAAGAGGCUGA
26
90/389
CTGTCTACGAATTATCTTGTGCCAGTTGCCCAGGT
CUGUCUACGAAUUAUCUUGUGCCAGUUGC
LSTNYLVPVAQVRGHWAKGVVFMFDPL
GAGAGGGCACTGGGCCAAGGGAGTGGTTTTCAT
CCAGGUGAGAGGGCACUGGGCCAAGGGAG
QGVMGIRNAKAPDQIQNLKSK
GTTTGACCCACTACAAGGGGTCATGGGAATCAGG
UGGUUUUCAUGUUUGACCCACUACAAGGG
AATGCCAAAGCACCAGATCAAATCCAAAACTTAAA
GUCAUGGGAAUCAGGAAUGCCAAAGCACC
GTCAAAATAA
AGAUCAAAUCCAAAACUUAAAGUCAAAAU
AA
27
91/390
GTGTGTGTGTGTGTTCTGATAGCTTTAACTTTCTCT
GUGUGUGUGUGUGUUCUGAUAGCUUUAA
VCVCVLIALTFSASLYLVPDHT
GCATCTTTATATTTGGTTCCAGATCACACCTGA
CUUUCUCUGCAUCUUUAUAUUUGGUUCC
AGAUCACACCUGA
28
92/391
GTGAGAGAGGATGCAGTTTTGTTTTGGAAGCTCT
GUGAGAGAGGAUGCAGUUUUGUUUUGGA
VREDAVLFWKLSQNKQDTWIDQLTKSFL
CTCAGAACAAACAAGACACCTGGATTGATCAGTTA
AGCUCUCUCAGAACAAACAAGACACCUGGA
PYWV
ACTAAAAGTTTTCTCCCCTATTGGGTTTGA
UUGAUCAGUUAACUAAAAGUUUUCUCCCC
UAUUGGGUUUGA
29
93/392
GTGAAGGAGCAGAGGGATAAAAAGAGTAGAGGA
GUGAAGGAGCAGAGGGAUAAAAAGAGUAG
VKEQRDKKSRGHDTLYFTSSRQMNVESI
CATGATACATTGTACTTTACTAGTTCAAGACAGAT
AGGACAUGAUACAUUGUACUUUACUAGUU
KTQWN
GAATGTGGAAAGCATAAAAACTCAATGGAACTGA
CAAGACAGAUGAAUGUGGAAAGCAUAAAA
ACUCAAUGGAACUGA
30
94/393
CTGAGATTTACCACAGGGAAGGCCCAAACTTGGG
CUGAGAUUUACCACAGGGAAGGCCCAAAC
LRFTTGKAQTWGQKPTQVIDQWPPNGT
GCCAAAAGCCTACCCAAGTGATTGACCAGTGGCC
UUGGGGCCAAAAGCCUACCCAAGUGAUUG
CCCTAATGGGACCTGA
ACCAGUGGCCCCCUAAUGGGACCUGA
31
95/394
CTGTTCCTTGGTCTTCACCATCCTTGTGAGAGAAG
CUGUUCCUUGGUCUUCACCAUCCUUGUGA
LFLGLHHPCERRAVSCIGTWSKRSIFHTNS
GGCAGTTTCCTGCATTGGAACCTGGAGCAAGCGC
GAGAAGGGCAGUUUCCUGCAUUGGAACCULT
TCTATCTTTCACACAAATTCCCTCACCTGA
GGAGCAAGCGCUCUAUCUUUCACACAAAU
UCCCUCACCUGA
32
96/395
CTGATTCCAATTCAGTATAGCAAGGTGCTAGGTTT
CUGAUUCCAAUUCAGUAUAGCAAGGUGCU
LIPIQYSKVLGFFLSPPVS
TTTCCTTTCCCCACCTGTCTCTTAG
AGGUUUUUUCCUUUCCCCACCUGUCUCUU
AG
33
97/396
CTGGTGGTTCCCTCTCTGGCTGCTGCCTCACAGTA
CUGGUGGUUCCCUCUCUGGCUGCUGCCUC
LVVPSLAAASQYGNLYSAEVTGQICIISQP
TGGGAACCTGTACTCTGCAGAGGTGACAGGCCAG
ACAGUAUGGGAACCUGUACUCUGCAGAGG
ATTTGCATTATCTCACAACCTTAG
UGACAGGCCAGAUUUGCAUUAUCUCACAA
CCUUAG
34
98/397
TTGCTTGAAATCTCTTTAGATGGGGCTCATTTCTCA
UUGCUUGAAAUCUCUUUAGAUGGGGCUC
LLEISLDGAHFSRWHLASTGQQDQLQAL
CGGTGGCACTTGGCCTCCACTGGGCAGCAGGACC
AUUUCUCACGGUGGCACUUGGCCUCCACU
VFCSLFVILESFVALNTIKNGRNLFVGLHV
AGCTCCAAGCGCTAGTGTTCTGTTCTCTTTTTGTAA
GGGCAGCAGGACCAGCUCCAAGCGCUAGU
TCTTGGAATCTTTTGTTGCTCTAAATACAATTAAAA
GUUCUGUUCUCUUUUUGUAAUCUUGGAA
ATGGCAGAAACTTGTTTGTTGGACTACATGTGTGA
UCUUUUGUUGCUCUAAAUACAAUUAAAAA
UGGCAGAAACUUGUUUGUUGGACUACAU
GUGUGA
35
99/398
GTGAGCCTAGCAGGGCAGATCTTGTCCACCGTGT
GUGAGCCUAGCAGGGCAGAUCUUGUCCAC
VSLAGQILSTVCLLLHETLRLSERFLRGCSR
GTCTTCTTCTGCACGAGACTTTGAGGCTGTCAGAG
CGUGUGUCUUCUUCUGCACGAGACUUUGA
KFPSLELPAGGQLAAATTASSQPVELF
CGCTTTTTGCGTGGTTGCTCCCGCAAGTTTCCTTCT
GGCUGUCAGAGCGCUUUUUGCGUGGUUG
CTGGAGCTTCCCGCAGGTGGGCAGCTAGCTGCAG
CUCCCGCAAGUUUCCUUCUCUGGAGCUUC
CGACTACCGCATCATCACAGCCTGTTGAACTCTTC
CCGCAGGUGGGCAGCUAGCUGCAGCGACU
TGA
ACCGCAUCAUCACAGCCUGUUGAACUCUU
CUGA
36
100/399
GTGGAAGATTCAGCCAAGCTCAAGGATGGAAGTG
GUGGAAGAUUCAGCCAAGCUCAAGGAUGG
VEDSAKLKDGSAVRAGKGLPSAAVQDLP
CAGTTAGGGCTGGGAAGGGTCTACCCTCGGCCGC
AAGUGCAGUUAGGGCUGGGAAGGGUCUA
RSFPESVPERARSDPEPGPQAPRGRERST
CGTCCAAGACCTACCGAGGAGCTTTCCAGAATCTG
CCCUCGGCCGCCGUCCAAGACCUACCGAGG
SRRQFAAAAAAAAAAAAAAAAAAAAAA
TTCCAGAGCGTGCGCGAAGTGATCCAGAACCCGG
AGCUUUCCAGAAUCUGUUCCAGAGCGUGC
AAAARD
GCCCCAGGCACCCAGAGGCCGCGAGCGCAGCACC
GCGAAGUGAUCCAGAACCCGGGCCCCAGGC
TCCCGGCGCCAGTTTGCTGCTGCTGCAGCAGCAG
ACCCAGAGGCCGCGAGCGCAGCACCUCCCG
CAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAG
GCGCCAGUUUGCUGCUGCUGCAGCAGCAG
CAGCAGCAGCAGCAGCAGCAGCAGCAAGAGACT
CAGCAGCAGCAGCAGCAGCAGCAGCAGCAG
AG
CAGCAGCAGCAGCAGCAGCAGCAGCAGCAA
GAGACUAG
37
101/400
GTGCCACCCCGAGAGAGGTTGCGTCCCAGAGCCT
GUGCCACCCCGAGAGAGGUUGCGUCCCAG
VPPRERLRPRAWSRRGRQQGAAAAAAS
GGAGCCGCCGTGGCCGCCAGCAAGGGGCTGCCG
AGCCUGGAGCCGCCGUGGCCGCCAGCAAG
TSGRG
CAGCAGCTGCCAGCACCTCCGGACGAGGATGA
GGGCUGCCGCAGCAGCUGCCAGCACCUCCG
GACGAGGAUGA
38
102/401
TTGCATGTACGCCCCACTTTTGGGAGTTCCACCCG
UUGCAUGUACGCCCCACUUUUGGGAGUUC
LHVRPTFGSSTRCASHSLCPIGRMQRFSA
CTGTGCGTCCCACTCCTTGTGCCCCATTGGCCGAA
CACCCGCUGUGCGUCCCACUCCUUGUGCCC
RRQRRQEH
TGCAAAGGTTCTCTGCTAGACGACAGCGCAGGCA
CAUUGGCCGAAUGCAAAGGUUCUCUGCUA
AGAGCACTGA
GACGACAGCGCAGGCAAGAGCACUGA
39
103/402
CTGGGCGGCTGCGGCGGCGCAGTGCCGCTATGG
CUGGGCGGCUGCGGCGGCGCAGUGCCGCU
LGGCGGAVPLWGPGEPAWRGCSGTRF
GGACCTGGCGAGCCTGCATGGCGCGGGTGCAGC
AUGGGGACCUGGCGAGCCUGCAUGGCGCG
WVTLSRRFLILAHSLHSRRRPVVWTVW
GGGACCCGGTTCTGGGTCACCCTCAGCCGCCGCTT
GGUGCAGCGGGACCCGGUUCUGGGUCACC
WWWGWWRRRRRRRRRRRRRRRRRG
CCTCATCCTGGCACACTCTCTTCACAGCCGAAGAA
CUCAGCCGCCGCUUCCUCAUCCUGGCACAC
GSCSPLRLHSAPSGAGGPGKRLHRT
GGCCAGTTGTATGGACCGTGTGGTGGTGGTGGG
UCUCUUCACAGCCGAAGAAGGCCAGUUGU
GGTGGTGGCGGCGGCGGCGGCGGCGGCGGCGG
AUGGACCGUGUGGUGGUGGUGGGGGUGG
CGGCGGCGGCGGCGGCGGCGGCGGCGAGGCGG
UGGCGGCGGCGGCGGCGGCGGCGGCGGCG
GAGCTGTAGCCCCCTACGGCTACACTCGGCCCCCT
GCGGCGGCGGCGGCGGCGGCGGCGAGGCG
CAGGGGCTGGCGGGCCAGGAAAGCGACTTCACC
GGAGCUGUAGCCCCCUACGGCUACACUCG
GCACCTGA
GCCCCCUCAGGGGCUGGCGGGCCAGGAAA
GCGACUUCACCGCACCUGA
40
104/403
GTGGTACCCTGGCGGCATGGTGAGCAGAGTGCCC
GUGGUACCCUGGCGGCAUGGUGAGCAGAG
VVPWRHGEQSALSQSHLCQKRNGPLDG
TATCCCAGTCCCACTTGTGTCAAAAGCGAAATGGG
UGCCCUAUCCCAGUCCCACUUGUGUCAAA
CCCCTGGATGGATAG
AGCGAAAUGGGCCCCUGGAUGGAUAG
41
105/404
GTGTCACTATGGAGCTCTCACATGTGGAAGCTGC
GUGUCACUAUGGAGCUCUCACAUGUGGAA
VSLWSSHMWKLQGLLQKSR
AAGGTCTTCTTCAAAAGAGCCGCTGA
GCUGCAAGGUCUUCUUCAAAAGAGCCGCU
GA
42
106/405
GTGGGCCAAGGCCTTGCCTGGCTTCCGCAACTTAC
GUGGGCCAAGGCCUUGCCUGGCUUCCGCA
VGQGLAWLPQLTRGRPDGCHSVLLDGA
ACGTGGACGACCAGATGGCTGTCATTCAGTACTCC
ACUUACACGUGGACGACCAGAUGGCUGUC
HGVCHGLAILHQCQLQDALLRP
TGGATGGGGCTCATGGTGTTTGCCATGGGCTGGC
AUUCAGUACUCCUGGAUGGGGCUCAUGGU
GATCCTTCACCAATGTCAACTCCAGGATGCTCTAC
GUUUGCCAUGGGCUGGCGAUCCUUCACCA
TTCGCCCCTGA
AUGUCAACUCCAGGAUGCUCUACUUCGCC
CCUGA
43
107/406
GTGTGTCCGAATGAGGCACCTCTCTCAAGAGTTTG
GUGUGUCCGAAUGAGGCACCUCUCUCAAG
VCPNEAPLSRVWMAPNHPPGIPVHEST
GATGGCTCCAAATCACCCCCCAGGAATTCCTGTGC
AGUUUGGAUGGCUCCAAAUCACCCCCCAG
ATLQHYSSGWAEKSKIL
ATGAAAGCACTGCTACTCTTCAGCATTATTCCAGT
GAAUUCCUGUGCAUGAAAGCACUGCUACU
GGATGGGCTGAAAAATCAAAAATTCTTTGA
CUUCAGCAUUAUUCCAGUGGAUGGGCUGA
AAAAUCAAAAAUUCUUUGA
44
108/407
ATGCAAAAGAAAAAATCCCACATCCTGCTCAAGAC
AUGCAAAAGAAAAAAUCCCACAUCCUGCUC
MQKKKSHILLKTLLPAHQAPGLRAAYCER
GCTTCTACCAGCTCACCAAGCTCCTGGACTCCGTG
AAGACGCUUCUACCAGCUCACCAAGCUCCU
AASVHF
CAGCCTATTGCGAGAGAGCTGCATCAGTTCACTTT
GGACUCCGUGCAGCCUAUUGCGAGAGAGC
TGA
UGCAUCAGUUCACUUUUGA
45
109/408
GTGAAGCATTGGAAACCCTATTTCCCCACCCCAGC
GUGAAGCAUUGGAAACCCUAUUUCCCCAC
VKHWKPYFPTPAHAPFQMSSACYNSALL
TCATGCCCCCTTTCAGATGTCTTCTGCCTGTTATAA
CCCAGCUCAUGCCCCCUUUCAGAUGUCUUC
LCSALGNFLY
CTCTGCACTACTCCTCTGCAGTGCCTTGGGGAATT
UGCCUGUUAUAACUCUGCACUACUCCUCU
TCCTCTATTGA
GCAGUGCCUUGGGGAAUUUCCUCUAUUGA
46
110/409
TTGCTGGGCTTTTTTTTTCTCTTTCTCTCCTTTCTTTT
UUGCUGGGCUUUUUUUUUCUCUUUCUCU
LLGFFFLFLSFLFLLPSLSNPPMAPSDFASH
TCTTCTTCCCTCCCTATCTAACCCTCCCATGGCACC
CCUUUCUUUUUCUUCUUCCCUCCCUAUCU
CGSYLCFEWCCMPLNL
TTCAGACTTTGCTTCCCATTGTGGCTCCTATCTGTG
AACCCUCCCAUGGCACCUUCAGACUUUGCU
TTTTGAATGGTGTTGTATGCCTTTAAATCTGTGA
UCCCAUUGUGGCUCCUAUCUGUGUUUUG
AAUGGUGUUGUAUGCCUUUAAAUCUGUG
A
47
111/410
GTGTCAAGTTGTGCTTGTTTACAGCACTACTCTGT
GUGUCAAGUUGUGCUUGUUUACAGCACU
VSSCACLQHYSVPATQTFTYLMPREV
GCCAGCCACACAAACGTTTACTTATCTTATGCCAC
ACUCUGUGCCAGCCACACAAACGUUUACU
GGGAAGTTTAG
UAUCUUAUGCCACGGGAAGUUUAG
48
112/411
GTGAAAATTGCAGGCCCATGGGGAGTTACTGATT
GUGAAAAUUGCAGGCCCAUGGGGAGUUAC
VKIAGPWGVTDFFISSLHGRLYFLPMAIAI
TTTTCATCTCCTCCCTCCACGGGAGACTTTATTTTC
UGAUUUUUUCAUCUCCUCCCUCCACGGGA
RGQSDPRAELGRGVDREERTRRAMEHQ
TGCCAATGGCTATTGCCATTAGAGGGCAGAGTGA
GACUUUAUUUUCUGCCAAUGGCUAUUGCC
YLPTALVPGG
CCCCAGAGCTGAGTTGGGCAGGGGGGTGGACAG
AUUAGAGGGCAGAGUGACCCCAGAGCUGA
AGAGGAGAGGACAAGGAGGGCAATGGAGCATCA
GUUGGGCAGGGGGGUGGACAGAGAGGAG
GTACCTGCCCACAGCCTTGGTCCCTGGGGGCTAG
AGGACAAGGAGGGCAAUGGAGCAUCAGUA
CCUGCCCACAGCCUUGGUCCCUGGGGGCU
AG
49
113/412
ATGTTAATGCCTCACCCCCAAACCCTTTTCTCTCTC
AUGUUAAUGCCUCACCCCCAAACCCUUUUC
MLMPHPQTLFSLTLCLQLQIDFQ
ACTCTCTGCCTCCAACTTCAGATTGACTTTCAATAG
UCUCUCACUCUCUGCCUCCAACUUCAGAU
UGACUUUCAAUAG
50
114/413
ATGTCCTCCCTTCAGTGTTTTGTGGGCCTGAATTTC
AUGUCCUCCCUUCAGUGUUUUGUGGGCCU
MSSLQCFVGLNFITLHFSHGHQACLLLLG
ATCACACTGCATTTCAGCCATGGTCATCAAGCCTG
GAAUUUCAUCACACUGCAUUUCAGCCAUG
MFTDSLLRAPTTKKVSRPTALTSICRCQ
TTTGCTTCTTTTGGGCATGTTCACAGATTCTCTGTT
GUCAUCAAGCCUGUUUGCUUCUUUUGGG
AAGAGCCCCCACCACCAAGAAGGTTAGCAGGCCA
CAUGUUCACAGAUUCUCUGUUAAGAGCCC
ACAGCTCTGACATCTATCTGTAGATGCCAGTAG
CCACCACCAAGAAGGUUAGCAGGCCAACAG
CUCUGACAUCUAUCUGUAGAUGCCAGUAG
51
115/414
CTGGGCGTCTTGCCCTTGTCCCCCAGAGATGATAC
CUGGGCGUCUUGCCCUUGUCCCCCAGAGA
LGVLPLSPRDDTLPASGEVLTSFFRAAKG
CCTCCCAGCAAGTGGAGAAGTTCTCACTTCCTTCT
UGAUACCCUCCCAGCAAGUGGAGAAGUUC
ATQIRVEEKTQLPGWEE
TTAGAGCAGCTAAAGGGGCTACCCAGATCAGGGT
UCACUUCCUUCUUUAGAGCAGCUAAAGGG
TGAAGAGAAAACTCAATTACCAGGGTGGGAAGAA
GCUACCCAGAUCAGGGUUGAAGAGAAAAC
TGA
UCAAUUACCAGGGUGGGAAGAAUGA
52
116/415
ATGGCTTCTTTCGGAAAGGTCTGGTTGGTGTGGCT
AUGGCUUCUUUCGGAAAGGUCUGGUUGG
MASFGKVWLVWLQYFATHELRVCPGTL
CCAATACTTTGCCACCCATGAACTCAGGGTGTGCC
UGUGGCUCCAAUACUUUGCCACCCAUGAA
VLYSLLAHLCSVDFVLQAQVQGKMSTYFL
CTGGGACACTGGTTTTATATAGTCTTTTGGCACAC
CUCAGGGUGUGCCCUGGGACACUGGUUUU
ILASASLLSS
CTGTGTTCTGTTGACTTCGTTCTTCAAGCCCAAGTG
AUAUAGUCUUUUGGCACACCUGUGUUCU
CAAGGGAAAATGTCCACCTACTTTCTCATCTTGGC
GUUGACUUCGUUCUUCAAGCCCAAGUGCA
CTCTGCCTCCTTACTTAGCTCTTAA
AGGGAAAAUGUCCACCUACUUUCUCAUCU
UGGCCUCUGCCUCCUUACUUAGCUCUUAA
53
117/416
CTGTTGAACTCAAGAAATCAAGGGCCAGTCATCA
CUGUUGAACUCAAGAAAUCAAGGGCCAGU
LLNSRNQGPVIKLPILIDSLCLLRG
AGCTGCCCATTTTAATTGATTCACTCTGTTTGTTGA
CAUCAAGCUGCCCAUUUUAAUUGAUUCAC
GAGGATAG
UCUGUUUGUUGAGAGGAUAG
54
118/417
TTGGGGTGGGATAGACATGTTCTGGTTTTCTTTAT
UUGGGGUGGGAUAGACAUGUUCUGGUUU
LGWDRHVLVFFIITQSGSCTGSLLAVLNR
TATTACACAATCTGGCTCATGTACAGGATCACTTTT
UCUUUAUUAUUACACAAUCUGGCUCAUGU
KKYPPLFSVTLGYILIGPLHLFWNDFHLL
AGCTGTTTTAAACAGAAAAAAATATCCACCACTCT
ACAGGAUCACUUUUAGCUGUUUUAAACAG
TTTCAGTTACACTAGGTTACATTTTAATAGGTCCTT
AAAAAAAUAUCCACCACUCUUUUCAGUUA
TACATCTGTTTTGGAATGATTTTCATCTTTTGTGA
CACUAGGUUACAUUUUAAUAGGUCCUUUA
CAUCUGUUUUGGAAUGAUUUUCAUCUUU
UGUGA
55
119/418
TTGGCCATTGATGTTCTAGCCAATGTAATTGACAG
UUGGCCAUUGAUGUUCUAGCCAAUGUAAU
LAIDVLANVIDRSLILHALCSTNRVGMVGI
AAGTCTCATTTTGCATGCGCTCTGCTCTACAAACA
UGACAGAAGUCUCAUUUUGCAUGCGCUCU
LYSPVRDWPLRPT
GAGTTGGTATGGTTGGTATACTGTACTCACCTGTG
GCUCUACAAACAGAGUUGGUAUGGUUGG
AGGGACTGGCCACTCAGACCCACTTAG
UAUACUGUACUCACCUGUGAGGGACUGGC
CACUCAGACCCACUUAG
56
120/419
TTGTCCAGGTTGAAAAATAATGTGCTGATGCTAGA
UUGUCCAGGUUGAAAAAUAAUGUGCUGA
LSRLKNNVLMLESLSVHTLLLNTYRHT
GTCCCTCTCTGTCCATACTCTACTTCTAAATACATA
UGCUAGAGUCCCUCUCUGUCCAUACUCUA
TAGGCATACATAG
CUUCUAAAUACAUAUAGGCAUACAUAG
57
121/420
CTGGCAGGGCATAAAGGCCCAGGCCACTTCCTCT
CUGGCAGGGCAUAAAGGCCCAGGCCACUU
LAGHKGPGHFLCPFPALHQSCISGDSLQT
GCCCCTTCCCAGCCCTGCACCAAAGCTGCATTTCA
CCUCUGCCCCUUCCCAGCCCUGCACCAAAG
AQ
GGAGACTCTCTCCAGACAGCCCAGTAA
CUGCAUUUCAGGAGACUCUCUCCAGACAG
CCCAGUAA
58
122/421
TTGACCCACTACAAGGGGTCATGGGAATCAGGAA
UUGACCCACUACAAGGGGUCAUGGGAAUC
LTHYKGSWESGMPKHQIKSKT
TGCCAAAGCACCAGATCAAATCCAAAACTTAA
AGGAAUGCCAAAGCACCAGAUCAAAUCCAA
AACUUAA
59
123/422
ATGTTCAGTTTCTTGGAAAAGGAAGTTTCTACCCC
AUGUUCAGUUUCUUGGAAAAGGAAGUUU
MFSFLEKEVSTPDAFVGRSVLTINLFENLL
TGATGCCTTTGTAGGCAGATCTGTTCTCACCATTA
CUACCCCUGAUGCCUUUGUAGGCAGAUCU
KQFLKREMKASHYITKDYIVPAKIPKFIRA
ATCTTTTTGAAAATCTTTTAAAGCAGTTTTTAAAAA
GUUCUCACCAUUAAUCUUUUUGAAAAUCU
GGEQALVPL
GAGAGATGAAAGCATCACATTATATAACCAAAGA
UUUAAAGCAGUUUUUAAAAAGAGAGAUG
TTACATTGTACCTGCTAAGATACCAAAATTCATAA
AAAGCAUCACAUUAUAUAACCAAAGAUUA
GGGCAGGGGGGGAGCAAGCATTAGTGCCTCTTTG
CAUUGUACCUGCUAAGAUACCAAAAUUCA
A
UAAGGGCAGGGGGGGAGCAAGCAUUAGU
GCCUCUUUGA
60
124/423
TTGCTGAGCAAATTGTTGAAGCTCCATCATTGCAT
UUGCUGAGCAAAUUGUUGAAGCUCCAUCA
LLSKLLKLHHCMVGNGAVLSHCVC
GGTTGGAAATGGAGCTGTTCTTAGCCACTGTGTTT
UUGCAUGGUUGGAAAUGGAGCUGUUCUU
GCTAG
AGCCACUGUGUUUGCUAG
61
125/424
TTGAAATACATTGTAAATGAATATTTGTATCCATG
UUGAAAUACAUUGUAAAUGAAUAUUUGU
LKYIVNEYLYPCFKIEIYIYIYIYIYIYI
TTTCAAAATTGAAATATATATATATATATATATATA
AUCCAUGUUUCAAAAUUGAAAUAUAUAUA
TATATATATATATATATAG
UAUAUAUAUAUAUAUAUAUAUAUAUAUA
UAUAG
62
126/425
ATGCAGTTTTGTTTTGGAAGCTCTCTCAGAACAAA
AUGCAGUUUUGUUUUGGAAGCUCUCUCA
MQFCFGSSLRTNKTPGLIS
CAAGACACCTGGATTGATCAGTTAA
GAACAAACAAGACACCUGGAUUGAUCAGU
UAA
63
127/426
TTGACCAGTGGCCCCCTAATGGGACCTGAGCTGTT
UUGACCAGUGGCCCCCUAAUGGGACCUGA
LTSGPLMGPELLEEENCSLVFTILVREGQF
GGAAGAAGAGAACTGTTCCTTGGTCTTCACCATCC
GCUGUUGGAAGAAGAGAACUGUUCCUUG
PALEPGASALSFTQIPSPEIEVLLLLGVCVL
TTGTGAGAGAAGGGCAGTTTCCTGCATTGGAACC
GUCUUCACCAUCCUUGUGAGAGAAGGGCA
TGGAGCAAGCGCTCTATCTTTCACACAAATTCCCT
GUUUCCUGCAUUGGAACCUGGAGCAAGCG
CACCTGAGATTGAGGTGCTCTTGTTACTGGGTGTC
CUCUAUCUUUCACACAAAUUCCCUCACCUG
TGTGTGCTGTAA
AGAUUGAGGUGCUCUUGUUACUGGGUGU
CUGUGUGCUGUAA
64
128/427
TTGACAAATGAAAATGTGTTTTTCTCTGTTAAAACT
UUGACAAAUGAAAAUGUGUUUUUCUCUG
LTNENVFFSVKTCQSTRSCISVGAGPFLPT
TGTCAGAGTACTAGAAGTTGTATCTCTGTAGGTGC
UUAAAACUUGUCAGAGUACUAGAAGUUG
GRVFFFD
AGGTCCATTTCTGCCCACAGGTAGGGTGTTTTTCT
UAUCUCUGUAGGUGCAGGUCCAUUUCUGC
TTGATTAA
CCACAGGUAGGGUGUUUUUCUUUGAUUA
A
65
129/428
ATGAGAAGCCTTAGAATGGGTGGCCCTTGTGACC
AUGAGAAGCCUUAGAAUGGGUGGCCCUUG
MRSLRMGGPCDLKHFPHKLLNKIVMEL
TGAAACACTTCCCACATAAGCTACTTAACAAGATT
UGACCUGAAACACUUCCCACAUAAGCUACU
QIPLPTKD
GTCATGGAGCTGCAGATTCCATTGCCCACCAAAGA
UAACAAGAUUGUCAUGGAGCUGCAGAUUC
CTAG
CAUUGCCCACCAAAGACUAG
66
130/429
ATGCTGTTTCTCTGGTGGTTCCCTCTCTGGCTGCTG
AUGCUGUUUCUCUGGUGGUUCCCUCUCU
MLFLWWFPLWLLPHSMGTCTLQR
CCTCACAGTATGGGAACCTGTACTCTGCAGAGGT
GGCUGCUGCCUCACAGUAUGGGAACCUGU
GA
ACUCUGCAGAGGUGA
67
131/430
CTGTCCTACAGTGAAGTGCCTGGGGGGTTGTCCT
CUGUCCUACAGUGAAGUGCCUGGGGGGU
LSYSEVPGGLSYPISHLDADSSHHQNDPR
ATCCCATAAGCCACTTGGATGCTGACAGCAGCCAC
UGUCCUAUCCCAUAAGCCACUUGGAUGCU
KKKKKKIKKSPHNPVTPFCFPLDWNID
CATCAGAATGACCCACGCAAAAAAAAGAAAAAAA
GACAGCAGCCACCAUCAGAAUGACCCACGC
AAATTAAAAAGTCCCCTCACAACCCAGTGACACCT
AAAAAAAAGAAAAAAAAAAUUAAAAAGUCC
TTCTGCTTTCCTCTAGACTGGAACATTGATTAG
CCUCACAACCCAGUGACACCUUUCUGCUUU
CCUCUAGACUGGAACAUUGAUUAG
68
132/431
ATGGGGCTCATTTCTCACGGTGGCACTTGGCCTCC
AUGGGGCUCAUUUCUCACGGUGGCACUUG
MGLISHGGTWPPLGSRTSSKR
ACTGGGCAGCAGGACCAGCTCCAAGCGCTAG
GCCUCCACUGGGCAGCAGGACCAGCUCCAA
GCGCUAG
69
133/432
ATGGCAGAAACTTGTTTGTTGGACTACATGTGTGA
AUGGCAGAAACUUGUUUGUUGGACUACA
MAETCLLDYMCDFGSVSASAFRNVIHCV
CTTTGGGTCTGTCTCTGCCTCTGCTTTCAGAAATGT
UGUGUGACUUUGGGUCUGUCUCUGCCUC
KYWLTGLPAKTWPHPLLWLQDRVIVNKE
CATCCATTGTGTAAAATATTGGCTTACTGGTCTGC
UGCUUUCAGAAAUGUCAUCCAUUGUGUAA
TQEKLLMSSYHCC
CAGCTAAAACTTGGCCACATCCCCTGTTATGGCTG
AAUAUUGGCUUACUGGUCUGCCAGCUAAA
CAGGATCGAGTTATTGTTAACAAAGAGACCCAAG
ACUUGGCCACAUCCCCUGUUAUGGCUGCA
AAAAGCTGCTAATGTCCTCTTATCATTGTTGTTAA
GGAUCGAGUUAUUGUUAACAAAGAGACCC
AAGAAAAGCUGCUAAUGUCCUCUUAUCAU
UGUUGUUAA
70
134/433
CTGTTTTCCCCCACTCTCTCTCCACCTCCTCCTGCCT
CUGUUUUCCCCCACUCUCUCUCCACCUCCU
LFSPTLSPPPPAFPTPSAEPEIKR
TCCCCACCCCGAGTGCGGAGCCAGAGATCAAAAG
CCUGCCUUCCCCACCCCGAGUGCGGAGCCA
ATGA
GAGAUCAAAAGAUGA
71
135/434
GTGGACACTGAATTTGGAAGGTGGAGGATTTTGT
GUGGACACUGAAUUUGGAAGGUGGAGGA
VDTEFGRWRILFFSFKIWASFESTLQVLR
TTTTTTCTTTTAAGATCTGGGCATCTTTTGAATCTA
UUUUGUUUUUUUCUUUUAAGAUCUGGGC
DRL
CCCTTCAAGTATTAAGAGACAGACTGTGA
AUCUUUUGAAUCUACCCUUCAAGUAUUAA
GAGACAGACUGUGA
72
136/435
ATGGCCCAGTGTCAAGTTGTGCTTGTTTACAGCAC
AUGGCCCAGUGUCAAGUUGUGCUUGUUU
MAQCQVVLVYSTTLCQPHKRLLILCHGKF
TACTCTGTGCCAGCCACACAAACGTTTACTTATCTT
ACAGCACUACUCUGUGCCAGCCACACAAAC
RELRLSGEIKTKTSKQKKKAKTKQKISQKT
ATGCCACGGGAAGTTTAGAGAGCTAAGATTATCT
GUUUACUUAUCUUAUGCCACGGGAAGUU
LLVFFPQK
GGGGAAATCAAAACAAAAACAAGCAAACAAAAA
UAGAGAGCUAAGAUUAUCUGGGGAAAUCA
AAAAAAGCAAAAACAAAACAAAAAATAAGCCAAA
AAACAAAAACAAGCAAACAAAAAAAAAAAG
AAACCTTGCTAGTGTTTTTTCCTCAAAAATAA
CAAAAACAAAACAAAAAAUAAGCCAAAAAA
CCUUGCUAGUGUUUUUUCCUCAAAAAUAA
73
137/436
TTGACTTTCAATAGTTTTTCTAAGACCTTTGAACTG
UUGACUUUCAAUAGUUUUUCUAAGACCU
LTFNSFSKTFELNVLFSQNLATSTEKSDH
AATGTTCTCTTCAGCCAAAACTTGGCGACTTCCAC
UUGAACUGAAUGUUCUCUUCAGCCAAAAC
AGAAAAGTCTGACCACTGA
UUGGCGACUUCCACAGAAAAGUCUGACCA
CUGA
74
138/437
ATGAAGTTTTCTGCCAAACTCCGTGAAGCCACAAG
AUGAAGUUUUCUGCCAAACUCCGUGAAGC
MKFSAKLREATSTLCPPFSVLWA
CACCTTATGTCCTCCCTTCAGTGTTTTGTGGGCCTG
CACAAGCACCUUAUGUCCUCCCUUCAGUG
A
UUUUGUGGGCCUGA
75
139/438
CTGCTGGAGAAAACTAAAGCTGACAGGTTCCCTTT
CUGCUGGAGAAAACUAAAGCUGACAGGUU
LLEKTKADRFPFWGGIDMFWFSLLLHNL
TTGGGGTGGGATAGACATGTTCTGGTTTTCTTTAT
CCCUUUUUGGGGUGGGAUAGACAUGUUC
AHVQDHF
TATTACACAATCTGGCTCATGTACAGGATCACTTTT
UGGUUUUCUUUAUUAUUACACAAUCUGG
AG
CUCAUGUACAGGAUCACUUUUAG
76
140/439
ATGATTTTCATCTTTTGTGATACACAGATTGAATTA
AUGAUUUUCAUCUUUUGUGAUACACAGA
MIFIFCDTQIELYHFHISPCKY
TATCATTTTCATATCTCTCCTTGTAAATACTAG
UUGAAUUAUAUCAUUUUCAUAUCUCUCCU
UGUAAAUACUAG
77
141/440
TTGACAGAAGTCTCATTTTGCATGCGCTCTGCTCT
UUGACAGAAGUCUCAUUUUGCAUGCGCUC
LTEVSFCMRSALQTELVWLVYCTHL
ACAAACAGAGTTGGTATGGTTGGTATACTGTACTC
UGCUCUACAAACAGAGUUGGUAUGGUUG
ACCTGTGA
GUAUACUGUACUCACCUGUGA
78
142/441
GTGAGGAAGAGAAAAAGAAGGAGCACCAGGGA
GUGAGGAAGAGAAAAAGAAGGAGCACCAG
VRKRKRRSTREKAPSVLGSRQLPGSRTL
GAAGGCTCCGTCTGTGCTGGGCAGCAGACAGCTG
GGAGAAGGCUCCGUCUGUGCUGGGCAGCA
CCAGGATCACGAACTCTGTAG
GACAGCUGCCAGGAUCACGAACUCUGUAG
79
143/442
CTGACTGTCTACGAATTATCTTGTGCCAGTTGCCC
CUGACUGUCUACGAAUUAUCUUGUGCCAG
LTVYELSCASCPGERALGQGSGFHV
AGGTGAGAGGGCACTGGGCCAAGGGAGTGGTTT
UUGCCCAGGUGAGAGGGCACUGGGCCAAG
TCATGTTTGA
GGAGUGGUUUUCAUGUUUGA
80
144/443
TTGTGGGTTTTTTTTTCCCTAATAATATACATGTTT
UUGUGGGUUUUUUUUUCCCUAAUAAUAU
LWVFFSLIIYMFRRIENNFGKMGLWVLH
AGAAGAATTGAAAATAATTTCGGGAAAATGGGAT
ACAUGUUUAGAAGAAUUGAAAAUAAUUUC
TATGGGTCCTTCACTAA
GGGAAAAUGGGAUUAUGGGUCCUUCACU
AA
81
145/444
TTGGAAATGGAGCTGTTCTTAGCCACTGTGTTTGC
UUGGAAAUGGAGCUGUUCUUAGCCACUG
LEMELFLATVFASAHVSLSEDVKPLLIREH
TAGTGCCCATGTTAGCTTATCTGAAGATGTGAAAC
UGUUUGCUAGUGCCCAUGUUAGCUUAUC
LKY
CCTTGCTGATAAGGGAGCATTTAAAGTACTAG
UGAAGAUGUGAAACCCUUGCUGAUAAGGG
AGCAUUUAAAGUACUAG
82
146/445
ATGAATATTTGTATCCATGTTTCAAAATTGAAATAT
AUGAAUAUUUGUAUCCAUGUUUCAAAAU
MNICIHVSKLKYIYIYIYIYIYIYSVCVCSDSF
ATATATATATATATATATATATATATATATATATAT
UGAAAUAUAUAUAUAUAUAUAUAUAUAU
NFLCIFIFGSRSHLMPCTCERGCSFVLEAL
AGTGTGTGTGTGTGTTCTGATAGCTTTAACTTTCTC
AUAUAUAUAUAUAUAUAGUGUGUGUGUG
SEQTRHLD
TGCATCTTTATATTTGGTTCCAGATCACACCTGATG
UGUUCUGAUAGCUUUAACUUUCUCUGCA
CCATGTACTTGTGAGAGAGGATGCAGTTTTGTTTT
UCUUUAUAUUUGGUUCCAGAUCACACCUG
GGAAGCTCTCTCAGAACAAACAAGACACCTGGAT
AUGCCAUGUACUUGUGAGAGAGGAUGCAG
TGA
UUUUGUUUUGGAAGCUCUCUCAGAACAAA
CAAGACACCUGGAUUGA
83
147/446
ATGGAACTGACTGAGATTTACCACAGGGAAGGCC
AUGGAACUGACUGAGAUUUACCACAGGGA
MELTEIYHREGPNLGPKAYPSD
CAAACTTGGGGCCAAAAGCCTACCCAAGTGATTG
AGGCCCAAACUUGGGGCCAAAAGCCUACCC
A
AAGUGAUUGA
84
148/447
CTGCATTGGAACCTGGAGCAAGCGCTCTATCTTTC
CUGCAUUGGAACCUGGAGCAAGCGCUCUA
LHWNLEQALYLSHKFPHLRLRCSCYWVS
ACACAAATTCCCTCACCTGAGATTGAGGTGCTCTT
UCUUUCACACAAAUUCCCUCACCUGAGAU
VCCNSGFGYVL
GTTACTGGGTGTCTGTGTGCTGTAATTCTGGTTTT
UGAGGUGCUCUUGUUACUGGGUGUCUGU
GGATATGTTCTGTAA
GUGCUGUAAUUCUGGUUUUGGAUAUGUU
CUGUAA
85
149/448
ATGAAAATGTGTTTTTCTCTGTTAAAACTTGTCAG
AUGAAAAUGUGUUUUUCUCUGUUAAAAC
MKMCFSLLKLVRVLEVVSL
AGTACTAGAAGTTGTATCTCTGTAG
UUGUCAGAGUACUAGAAGUUGUAUCUCU
GUAG
86
150/449
GTGCAGGTCCATTTCTGCCCACAGGTAGGGTGTTT
GUGCAGGUCCAUUUCUGCCCACAGGUAGG
VQVHFCPQVGCFSLIKRLTLLLPRTSQLN
TTCTTTGATTAAGAGATTGACACTTCTGTTGCCTAG
GUGUUUUUCUUUGAUUAAGAGAUUGACA
HF
GACCTCCCAACTCAACCATTTCTAG
CUUCUGUUGCCUAGGACCUCCCAACUCAAC
CAUUUCUAG
87
151/450
TTGTCATGGAGCTGCAGATTCCATTGCCCACCAAA
UUGUCAUGGAGCUGCAGAUUCCAUUGCCC
LSWSCRFHCPPKTRTHTYPYTKGKTILKCC
GACTAGAACACACACATATCCATACACCAAAGGA
ACCAAAGACUAGAACACACACAUAUCCAUA
FSGGSLSGCCLTVWEPVLCRGDRPDLHYL
AAGACAATTCTGAAATGCTGTTTCTCTGGTGGTTC
CACCAAAGGAAAGACAAUUCUGAAAUGCU
TTLALGANCPTVKCLGGCPIP
CCTCTCTGGCTGCTGCCTCACAGTATGGGAACCTG
GUUUCUCUGGUGGUUCCCUCUCUGGCUGC
TACTCTGCAGAGGTGACAGGCCAGATTTGCATTAT
UGCCUCACAGUAUGGGAACCUGUACUCUG
CTCACAACCTTAGCCCTTGGTGCTAACTGTCCTAC
CAGAGGUGACAGGCCAGAUUUGCAUUAUC
AGTGAAGTGCCTGGGGGGTTGTCCTATCCCATAA
UCACAACCUUAGCCCUUGGUGCUAACUGU
CCUACAGUGAAGUGCCUGGGGGGUUGUCC
UAUCCCAUAA
88
152/451
ATGCTGACAGCAGCCACCATCAGAATGACCCACG
AUGCUGACAGCAGCCACCAUCAGAAUGACC
MLTAATIRMTHAKKRKKKLKSPLTTQ
CAAAAAAAAGAAAAAAAAAATTAAAAAGTCCCCT
CACGCAAAAAAAAGAAAAAAAAAAUUAAAA
CACAACCCAGTGA
AGUCCCCUCACAACCCAGUGA
89
153/452
TTGATTAGGGAGTGCCTCAGACATGACATTCTTGT
UUGAUUAGGGAGUGCCUCAGACAUGACAU
LIRECLRHDILVLSLELIWQQEGADYVNR
GCTGTCCTTGGAATTAATCTGGCAGCAGGAGGGA
UCUUGUGCUGUCCUUGGAAUUAAUCUGG
DKN
GCAGACTATGTAAACAGAGATAAAAATTAA
CAGCAGGAGGGAGCAGACUAUGUAAACAG
AGAUAAAAAUUAA
90
154/453
TTGAAGGAAAAAAGAAATAAGAAGAGAGAGAGA
UUGAAGGAAAAAAGAAAUAAGAAGAGAGA
LKEKRNKKRERKKASHKDFLKRNNFA
AAGAAAGCATCACACAAAGATTTTCTTAAAAGAA
GAGAAAGAAAGCAUCACACAAAGAUUUUC
ACAATTTTGCTTGA
UUAAAAGAAACAAUUUUGCUUGA
91
155/454
GTGGCACTTGGCCTCCACTGGGCAGCAGGACCAG
GUGGCACUUGGCCUCCACUGGGCAGCAGG
VALGLHWAAGPAPSASVLFSFCNLGIFCC
CTCCAAGCGCTAGTGTTCTGTTCTCTTTTTGTAATC
ACCAGCUCCAAGCGCUAGUGUUCUGUUCU
SKYN
TTGGAATCTTTTGTTGCTCTAAATACAATTAA
CUUUUUGUAAUCUUGGAAUCUUUUGUUG
CUCUAAAUACAAUUAA
92
156/455
CTGGTCTGCCAGCTAAAACTTGGCCACATCCCCTG
CUGGUCUGCCAGCUAAAACUUGGCCACAU
LVCQLKLGHIPCYGCRIELLLTKRPKKSC
TTATGGCTGCAGGATCGAGTTATTGTTAACAAAGA
CCCCUGUUAUGGCUGCAGGAUCGAGUUAU
GACCCAAGAAAAGCTGCTAA
UGUUAACAAAGAGACCCAAGAAAAGCUGC
UAA
93
157/456
GTGTCACTATGGAGCTCTCACATGTGGAAGCTGC
GUGUCACUAUGGAGCUCUCACAUGUGGAA
VSLWSSHMWKLQGLLQKSR
AAGGTCTTCTTCAAAAGAGCCGCTGA
GCUGCAAGGUCUUCUUCAAAAGAGCCGCU
GA
94
158/457
CTGCCAGGGACCATGTTTTGCCCATTGACTATTAC
CUGCCAGGGACCAUGUUUUGCCCAUUGAC
LPGTMFCPLTITFHPRRPA
TTTCCACCCCAGAAGACCTGCCTGA
UAUUACUUUCCACCCCAGAAGACCUGCCU
GA
95
159/458
GTGGAGATGAAGCTTCTGGGTGTCACTATGGAGC
GUGGAGAUGAAGCUUCUGGGUGUCACUA
VEMKLLGVTMELSHVEAARSSSKEPLK
TCTCACATGTGGAAGCTGCAAGGTCTTCTTCAAAA
UGGAGCUCUCACAUGUGGAAGCUGCAAGG
GAGCCGCTGAAG
UCUUCUUCAAAAGAGCCGCUGAAG
96
160/459
TTGGAGACTGCCAGGGACCATGTTTTGCCCATTGA
UUGGAGACUGCCAGGGACCAUGUUUUGCC
LETARDHVLPIDYYFPPQKTCLICGDEASG
CTATTACTTTCCACCCCAGAAGACCTGCCTGATCT
CAUUGACUAUUACUUUCCACCCCAGAAGA
CHYGALTCGSCKVFFKRAAEVWRLPGT
GTGGAGATGAAGCTTCTGGGTGTCACTATGGAGC
CCUGCCUGAUCUGUGGAGAUGAAGCUUCU
MFCPLTITFHPRRPA
TCTCACATGTGGAAGCTGCAAGGTCTTCTTCAAAA
GGGUGUCACUAUGGAGCUCUCACAUGUGG
GAGCCGCTGAAGTTTGGAGACTGCCAGGGACCAT
AAGCUGCAAGGUCUUCUUCAAAAGAGCCG
GTTTTGCCCATTGACTATTACTTTCCACCCCAGAAG
CUGAAGUUUGGAGACUGCCAGGGACCAUG
ACCTGCCTGA
UUUUGCCCAUUGACUAUUACUUUCCACCC
CAGAAGACCUGCCUGA
97
161/460
GTGGAGATGAAGCTTCTGGGTGTCACTATGGAGC
GUGGAGAUGAAGCUUCUGGGUGUCACUA
VEMKLLGVTMELSHVEAARSSSKEPLKFG
TCTCACATGTGGAAGCTGCAAGGTCTTCTTCAAAA
UGGAGCUCUCACAUGUGGAAGCUGCAAGG
DCQGPCFAH
GAGCCGCTGAAGTTTGGAGACTGCCAGGGACCAT
UCUUCUUCAAAAGAGCCGCUGAAGUUUGG
GTTTTGCCCATTGA
AGACUGCCAGGGACCAUGUUUUGCCCAUU
GA
98 (BSS)
162/461
CTGTGCGCCAGCAGAAATGATTGCACTATTGATAA
CUGUGCGCCAGCAGAAAUGAUUGCACUAU
LCASRNDCTIDKFRRKNCPSCRLRKCYEA
ATTCCGAAGGAAAAATTGTCCATCTTGTCGTCTTC
UGAUAAAUUCCGAAGGAAAAAUUGUCCAU
GMTLGVWRLPGTMFCPLTITFHPRRPA
GGAAATGTTATGAAGCAGGGATGACTCTGGGAGT
CUUGUCGUCUUCGGAAAUGUUAUGAAGCA
TTGGAGACTGCCAGGGACCATGTTTTGCCCATTGA
GGGAUGACUCUGGGAGUUUGGAGACUGC
CTATTACTTTCCACCCCAGAAGACCTGCCTGA
CAGGGACCAUGUUUUGCCCAUUGACUAUU
ACUUUCCACCCCAGAAGACCUGCCUGA
99 (BSS)
163/462
ATGCACAAGTCCCGGATGTACAGCCAGTGTGTCC
AUGCACAAGUCCCGGAUGUACAGCCAGUG
MHKSRMYSQCVRMRHLSQEFGWLQITP
GAATGAGGCACCTCTCTCAAGAGTTTGGATGGCT
UGUCCGAAUGAGGCACCUCUCUCAAGAGU
QEFLCMKALLLFSISFRNLHVDDQMAVIQ
CCAAATCACCCCCCAGGAATTCCTGTGCATGAAAG
UUGGAUGGCUCCAAAUCACCCCCCAGGAA
YSWMGLMVFAMGWRSFTNVNSRMLY
CACTGCTACTCTTCAGCATTAGCTTCCGCAACTTAC
UUCCUGUGCAUGAAAGCACUGCUACUCUU
FAPDLVFN
ACGTGGACGACCAGATGGCTGTCATTCAGTACTCC
CAGCAUUAGCUUCCGCAACUUACACGUGG
TGGATGGGGCTCATGGTGTTTGCCATGGGCTGGC
ACGACCAGAUGGCUGUCAUUCAGUACUCC
GATCCTTCACCAATGTCAACTCCAGGATGCTCTAC
UGGAUGGGGCUCAUGGUGUUUGCCAUGG
TTCGCCCCTGATCTGGTTTTCAAT
GCUGGCGAUCCUUCACCAAUGUCAACUCCA
GGAUGCUCUACUUCGCCCCUGAUCUGGUU
UUCAAU
100 (BSS)
164/463
TTGCATGCAAAAGAAAAAATCCCACATCCTGCTCA
UUGCAUGCAAAAGAAAAAAUCCCACAUCCU
LHAKEKIPHPAQDASTSSPSSWTPCSLLP
AGACGCTTCTACCAGCTCACCAAGCTCCTGGACTC
GCUCAAGACGCUUCUACCAGCUCACCAAGC
QLTRGRPDGCHSVLLDGAHGVCHGLAIL
CGTGCAGCCTGCTTCCGCAACTTACACGTGGACGA
UCCUGGACUCCGUGCAGCCUGCUUCCGCA
HQCQLQDALLRP
CCAGATGGCTGTCATTCAGTACTCCTGGATGGGG
ACUUACACGUGGACGACCAGAUGGCUGUC
CTCATGGTGTTTGCCATGGGCTGGCGATCCTTCAC
AUUCAGUACUCCUGGAUGGGGCUCAUGGU
CAATGTCAACTCCAGGATGCTCTACTTCGCCCCTG
GUUUGCCAUGGGCUGGCGAUCCUUCACCA
A
AUGUCAACUCCAGGAUGCUCUACUUCGCC
CCUGA
101 (BSS)
165/464
ATGCAAAAGAAAAAATCCCACATCCTGCTCAAGAC
AUGCAAAAGAAAAAAUCCCACAUCCUGCUC
MQKKKSHILLKTLLPAHQAPGLRAACFRN
GCTTCTACCAGCTCACCAAGCTCCTGGACTCCGTG
AAGACGCUUCUACCAGCUCACCAAGCUCCU
LHVDDQMAVIQYSWMGLMVFAMGW
CAGCCTGCTTCCGCAACTTACACGTGGACGACCAG
GGACUCCGUGCAGCCUGCUUCCGCAACUU
RSFTNVNSRMLYFAPDLVFN
ATGGCTGTCATTCAGTACTCCTGGATGGGGCTCAT
ACACGUGGACGACCAGAUGGCUGUCAUUC
GGTGTTTGCCATGGGCTGGCGATCCTTCACCAATG
AGUACUCCUGGAUGGGGCUCAUGGUGUU
TCAACTCCAGGATGCTCTACTTCGCCCCTGATCTG
UGCCAUGGGCUGGCGAUCCUUCACCAAUG
GTTTTCAAT
UCAACUCCAGGAUGCUCUACUUCGCCCCU
GAUCUGGUUUUCAAU
102 (BSS)
166/465
GTGGATGGGCTGAAAAATCAAAAATTCTTTGATG
GUGGAUGGGCUGAAAAAUCAAAAAUUCUU
VDGLKNQKFFDELRMNYIKELDRIIACKR
AACTTCGAATGAACTACATCAAGGAACTCGATCGT
UGAUGAACUUCGAAUGAACUACAUCAAGG
KNPTSCSRRFYQLTKLLDSVQPASATYTW
ATCATTGCATGCAAAAGAAAAAATCCCACATCCTG
AACUCGAUCGUAUCAUUGCAUGCAAAAGA
TTRWLSFSTPGWGSWCLPWAGDPSPM
CTCAAGACGCTTCTACCAGCTCACCAAGCTCCTGG
AAAAAUCCCACAUCCUGCUCAAGACGCUUC
STPGCSTSPLIWFSM
ACTCCGTGCAGCCTGCTTCCGCAACTTACACGTGG
UACCAGCUCACCAAGCUCCUGGACUCCGUG
ACGACCAGATGGCTGTCATTCAGTACTCCTGGATG
CAGCCUGCUUCCGCAACUUACACGUGGAC
GGGCTCATGGTGTTTGCCATGGGCTGGCGATCCTT
GACCAGAUGGCUGUCAUUCAGUACUCCUG
CACCAATGTCAACTCCAGGATGCTCTACTTCGCCC
GAUGGGGCUCAUGGUGUUUGCCAUGGGC
CTGATCTGGTTTTCAATG
UGGCGAUCCUUCACCAAUGUCAACUCCAG
GAUGCUCUACUUCGCCCCUGAUCUGGUUU
UCAAUG
FOLH1
neo-
antigen
DNA sequence
RNA sequence
Protein Sequence
1
167/466
GTGGTATGTCACTCAGAAAGAATCGTAATGGGTA
GUGGUAUGUCACUCAGAAAGAAUCGUAAU
VVCHSERIVMGILINFKIGIFEIKLNIIYSYVS
TATTGATAAATTTTAAAATTGGTATATTTGAAATAA
GGGUAUAUUGAUAAAUUUUAAAAUUGGU
VYICVCLYCLSSPYGLKLNFIIIRGYSEVEKF
AGTTGAATATTATATATAGTTATGTGAGTGTTTAT
AUAUUUGAAAUAAAGUUGAAUAUUAUAU
NSVLNLRRMA
ATATGTGTGTGTTTATATTGTTTATCTTCTCCCTAT
AUAGUUAUGUGAGUGUUUAUAUAUGUGU
GGATTAAAACTGAATTTCATAATTATAAGAGGTTA
GUGUUUAUAUUGUUUAUCUUCUCCCUAU
TTCTGAAGTGGAAAAATTTAACTCAGTATTAAATC
GGAUUAAAACUGAAUUUCAUAAUUAUAAG
TAAGGAGAATGGCCTAA
AGGUUAUUCUGAAGUGGAAAAAUUUAAC
UCAGUAUUAAAUCUAAGGAGAAUGGCCUA
A
2
168/467
ATGTCACTTCACACAGAAGAAAACATCAGTATGTC
AUGUCACUUCACACAGAAGAAAACAUCAG
MSLHTEENISMSESTLGNMHKIIPSQRPH
AGAGAGCACACTGGGGAATATGCACAAGATTATC
UAUGUCAGAGAGCACACUGGGGAAUAUGC
GLPGQPGLRGSLSQHTIWEMDQITLLVN
CCAAGCCAGAGGCCTCACGGCCTACCTGGCCAGC
ACAAGAUUAUCCCAAGCCAGAGGCCUCACG
VITL
CTGGGCTGAGAGGATCACTATCTCAGCACACTATT
GCCUACCUGGCCAGCCUGGGCUGAGAGGA
TGGGAAATGGATCAAATCACACTTTTAGTAAATGT
UCACUAUCUCAGCACACUAUUUGGGAAAU
TATCACTCTATAG
GGAUCAAAUCACACUUUUAGUAAAUGUUA
UCACUCUAUAG
3
169/468
ATGAACACTTGTGAACCTAATAGCCATATGAAGAA
AUGAACACUUGUGAACCUAAUAGCCAUAU
MNTCEPNSHMKKITFLISLDAPCTNDSYA
AATAACATTTCTAATATCTTTGGATGCCCCATGTAC
GAAGAAAAUAACAUUUCUAAUAUCUUUGG
FAFS
TAATGACAGTTATGCTTTTGCATTTTCTTGA
AUGCCCCAUGUACUAAUGACAGUUAUGCU
UUUGCAUUUUCUUGA
4
170/469
CTGTCATTATTTATAATTTTATCACACATGGCTGTA
CUGUCAUUAUUUAUAAUUUUAUCACACAU
LSLFIILSHMAVSFTCFGIMYF
TCCTTTACATGTTTTGGCATTATGTATTTTTGA
GGCUGUAUCCUUUACAUGUUUUGGCAUU
AUGUAUUUUUGA
5
171/470
TTGACACTTCAGAACTCCCCAGAAACTTGTCTTCT
UUGACACUUCAGAACUCCCCAGAAACUUG
LTLQNSPETCLLGQCVKLFMKKCQARGA
GGGCCAATGTGTAAAGCTGTTTATGAAGAAATGT
UCUUCUGGGCCAAUGUGUAAAGCUGUUU
LLQIWQRTISGEALGRWVLLATMNFRK
CAAGCCAGAGGGGCTCTACTACAAATTTGGCAAA
AUGAAGAAAUGUCAAGCCAGAGGGGCUCU
WVP
GGACAATTTCAGGAGAAGCTCTTGGCCGCTGGGT
ACUACAAAUUUGGCAAAGGACAAUUUCAG
TCTCCTGGCCACCATGAACTTCAGGAAGTGGGTG
GAGAAGCUCUUGGCCGCUGGGUUCUCCUG
CCATAG
GCCACCAUGAACUUCAGGAAGUGGGUGCC
AUAG
6
172/471
CTGGGCACTGGTGTAGCTCTGTATGCCCTCCGTGT
CUGGGCACUGGUGUAGCUCUGUAUGCCCU
LGTGVALYALRVRCWRCHLHCQSLPRVH
CAGATGCTGGAGATGTCATTTGCATTGCCAGAGTT
CCGUGUCAGAUGCUGGAGAUGUCAUUUGC
TESRLKSTLLEHLSNAFYSQSLTSLTRDKEE
TGCCAAGGGTGCACACAGAAAGCAGATTGAAAAG
AUUGCCAGAGUUUGCCAAGGGUGCACACA
LFNGPRSIYEDNQVGVWSG
CACCCTCTTGGAACATCTCTCCAATGCCTTCTACTC
GAAAGCAGAUUGAAAAGCACCCUCUUGGA
ACAAAGTTTAACATCATTAACACGTGACAAAGAA
ACAUCUCUCCAAUGCCUUCUACUCACAAAG
GAACTATTTAATGGGCCCAGATCTATTTATGAAGA
UUUAACAUCAUUAACACGUGACAAAGAAG
CAATCAAGTGGGAGTTTGGAGTGGATAA
AACUAUUUAAUGGGCCCAGAUCUAUUUAU
GAAGACAAUCAAGUGGGAGUUUGGAGUG
GAUAA
7
173/472
ATGCCCCATGTACTAATGACAGTTATGCTTTTGCA
AUGCCCCAUGUACUAAUGACAGUUAUGCU
MPHVLMTVMLLHFLEFYVYLSFLCHYL
TTTTCTTGAATTTTATGTTTATTTATCTTTCCTCTGT
UUUGCAUUUUCUUGAAUUUUAUGUUUAU
CATTATTTATAA
UUAUCUUUCCUCUGUCAUUAUUUAUAA
8
174/473
CTGTATCCTTTACATGTTTTGGCATTATGTATTTTT
CUGUAUCCUUUACAUGUUUUGGCAUUAU
LYPLHVLALCIFELFVKTIIPCVIFRDLIFFID
GAACTTTTTGTAAAGACAATCATACCATGTGTAAT
GUAUUUUUGAACUUUUUGUAAAGACAAU
F
TTTCAGGGACTTGATTTTTTTCATTGACTTTTAA
CAUACCAUGUGUAAUUUUCAGGGACUUGA
UUUUUUUCAUUGACUUUUAA
9
175/474
ATGGTGAATTTGGAGCAGCCAATGAAGAGTCCCC
AUGGUGAAUUUGGAGCAGCCAAUGAAGAG
MVNLEQPMKSPLTLWPRSKLRSRCSVAL
TCACATTGTGGCCTCGCTCAAAGTTAAGAAGTCGC
UCCCCUCACAUUGUGGCCUCGCUCAAAGU
EESASN
TGTAGTGTTGCCCTTGAAGAATCTGCTTCAAATTG
UAAGAAGUCGCUGUAGUGUUGCCCUUGAA
A
GAAUCUGCUUCAAAUUGA
10
176/475
ATGCCCTCCGTGTCAGATGCTGGAGATGTCATTTG
AUGCCCUCCGUGUCAGAUGCUGGAGAUGU
MPSVSDAGDVICIARVCQGCTQKAD
CATTGCCAGAGTTTGCCAAGGGTGCACACAGAAA
CAUUUGCAUUGCCAGAGUUUGCCAAGGGU
GCAGATTGA
GCACACAGAAAGCAGAUUGA
11
177/476
GTGACAAAGAAGAACTATTTAATGGGCCCAGATC
GUGACAAAGAAGAACUAUUUAAUGGGCCC
VTKKNYLMGPDLFMKTIKWEFGVDNPN
TATTTATGAAGACAATCAAGTGGGAGTTTGGAGT
AGAUCUAUUUAUGAAGACAAUCAAGUGGG
LDNW
GGATAACCCAAATTTGGATAACTGGTGA
AGUUUGGAGUGGAUAACCCAAAUUUGGA
UAACUGGUGA
12
178/477
TTGAATTTGTGTGGTATGTCACTCAGAAAGAATCG
UUGAAUUUGUGUGGUAUGUCACUCAGAA
LNLCGMSLRKNRNGYIDKF
TAATGGGTATATTGATAAATTTTAA
AGAAUCGUAAUGGGUAUAUUGAUAAAUU
UUAA
13
179/478
ATGTGTGTGTTTATATTGTTTATCTTCTCCCTATGG
AUGUGUGUGUUUAUAUUGUUUAUCUUCU
MCVFILFIFSLWIKTEFHNYKRLF
ATTAAAACTGAATTTCATAATTATAAGAGGTTATT
CCCUAUGGAUUAAAACUGAAUUUCAUAAU
CTGA
UAUAAGAGGUUAUUCUGA
14
180/479
TTGAATTTTATGTTTATTTATCTTTCCTCTGTCATTA
UUGAAUUUUAUGUUUAUUUAUCUUUCCU
LNFMFIYLSSVIIYNFITHGCILYMFWHYV
TTTATAATTTTATCACACATGGCTGTATCCTTTACA
CUGUCAUUAUUUAUAAUUUUAUCACACAU
FLNFL
TGTTTTGGCATTATGTATTTTTGAACTTTTTGTAA
GGCUGUAUCCUUUACAUGUUUUGGCAUU
AUGUAUUUUUGAACUUUUUGUAA
15
181/480
ATGAAATATTGTCTAGTAATATGTAAAGAGAAGTC
AUGAAAUAUUGUCUAGUAAUAUGUAAAG
MKYCLVICKEKSQEYVMSRCKELLLSPW
TCAAGAATATGTGATGAGCAGATGTAAGGAATTG
AGAAGUCUCAAGAAUAUGUGAUGAGCAGA
CTCTTGTCTCCATGGTGA
UGUAAGGAAUUGCUCUUGUCUCCAUGGU
GA
16
182/481
GTGTTGCCCTTGAAGAATCTGCTTCAAATTGACAC
GUGUUGCCCUUGAAGAAUCUGCUUCAAAU
VLPLKNLLQIDTSELPRNLSSGPMCKAVY
TTCAGAACTCCCCAGAAACTTGTCTTCTGGGCCAA
UGACACUUCAGAACUCCCCAGAAACUUGUC
EEMSSQRGSTTNLAKDNFRRSSWPLGSP
TGTGTAAAGCTGTTTATGAAGAAATGTCAAGCCA
UUCUGGGCCAAUGUGUAAAGCUGUUUAU
GHHELQEVGAIAAA
GAGGGGCTCTACTACAAATTTGGCAAAGGACAAT
GAAGAAAUGUCAAGCCAGAGGGGCUCUAC
TTCAGGAGAAGCTCTTGGCCGCTGGGTTCTCCTGG
UACAAAUUUGGCAAAGGACAAUUUCAGGA
CCACCATGAACTTCAGGAAGTGGGTGCCATAGCA
GAAGCUCUUGGCCGCUGGGUUCUCCUGGC
GCAGCCTGA
CACCAUGAACUUCAGGAAGUGGGUGCCAU
AGCAGCAGCCUGA
17
183/482
ATGCTGGAGATGTCATTTGCATTGCCAGAGTTTGC
AUGCUGGAGAUGUCAUUUGCAUUGCCAGA
MLEMSFALPEFAKGAHRKQIEKHPLGTSL
CAAGGGTGCACACAGAAAGCAGATTGAAAAGCAC
GUUUGCCAAGGGUGCACACAGAAAGCAGA
QCLLLTKFNIINT
CCTCTTGGAACATCTCTCCAATGCCTTCTACTCACA
UUGAAAAGCACCCUCUUGGAACAUCUCUC
AAGTTTAACATCATTAACACGTGA
CAAUGCCUUCUACUCACAAAGUUUAACAU
CAUUAACACGUGA
18
184/483
CTGAATGGTGCAGGAGACCCTCTCACACCAGGTT
CUGAAUGGUGCAGGAGACCCUCUCACACC
LNGAGDPLTPGYPANEYAYRRGIAEAVG
ACCCAGCAAATGAATATGCTTATAGGCGTGGAATT
AGGUUACCCAGCAAAUGAAUAUGCUUAUA
LPSIPVHPIGYYDAQKLLEVVYKILQ
GCAGAGGCTGTTGGTCTTCCAAGTATTCCTGTTCA
GGCGUGGAAUUGCAGAGGCUGUUGGUCU
TCCAATTGGATACTATGATGCACAGAAGCTCCTAG
UCCAAGUAUUCCUGUUCAUCCAAUUGGAU
AGGTGGTTTATAAAATCCTCCAATGA
ACUAUGAUGCACAGAAGCUCCUAGAGGUG
GUUUAUAAAAUCCUCCAAUGA
19 (BSS)
185/484
CTGAGAACATCAAGAAGTTCTTATAGGTGGTTTAT
CUGAGAACAUCAAGAAGUUCUUAUAGGUG
LRTSRSSYRWFIKSSNEATNITPKHNMKA
AAAATCCTCCAATGAAGCTACTAACATTACTCCAA
GUUUAUAAAAUCCUCCAAUGAAGCUACUA
FLDELKAENIKKFL
AGCATAATATGAAAGCATTTTTGGATGAATTGAAA
ACAUUACUCCAAAGCAUAAUAUGAAAGCA
GCTGAGAACATCAAGAAGTTCTTA
UUUUUGGAUGAAUUGAAAGCUGAGAACA
UCAAGAAGUUCUUA
20 (BSS)
186/485
TTGAACCACCTCCTCCAGGATATGAAAATGTTTCG
UUGAACCACCUCCUCCAGGAUAUGAAAAU
LNHLLQDMKMFRILYHLSVLSLLKECQRF
GATATTGTACCACCTTTCAGTGCTTTCTCTCCTCAA
GUUUCGGAUAUUGUACCACCUUUCAGUGC
STHHYLNHLLQDMKMFRILYHLSVLSLLK
GGAATGCCAGAGATTTTCAACACATCATTATTTGA
UUUCUCUCCUCAAGGAAUGCCAGAGAUUU
ECQ
ACCACCTCCTCCAGGATATGAAAATGTTTCGGATA
UCAACACAUCAUUAUUUGAACCACCUCCUC
TTGTACCACCTTTCAGTGCTTTCTCTCCTCAAGGAA
CAGGAUAUGAAAAUGUUUCGGAUAUUGU
TGCCAG
ACCACCUUUCAGUGCUUUCUCUCCUCAAG
GAAUGCCAG
Nkx3.1
neo-
antigen
DNA sequence
RNA sequence
Protein Sequence
1
187/486
GTGTGCACACCAAAGCTATTGGAGATTTGCGTGG
GUGUGCACACCAAAGCUAUUGGAGAUUUG
VCTPKLLEICVEISDSSLVRQ
AAATCTCAGATTCTTCACTGGTGAGACAATGA
CGUGGAAAUCUCAGAUUCUUCACUGGUGA
GACAAUGA
2
188/487
ATGGTATGCCAACTTAAGTATTTACAGGGTGGCCC
AUGGUAUGCCAACUUAAGUAUUUACAGGG
MVCQLKYLQGGPNRTRCTRCDFKTSCIN
AAATAGAACAAGATGCACTCGCTGTGATTTTAAGA
UGGCCCAAAUAGAACAAGAUGCACUCGCU
RTPLQEGGPGQENLRLSKTGA
CAAGCTGTATAAACAGAACTCCACTGCAAGAGGG
GUGAUUUUAAGACAAGCUGUAUAAACAGA
GGGGCCGGGCCAGGAGAATCTCCGCTTGTCCAAG
ACUCCACUGCAAGAGGGGGGGCCGGGCCA
ACAGGGGCCTAA
GGAGAAUCUCCGCUUGUCCAAGACAGGGG
CCUAA
3
189/488
ATGAGCAGAGAGCAAAGGAGAAATCTGGCTGTCC
AUGAGCAGAGAGCAAAGGAGAAAUCUGGC
MSREQRRNLAVLPFSFCYLR
TTCCATTTTCATTCTGTTATCTCAGGTGA
UGUCCUUCCAUUUUCAUUCUGUUAUCUCA
GGUGA
4
190/489
TTGGGCTCCACATTGCAATGTTCAATGCCACGTGC
UUGGGCUCCACAUUGCAAUGUUCAAUGCC
LGSTLQCSMPRAADTDRSTSQHKRQGSL
TGCTGACACCGACCGGAGTACTAGCCAGCACAAA
ACGUGCUGCUGACACCGACCGGAGUACUA
NCFLLFTFLLK
AGGCAGGGTAGCCTGAATTGCTTTCTGCTCTTTAC
GCCAGCACAAAAGGCAGGGUAGCCUGAAU
ATTTCTTTTAAAATAA
UGCUUUCUGCUCUUUACAUUUCUUUUAA
AAUAA
5
191/490
CTGAGTACTCTTTCTCTCCCCTCCTCTGAATTTAATT
CUGAGUACUCUUUCUCUCCCCUCCUCUGA
LSTLSLPSSEFNSFNLQFARITHFTVMYIVL
CTTTCAACTTGCAATTTGCAAGGATTACACATTTCA
AUUUAAUUCUUUCAACUUGCAAUUUGCAA
QKKKKSVFV
CTGTGATGTATATTGTGTTGCAAAAAAAAAAAAAA
GGAUUACACAUUUCACUGUGAUGUAUAU
AGTGTCTTTGTTTAA
UGUGUUGCAAAAAAAAAAAAAAAGUGUCU
UUGUUUAA
6
192/491
CTGACAGGTGAATTGGATGGTTCTCAGAACCATTT
CUGACAGGUGAAUUGGAUGGUUCUCAGA
LTGELDGSQNHFTQTACFYPV
CACCCAGACAGCCTGTTTCTATCCTGTTTAA
ACCAUUUCACCCAGACAGCCUGUUUCUAU
CCUGUUUAA
7
193/492
CTGAAAACACTTCAGGCGCCCTTCCAAGGCTTCCC
CUGAAAACACUUCAGGCGCCCUUCCAAGGC
LKTLQAPFQGFPKPLSSRRSAPELPSPTLR
CAAACCCCTAAGCAGCCGCAGAAGCGCTCCCGAG
UUCCCCAAACCCCUAAGCAGCCGCAGAAGC
CTGCCTTCTCCCACACTCAGGTGA
GCUCCCGAGCUGCCUUCUCCCACACUCAGG
UGA
8
194/493
CTGCCTTCCCCAGGGTGTCTCTATGAAAAGCACAA
CUGCCUUCCCCAGGGUGUCUCUAUGAAAA
LPSPGCLYEKHKGPRSGSKRCAHQSYWR
GGGGCCAAGGTCAGGGAGCAAGAGGTGTGCACA
GCACAAGGGGCCAAGGUCAGGGAGCAAGA
FAWKSQILHW
CCAAAGCTATTGGAGATTTGCGTGGAAATCTCAG
GGUGUGCACACCAAAGCUAUUGGAGAUUU
ATTCTTCACTGGTGA
GCGUGGAAAUCUCAGAUUCUUCACUGGUG
A
9
195/494
CTGTTTGAAGGGGAGAGAGGGAAAATCAAGTGG
CUGUUUGAAGGGGAGAGAGGGAAAAUCA
LFEGERGKIKWYFPALCMILDELYTQGFC
TATTTTCCAGCACTTTGTATGATTTTGGATGAGTTG
AGUGGUAUUUUCCAGCACUUUGUAUGAU
SATPSSCVTEYQL
TACACCCAAGGATTCTGTTCTGCAACTCCATCCTCC
UUUGGAUGAGUUGUACACCCAAGGAUUCU
TGTGTCACTGAATATCAACTCTGA
GUUCUGCAACUCCAUCCUCCUGUGUCACU
GAAUAUCAACUCUGA
10
196/495
CTGTTGGCCTTGGAATATGGCCAAGGCTCTCTCTG
CUGUUGGCCUUGGAAUAUGGCCAAGGCUC
LLALEYGQGSLCPCKREGQIESLQENALM
TCCCTGTAAAAGAGAGGGGCAAATAGAGAGTCTC
UCUCUGUCCCUGUAAAAGAGAGGGGCAAA
LSTYLHGRGRWVGGDENISFSYSFLFLLK
CAAGAGAACGCCCTCATGCTCAGCACATATTTGCA
UAGAGAGUCUCCAAGAGAACGCCCUCAUG
WYANLSIYRVAQIEQDALAVILRQAV
TGGGAGGGGGAGATGGGTGGGAGGAGATGAAA
CUCAGCACAUAUUUGCAUGGGAGGGGGAG
ATATCAGCTTTTCTTATTCCTTTTTATTCCTTTTAAA
AUGGGUGGGAGGAGAUGAAAAUAUCAGC
ATGGTATGCCAACTTAAGTATTTACAGGGTGGCCC
UUUUCUUAUUCCUUUUUAUUCCUUUUAA
AAATAGAACAAGATGCACTCGCTGTGATTTTAAGA
AAUGGUAUGCCAACUUAAGUAUUUACAGG
CAAGCTGTATAA
GUGGCCCAAAUAGAACAAGAUGCACUCGC
UGUGAUUUUAAGACAAGCUGUAUAA
11
197/496
CTGGAGAATTTAGAATCAGAAGTTTCCTGGAGTTT
CUGGAGAAUUUAGAAUCAGAAGUUUCCUG
LENLESEVSWSFQAIIYTVS
TCAGGCTATCATATATACTGTATCCTGA
GAGUUUUCAGGCUAUCAUAUAUACUGUA
UCCUGA
12
198/497
TTGGGGGCAGATTCTGAATTGGCTAAAAGACATG
UUGGGGGCAGAUUCUGAAUUGGCUAAAA
LGADSELAKRHAFLKLATLISFL
CATTTTTAAAACTAGCAACTCTTATTTCTTTCCTTTA
GACAUGCAUUUUUAAAACUAGCAACUCUU
A
AUUUCUUUCCUUUAA
13
199/498
TTGAGAATCTTTGCATGCAGAGGAGGTAAGAGGT
UUGAGAAUCUUUGCAUGCAGAGGAGGUA
LRIFACRGGKRYWIFTEEEHSAE
ATTGGATTTTCACAGAGGAAGAACACAGCGCAGA
AGAGGUAUUGGAUUUUCACAGAGGAAGA
ATGA
ACACAGCGCAGAAUGA
14
200/499
CTGTCCAGTGGAGGGCTCATGGGTGGGACATGGA
CUGUCCAGUGGAGGGCUCAUGGGUGGGA
LSSGGLMGGTWKRRQPRPWGAQSTEQ
AAAGAAGGCAGCCTAGGCCCTGGGGAGCCCAGTC
CAUGGAAAAGAAGGCAGCCUAGGCCCUGG
ARD
CACTGAGCAAGCAAGGGACTGA
GGAGCCCAGUCCACUGAGCAAGCAAGGGA
CUGA
15
201/500
GTGAGCCTTTTGCAGGAAAAGGCTAAGAAAAAGG
GUGAGCCUUUUGCAGGAAAAGGCUAAGAA
VSLLQEKAKKKENHSKTQQETVQMLWE
AAAACCATTCTAAAACACAACAAGAAACTGTCCAA
AAAGGAAAACCAUUCUAAAACACAACAAGA
LCLLPIMGPQNG
ATGCTTTGGGAACTGTGTTTATTGCCTATAATGGG
AACUGUCCAAAUGCUUUGGGAACUGUGUU
TCCCCAAAATGGGTAA
UAUUGCCUAUAAUGGGUCCCCAAAAUGGG
UAA
16
202/501
TTGGCCAATTTCACCCCATTTTCTGTGGTTTGGGCT
UUGGCCAAUUUCACCCCAUUUUCUGUGGU
LANFTPFSVVWAPHCNVQCHVLLTPTGV
CCACATTGCAATGTTCAATGCCACGTGCTGCTGAC
UUGGGCUCCACAUUGCAAUGUUCAAUGCC
LASTKGRVA
ACCGACCGGAGTACTAGCCAGCACAAAAGGCAGG
ACGUGCUGCUGACACCGACCGGAGUACUA
GTAGCCTGA
GCCAGCACAAAAGGCAGGGUAGCCUGA
17
203/502
TTGTGTTGCAAAAAAAAAAAAAAAGTGTCTTTGTT
UUGUGUUGCAAAAAAAAAAAAAAAGUGUC
LCCKKKKKVSLFKITWFVNPSCFFPIGTSH
TAAAATTACTTGGTTTGTGAATCCATCTTGCTTTTT
UUUGUUUAAAAUUACUUGGUUUGUGAAU
CCCCATTGGAACTAGTCATTAA
CCAUCUUGCUUUUUCCCCAUUGGAACUAG
UCAUUAA
18
204/503
GTGAATTGGATGGTTCTCAGAACCATTTCACCCAG
GUGAAUUGGAUGGUUCUCAGAACCAUUUC
VNWMVLRTISPRQPVSILFNKLVWVLYM
ACAGCCTGTTTCTATCCTGTTTAATAAATTAGTTTG
ACCCAGACAGCCUGUUUCUAUCCUGUUUA
HNKPCSNLSHKSL
GGTTCTCTACATGCATAACAAACCCTGCTCCAATC
AUAAAUUAGUUUGGGUUCUCUACAUGCA
TGTCACATAAAAGTCTGTGA
UAACAAACCCUGCUCCAAUCUGUCACAUAA
AAGUCUGUGA
19
205/504
CTGCGTGGGCAGCTGGAGCCCAGCTTTTTGGTAA
CUGCGUGGGCAGCUGGAGCCCAGCUUUUU
LRGQLEPSFLVMPAQVTTIMIKNCLPQG
TGCCAGCTCAGGTGACAACCATTATGATCAAAAAC
GGUAAUGCCAGCUCAGGUGACAACCAUUA
VSMKSTRGQGQGARGVHTKAIGDLRGN
TGCCTTCCCCAGGGTGTCTCTATGAAAAGCACAAG
UGAUCAAAAACUGCCUUCCCCAGGGUGUC
LRFFTGETMKQQRQ
GGGCCAAGGTCAGGGAGCAAGAGGTGTGCACAC
UCUAUGAAAAGCACAAGGGGCCAAGGUCA
CAAAGCTATTGGAGATTTGCGTGGAAATCTCAGA
GGGAGCAAGAGGUGUGCACACCAAAGCUA
TTCTTCACTGGTGAGACAATGAAACAACAGAGAC
UUGGAGAUUUGCGUGGAAAUCUCAGAUU
AGTGA
CUUCACUGGUGAGACAAUGAAACAACAGA
GACAGUGA
20
206/505
TTGCTTCTGGCTACCTGTTTGAAGGGGAGAGAGG
UUGCUUCUGGCUACCUGUUUGAAGGGGA
LLLATCLKGREGKSSGIFQHFV
GAAAATCAAGTGGTATTTTCCAGCACTTTGTATGA
GAGAGGGAAAAUCAAGUGGUAUUUUCCAG
CACUUUGUAUGA
21
207/506
ATGGGAGGGGGAGATGGGTGGGAGGAGATGAA
AUGGGAGGGGGAGAUGGGUGGGAGGAGA
MGGGDGWEEMKISAFLIPFYSF
AATATCAGCTTTTCTTATTCCTTTTTATTCCTTTTAA
UGAAAAUAUCAGCUUUUCUUAUUCCUUU
UUAUUCCUUUUAA
22
208/507
CTGTATAAACAGAACTCCACTGCAAGAGGGGGGG
CUGUAUAAACAGAACUCCACUGCAAGAGG
LYKQNSTARGGAGPGESPLVQDRGLRRV
CCGGGCCAGGAGAATCTCCGCTTGTCCAAGACAG
GGGGGCCGGGCCAGGAGAAUCUCCGCUUG
STLLLGAVAFFY
GGGCCTAAGGAGGGTCTCCACACTGCTGCTAGGG
UCCAAGACAGGGGCCUAAGGAGGGUCUCC
GCTGTTGCATTTTTTTATTAG
ACACUGCUGCUAGGGGCUGUUGCAUUUU
UUUAUUAG
23
209/508
CTGTATCCTGAAAGGCAACATAATTCTTCCTTCCCT
CUGUAUCCUGAAAGGCAACAUAAUUCUUC
LYPERQHNSSFPPFKILCSFLQQLLTKGLH
CCTTTTAAAATTTTGTGTTCCTTTTTGCAGCAATTA
CUUCCCUCCUUUUAAAAUUUUGUGUUCCU
FSPDF
CTCACTAAAGGGCTTCATTTTAGTCCAGATTTTTAG
UUUUGCAGCAAUUACUCACUAAAGGGCUU
CAUUUUAGUCCAGAUUUUUAG
24
210/509
ATGCCTCGCTTATTTAGCCCGAGATCTGGTCTTTTT
AUGCCUCGCUUAUUUAGCCCGAGAUCUGG
MPRLFSPRSGLFFFFFFFFSVSPKLYLS
TTTTTTTTTTTTTTTTTTTTTTCCGTCTCCCCAAAGCT
UCUUUUUUUUUUUUUUUUUUUUUUUUU
TTATCTGTCTTGA
UUCCGUCUCCCCAAAGCUUUAUCUGUCUU
GA
25
211/510
TTGCATGCAGAGGAGGTAAGAGGTATTGGATTTT
UUGCAUGCAGAGGAGGUAAGAGGUAUUG
LHAEEVRGIGFSQRKNTAQNEGPGLLSCP
CACAGAGGAAGAACACAGCGCAGAATGAAGGGC
GAUUUUCACAGAGGAAGAACACAGCGCAG
VEGSWVGHGKEGSLGPGEPSPLSKQGTE
CAGGCTTACTGAGCTGTCCAGTGGAGGGCTCATG
AAUGAAGGGCCAGGCUUACUGAGCUGUCC
GGTGGGACATGGAAAAGAAGGCAGCCTAGGCCC
AGUGGAGGGCUCAUGGGUGGGACAUGGA
TGGGGAGCCCAGTCCACTGAGCAAGCAAGGGACT
AAAGAAGGCAGCCUAGGCCCUGGGGAGCC
GAGTGA
CAGUCCACUGAGCAAGCAAGGGACUGAGU
GA
26
212/511
ATGGGTAACCTAGACTTCAGAGAGAATGAGCAGA
AUGGGUAACCUAGACUUCAGAGAGAAUGA
MGNLDFRENEQRAKEKSGCPSIFILLSQV
GAGCAAAGGAGAAATCTGGCTGTCCTTCCATTTTC
GCAGAGAGCAAAGGAGAAAUCUGGCUGUC
SW
ATTCTGTTATCTCAGGTGAGCTGGTAG
CUUCCAUUUUCAUUCUGUUAUCUCAGGUG
AGCUGGUAG
27
213/512
TTGAAGTTTAGTCAGCACCCCCACCAAACTTTATTT
UUGAAGUUUAGUCAGCACCCCCACCAAACU
LKFSQHPHQTLFFYVFFATYECFENKVPM
TTCTATGTGTTTTTTGCAACATATGAGTGTTTTGAA
UUAUUUUUCUAUGUGUUUUUUGCAACAU
SLLDL
AATAAAGTACCCATGTCTTTATTAGATTTA
AUGAGUGUUUUGAAAAUAAAGUACCCAUG
UCUUUAUUAGAUUUA
28 (BSS)
214/513
GTGCGGGCCGGGCGGGTGCATTCAGGCCAAGGC
GUGCGGGCCGGGCGGGUGCAUUCAGGCCA
VRAGRVHSGQGGAAGMLRVPEPRPGEA
GGGGCCGCCGGGATGCTCAGGGTTCCGGAGCCG
AGGCGGGGCCGCCGGGAUGCUCAGGGUUC
KAEGAAPPTPSKPLTSFLIQDILRDGAQR
CGGCCCGGGGAGGCGAAAGCGGAGGGGGCCGC
CGGAGCCGCGGCCCGGGGAGGCGAAAGCG
QGGRTSSQRQRDPEPEPEPEPEGGRSRA
GCCGCCGACCCCGTCCAAGCCGCTCACGTCCTTCC
GAGGGGGCCGCGCCGCCGACCCCGUCCAAG
GAQNDQLSTGPRAAPEEAETLAETEPGG
TCATCCAGGACATCCTGCGGGACGGCGCGCAGCG
CCGCUCACGUCCUUCCUCAUCCAGGACAUC
AGRAGAFRPRRGRRDAQGSGAAARGGE
GCAAGGCGGCCGCACGAGCAGCCAGAGACAGCG
CUGCGGGACGGCGCGCAGCGGCAAGGCGG
SGGGRAADPVQAAHVLPHPGHPAGRRA
CGACCCGGAGCCGGAGCCAGAGCCAGAGCCAGA
CCGCACGAGCAGCCAGAGACAGCGCGACCC
AARRPHEQPETARPGAGARARARGRTQ
GGGAGGACGCAGCCGCGCCGGGGCGCAGAACGA
GGAGCCGGAGCCAGAGCCAGAGCCAGAGG
PRRGAERPAEHRAPRRAGGGRDAGRDR
CCAGCTGAGCACCGGGCCCCGCGCCGCGCCGGAG
GAGGACGCAGCCGCGCCGGGGCGCAGAAC
A
GAGGCCGAGACGCTGGCAGAGACCGAGCCAGGC
GACCAGCUGAGCACCGGGCCCCGCGCCGCG
GGTGCGGGCCGGGCGGGTGCATTCAGGCCAAGG
CCGGAGGAGGCCGAGACGCUGGCAGAGAC
CGGGGCCGCCGGGATGCTCAGGGTTCCGGAGCC
CGAGCCAGGCGGUGCGGGCCGGGCGGGUG
GCGGCCCGGGGAGGCGAAAGCGGAGGGGGCCG
CAUUCAGGCCAAGGCGGGGCCGCCGGGAU
CGCCGCCGACCCCGTCCAAGCCGCTCACGTCCTTC
GCUCAGGGUUCCGGAGCCGCGGCCCGGGG
CTCATCCAGGACATCCTGCGGGACGGCGCGCAGC
AGGCGAAAGCGGAGGGGGCCGCGCCGCCG
GGCAAGGCGGCCGCACGAGCAGCCAGAGACAGC
ACCCCGUCCAAGCCGCUCACGUCCUUCCUC
GCGACCCGGAGCCGGAGCCAGAGCCAGAGCCAG
AUCCAGGACAUCCUGCGGGACGGCGCGCA
AGGGAGGACGCAGCCGCGCCGGGGCGCAGAACG
GCGGCAAGGCGGCCGCACGAGCAGCCAGA
ACCAGCTGAGCACCGGGCCCCGCGCCGCGCCGGA
GACAGCGCGACCCGGAGCCGGAGCCAGAGC
GGAGGCCGAGACGCTGGCAGAGACCGAGCC
CAGAGCCAGAGGGAGGACGCAGCCGCGCC
GGGGCGCAGAACGACCAGCUGAGCACCGG
GCCCCGCGCCGCGCCGGAGGAGGCCGAGAC
GCUGGCAGAGACCGAGCC
NPY neo-
antigen
DNA sequence
RNA sequence
Protein sequence
1
215/514
CTGGCTCTCACCCCTCGGAGACGCTCGCCCGACAG
CUGGCUCUCACCCCUCGGAGACGCUCGCCC
LALTPRRRSPDSIVLAAQPRPRASHHAR
CATAGTACTTGCCGCCCAGCCACGCCCGCGCGCCA
GACAGCAUAGUACUUGCCGCCCAGCCACGC
GCCACCATGCTAGGTAA
CCGCGCGCCAGCCACCAUGCUAGGUAA
2
216/515
GTGCCTGGGTGCGCTGGCCGAGGCGTACCCCTCC
GUGCCUGGGUGCGCUGGCCGAGGCGUACC
VPGCAGRGVPLQAGQPGRGRTSGGHG
AAGCCGGACAACCCGGGCGAGGACGCACCAGCG
CCUCCAAGCCGGACAACCCGGGCGAGGACG
QILLGAATLHQPHHQAEIWKTIQPRDTD
GAGGACATGGCCAGATACTACTCGGCGCTGCGAC
CACCAGCGGAGGACAUGGCCAGAUACUAC
FRPLDERKHRKCSQNSA
ACTACATCAACCTCATCACCAGGCAGAGATATGGA
UCGGCGCUGCGACACUACAUCAACCUCAUC
AAACGATCCAGCCCAGAGACACTGATTTCAGACCT
ACCAGGCAGAGAUAUGGAAAACGAUCCAG
CTTGATGAGAGAAAGCACAGAAAATGTTCCCAGA
CCCAGAGACACUGAUUUCAGACCUCUUGA
ACTCGGCTTGA
UGAGAGAAAGCACAGAAAAUGUUCCCAGA
ACUCGGCUUGA
3
217/516
GTGGTGATGGGAAATGAGACTTGCTCTCTGGCCT
GUGGUGAUGGGAAAUGAGACUUGCUCUC
VVMGNETCSLAFSYFQPIFHRVKRESTHP
TTTCCTATTTTCAGCCCATATTTCATCGTGTAAAAC
UGGCCUUUUCCUAUUUUCAGCCCAUAUUU
TNACSHCAEFCNVFLCHHCIYVCLNKVSC
GAGAATCCACCCATCCTACCAATGCATGCAGCCAC
CAUCGUGUAAAACGAGAAUCCACCCAUCCU
IQK
TGTGCTGAATTCTGCAATGTTTTCCTTTGTCATCAT
ACCAAUGCAUGCAGCCACUGUGCUGAAUU
TGTATATATGTGTGTTTAAATAAAGTATCATGCAT
CUGCAAUGUUUUCCUUUGUCAUCAUUGU
TCAAAAGTGA
AUAUAUGUGUGUUUAAAUAAAGUAUCAU
GCAUUCAAAAGUGA
4
218/517
ATGCATGCAGCCACTGTGCTGAATTCTGCAATGTT
AUGCAUGCAGCCACUGUGCUGAAUUCUGC
MHAATVLNSAMFSFVIIVYMCV
TTCCTTTGTCATCATTGTATATATGTGTGTTTAA
AAUGUUUUCCUUUGUCAUCAUUGUAUAU
AUGUGUGUUUAA
5
219/518
GTGCGCTGGCCGAGGCGTACCCCTCCAAGCCGGA
GUGCGCUGGCCGAGGCGUACCCCUCCAAG
VRWPRRTPPSRTTRARTHQRRTWPDTT
CAACCCGGGCGAGGACGCACCAGCGGAGGACAT
CCGGACAACCCGGGCGAGGACGCACCAGCG
RRCDTTSTSSPGRDMENDPAQRH
GGCCAGATACTACTCGGCGCTGCGACACTACATCA
GAGGACAUGGCCAGAUACUACUCGGCGCU
ACCTCATCACCAGGCAGAGATATGGAAAACGATC
GCGACACUACAUCAACCUCAUCACCAGGCA
CAGCCCAGAGACACTGA
GAGAUAUGGAAAACGAUCCAGCCCAGAGA
CACUGA
6
220/519
TTGCTCTCTGGCCTTTTCCTATTTTCAGCCCATATTT
UUGCUCUCUGGCCUUUUCCUAUUUUCAGC
LLSGLFLFSAHISSCKTRIHPSYQCMQPLC
CATCGTGTAAAACGAGAATCCACCCATCCTACCAA
CCAUAUUUCAUCGUGUAAAACGAGAAUCC
TGCATGCAGCCACTGTGCTGA
ACCCAUCCUACCAAUGCAUGCAGCCACUGU
GCUGA
7 (BSS)
221/520
CTGATTTCAGACCTCTTGATGAGAGAAAGCACAG
CUGAUUUCAGACCUCUUGAUGAGAGAAAG
LISDLLMRESTENVPRTRYGKRSSPETLIS
AAAATGTTCCCAGAACTCGATATGGAAAACGATCC
CACAGAAAAUGUUCCCAGAACUCGAUAUG
DLLMRESTENVPRT
AGCCCAGAGACACTGATTTCAGACCTCTTGATGAG
GAAAACGAUCCAGCCCAGAGACACUGAUU
AGAAAGCACAGAAAATGTTCCCAGAACT
UCAGACCUCUUGAUGAGAGAAAGCACAGA
AAAUGUUCCCAGAACU
8 (BSS)
222/521
GTGCCTGGGTGCGCTGGCCGAGGCGTACCCCTCC
GUGCCUGGGUGCGCUGGCCGAGGCGUACC
VPGCAGRGVPLQAGQPGRGRTSGGHG
AAGCCGGACAACCCGGGCGAGGACGCACCAGCG
CCUCCAAGCCGGACAACCCGGGCGAGGACG
QILLGAATLHQPHHQAEMLGNKRLGLSG
GAGGACATGGCCAGATACTACTCGGCGCTGCGAC
CACCAGCGGAGGACAUGGCCAGAUACUAC
LTLALSLLVCLGAEAYPSKPDNPGEDAPA
ACTACATCAACCTCATCACCAGGCAGAGATGCTAG
UCGGCGCUGCGACACUACAUCAACCUCAUC
EDMARYYSALRHYINLITRQ
GTAACAAGCGACTGGGGCTGTCCGGACTGACCCT
ACCAGGCAGAGAUGCUAGGUAACAAGCGA
CGCCCTGTCCCTGCTCGTGTGCCTGGGTGCGCTGG
CUGGGGCUGUCCGGACUGACCCUCGCCCU
CCGAGGCGTACCCCTCCAAGCCGGACAACCCGGG
GUCCCUGCUCGUGUGCCUGGGUGCGCUGG
CGAGGACGCACCAGCGGAGGACATGGCCAGATA
CCGAGGCGUACCCCUCCAAGCCGGACAACC
CTACTCGGCGCTGCGACACTACATCAACCTCATCA
CGGGCGAGGACGCACCAGCGGAGGACAUG
CCAGGCAG
GCCAGAUACUACUCGGCGCUGCGACACUAC
AUCAACCUCAUCACCAGGCAG
SPDEF neo-
antigen
DNA sequence
RNA sequence
Protein sequence
1
223/522
CTGACTTCCTCCCAGCACATTCCTGCACTCTGCCGT
CUGACUUCCUCCCAGCACAUUCCUGCACUC
LTSSQHIPALCRVHTAPQTQSSKPAASSL
GTCCACACTGCCCCACAGACCCAGTCCTCCAAGCC
UGCCGUGUCCACACUGCCCCACAGACCCAG
QAPQVGPCHGASRQPWAGGRGLPTGT
TGCTGCCAGCTCCCTGCAAGCCCCTCAGGTTGGGC
UCCUCCAAGCCUGCUGCCAGCUCCCUGCAA
QP
CTTGCCACGGTGCCAGCAGGCAGCCCTGGGCTGG
GCCCCUCAGGUUGGGCCUUGCCACGGUGC
GGGTAGGGGACTCCCTACAGGCACGCAGCCCTGA
CAGCAGGCAGCCCUGGGCUGGGGGUAGGG
GACUCCCUACAGGCACGCAGCCCUGA
2
224/523
GTGGCCTCAGCTGCCCACACCTCTTCCCGGCCCCT
GUGGCCUCAGCUGCCCACACCUCUUCCCGG
VASAAHTSSRPLKLALQQTAPWAPGS
GAAGTTGGCACTGCAGCAGACAGCTCCCTGGGCA
CCCCUGAAGUUGGCACUGCAGCAGACAGC
CCAGGCAGCTAA
UCCCUGGGCACCAGGCAGCUAA
3
225/524
TTGACATGCTGTACCCTGAGGACAGCAGCTGGGC
UUGACAUGCUGUACCCUGAGGACAGCAGC
LTCCTLRTAAGQPRPLGPAVGRSHLRSLS
AGCCAAGGCCCCTGGGGCCAGCAGTCGGGAGGA
UGGGCAGCCAAGGCCCCUGGGGCCAGCAG
SARSLTAKPQRAAWTWCPAG
GCCACCTGAGGAGCCTGAGCAGTGCCCGGTCATT
UCGGGAGGAGCCACCUGAGGAGCCUGAGC
GACAGCCAAGCCCCAGCGGGCAGCCTGGACTTGG
AGUGCCCGGUCAUUGACAGCCAAGCCCCAG
TGCCCGGCGGGCTGA
CGGGCAGCCUGGACUUGGUGCCCGGCGGG
CUGA
4
226/525
ATGTGCAGAAGTGGCTCCTGTGGACAGAGCACCA
AUGUGCAGAAGUGGCUCCUGUGGACAGAG
MCRSGSCGQSTNTGCPPWARPSRSWR
ATACCGGCTGCCCCCCATGGGCAAGGCCTTCCAG
CACCAAUACCGGCUGCCCCCCAUGGGCAAG
ARSCAPCRRSSSASARPWVGMCCTPTW
GAGCTGGCGGGCAAGGAGCTGTGCGCCATGTCG
GCCUUCCAGGAGCUGGCGGGCAAGGAGCU
TSGSQRPG
GAGGAGCAGTTCCGCCAGCGCTCGCCCCTGGGTG
GUGCGCCAUGUCGGAGGAGCAGUUCCGCC
GGGATGTGCTGCACGCCCACCTGGACATCTGGAA
AGCGCUCGCCCCUGGGUGGGGAUGUGCUG
GTCAGCGGCCTGGATGA
CACGCCCACCUGGACAUCUGGAAGUCAGCG
GCCUGGAUGA
5
227/526
CTGGGGCGATTCACTACTGTGCCTCGACCAGTGA
CUGGGGCGAUUCACUACUGUGCCUCGACC
LGRFTTVPRPVRRAGPTARWTHHAPGSP
GGAGAGCTGGACCGACAGCGAGGTGGACTCATC
AGUGAGGAGAGCUGGACCGACAGCGAGGU
STCGSSSRSCYSSPTAMAASLGGSTRRRA
ATGCTCCGGGCAGCCCATCCACCTGTGGCAGTTCC
GGACUCAUCAUGCUCCGGGCAGCCCAUCCA
SSKLRTQPRWPGCGASARTVPP
TCAAGGAGTTGCTACTCAAGCCCCACAGCTATGGC
CCUGUGGCAGUUCCUCAAGGAGUUGCUAC
CGCTTCATTAGGTGGCTCAACAAGGAGAAGGGCA
UCAAGCCCCACAGCUAUGGCCGCUUCAUUA
TCTTCAAAATTGAGGACTCAGCCCAGGTGGCCCG
GGUGGCUCAACAAGGAGAAGGGCAUCUUC
GCTGTGGGGCATCCGCAAGAACCGTCCCGCCATG
AAAAUUGAGGACUCAGCCCAGGUGGCCCG
A
GCUGUGGGGCAUCCGCAAGAACCGUCCCG
CCAUGA
6
228/527
CTGGCCCAGGGCCTGAAACCCGCCCTCAGGGGCC
CUGGCCCAGGGCCUGAAACCCGCCCUCAGG
LAQGLKPALRGLSPACPASARP
TCTCTCCTGCCTGCCCTGCCTCAGCCAGGCCCTGA
GGCCUCUCUCCUGCCUGCCCUGCCUCAGCC
AGGCCCUGA
7
229/528
CTGACCTTCCAGAGCCCAAGGTCAGGGAGGGGCA
CUGACCUUCCAGAGCCCAAGGUCAGGGAG
LTFQSPRSGRGNQLPQGDMGPLGPSGP
ACCAACTGCCCCAGGGGGATATGGGTCCTCTGGG
GGGCAACCAACUGCCCCAGGGGGAUAUGG
WGRGASSSGPAAPLEDRGRQGCSPTPA
GCCTTCGGGACCCTGGGGCAGGGGTGCTTCCTCC
GUCCUCUGGGGCCUUCGGGACCCUGGGGC
SDPSISRAEPTEGQ
TCAGGCCCAGCTGCTCCCCTGGAGGACAGAGGGA
AGGGGUGCUUCCUCCUCAGGCCCAGCUGC
GACAGGGCTGCTCCCCAACACCTGCCTCTGACCCC
UCCCCUGGAGGACAGAGGGAGACAGGGCU
AGCATTTCCAGAGCAGAGCCTACAGAAGGGCAGT
GCUCCCCAACACCUGCCUCUGACCCCAGCA
GA
UUUCCAGAGCAGAGCCUACAGAAGGGCAG
UGA
8
230/529
ATGGTGCAGGGAGACATCTGCACCCCTGAGTTGG
AUGGUGCAGGGAGACAUCUGCACCCCUGA
MVQGDICTPELGSQECPREWIIKILEN
GCAGCCAGGAGTGCCCCCGGGAATGGATAATAAA
GUUGGGCAGCCAGGAGUGCCCCCGGGAAU
GATACTAGAGAACTGA
GGAUAAUAAAGAUACUAGAGAACUGA
9
231/530
CTGCACTCTGCCGTGTCCACACTGCCCCACAGACC
CUGCACUCUGCCGUGUCCACACUGCCCCAC
LHSAVSTLPHRPSPPSLLPAPCKPLRLGLA
CAGTCCTCCAAGCCTGCTGCCAGCTCCCTGCAAGC
AGACCCAGUCCUCCAAGCCUGCUGCCAGCU
TVPAGSPGLGVGDSLQARSPETSEGHPL
CCCTCAGGTTGGGCCTTGCCACGGTGCCAGCAGG
CCCUGCAAGCCCCUCAGGUUGGGCCUUGC
RVARPPVANLSAASATSPAGPWFRWPP
CAGCCCTGGGCTGGGGGTAGGGGACTCCCTACAG
CACGGUGCCAGCAGGCAGCCCUGGGCUGG
RCLAETRQWPQLPTPLPGP
GCACGCAGCCCTGAGACCTCAGAGGGCCACCCCT
GGGUAGGGGACUCCCUACAGGCACGCAGC
TGAGGGTGGCCAGGCCCCCAGTGGCCAACCTGAG
CCUGAGACCUCAGAGGGCCACCCCUUGAG
TGCTGCCTCTGCCACCAGCCCTGCTGGCCCCTGGT
GGUGGCCAGGCCCCCAGUGGCCAACCUGA
TCCGCTGGCCCCCCAGATGCCTGGCTGAGACACG
GUGCUGCCUCUGCCACCAGCCCUGCUGGCC
CCAGTGGCCTCAGCTGCCCACACCTCTTCCCGGCC
CCUGGUUCCGCUGGCCCCCCAGAUGCCUG
CCTGA
GCUGAGACACGCCAGUGGCCUCAGCUGCCC
ACACCUCUUCCCGGCCCCUGA
10
232/531
CTGGAGTCCCAGTCCACCCGCCACGCCCGAGCAG
CUGGAGUCCCAGUCCACCCGCCACGCCCGA
LESQSTRHARAGPVRLLPLLL
GGCCTGTCCGCCTTCTACCTCTCCTACTTTGA
GCAGGGCCUGUCCGCCUUCUACCUCUCCU
ACUUUGA
11
233/532
CTGCAAGCTGCTCAACATCACCGCAGATCCCATGG
CUGCAAGCUGCUCAACAUCACCGCAGAUCC
LQAAQHHRRSHGLEPQQCAEVAPVDRA
ACTGGAGCCCCAGCAATGTGCAGAAGTGGCTCCT
CAUGGACUGGAGCCCCAGCAAUGUGCAGA
PIPAAPHGQGLPGAGGQGAVRHVGGAV
GTGGACAGAGCACCAATACCGGCTGCCCCCCATG
AGUGGCUCCUGUGGACAGAGCACCAAUAC
PPALAPGWGCAARPPGHLEVSGLDERA
GGCAAGGCCTTCCAGGAGCTGGCGGGCAAGGAG
CGGCUGCCCCCCAUGGGCAAGGCCUUCCAG
DFTWGDSLLCLDQ
CTGTGCGCCATGTCGGAGGAGCAGTTCCGCCAGC
GAGCUGGCGGGCAAGGAGCUGUGCGCCAU
GCTCGCCCCTGGGTGGGGATGTGCTGCACGCCCA
GUCGGAGGAGCAGUUCCGCCAGCGCUCGC
CCTGGACATCTGGAAGTCAGCGGCCTGGATGAAA
CCCUGGGUGGGGAUGUGCUGCACGCCCAC
GAGCGGACTTCACCTGGGGCGATTCACTACTGTG
CUGGACAUCUGGAAGUCAGCGGCCUGGAU
CCTCGACCAGTGA
GAAAGAGCGGACUUCACCUGGGGCGAUUC
ACUACUGUGCCUCGACCAGUGA
12
234/533
CTGGACCGACAGCGAGGTGGACTCATCATGCTCC
CUGGACCGACAGCGAGGUGGACUCAUCAU
LDRQRGGLIMLRAAHPPVAVPQGVATQ
GGGCAGCCCATCCACCTGTGGCAGTTCCTCAAGG
GCUCCGGGCAGCCCAUCCACCUGUGGCAG
APQLWPLH
AGTTGCTACTCAAGCCCCACAGCTATGGCCGCTTC
UUCCUCAAGGAGUUGCUACUCAAGCCCCAC
ATTAG
AGCUAUGGCCGCUUCAUUAG
13
235/534
GTGGGGCATCCGCAAGAACCGTCCCGCCATGAAC
GUGGGGCAUCCGCAAGAACCGUCCCGCCA
VGHPQEPSRHELRQAEPLHPPVLQEGHH
TACGACAAGCTGAGCCGCTCCATCCGCCAGTATTA
UGAACUACGACAAGCUGAGCCGCUCCAUCC
PEARHLPAPRLPVRAPHLSAWPRA
CAAGAAGGGCATCATCCGGAAGCCAGACATCTCC
GCCAGUAUUACAAGAAGGGCAUCAUCCGG
CAGCGCCTCGTCTACCAGTTCGTGCACCCCATCTG
AAGCCAGACAUCUCCCAGCGCCUCGUCUAC
AGTGCCTGGCCCAGGGCCTGA
CAGUUCGUGCACCCCAUCUGAGUGCCUGG
CCCAGGGCCUGA
14
236/535
CTGCCTGCCCTGCCTCAGCCAGGCCCTGAGATGG
CUGCCUGCCCUGCCUCAGCCAGGCCCUGAG
LPALPQPGPEMGENGQSALLL
GGGAAAACGGGCAGTCTGCTCTGCTGCTCTGA
AUGGGGGAAAACGGGCAGUCUGCUCUGCU
GCUCUGA
15
237/536
GTGCTTCCTCCTCAGGCCCAGCTGCTCCCCTGGAG
GUGCUUCCUCCUCAGGCCCAGCUGCUCCCC
VLPPQAQLLPWRTEGDRAAPQHLPLTPA
GACAGAGGGAGACAGGGCTGCTCCCCAACACCTG
UGGAGGACAGAGGGAGACAGGGCUGCUCC
FPEQSLQKGSDSTKATGSPGLSLLHPPAS
CCTCTGACCCCAGCATTTCCAGAGCAGAGCCTACA
CCAACACCUGCCUCUGACCCCAGCAUUUCC
HSAPHLAWCRETSAPLSWAARSAPGNG
GAAGGGCAGTGACTCGACAAAGGCCACAGGCAG
AGAGCAGAGCCUACAGAAGGGCAGUGACU
TCCAGGCCTCTCTCTGCTCCATCCCCCTGCCTCCCA
CGACAAAGGCCACAGGCAGUCCAGGCCUCU
TTCTGCACCACACCTGGCATGGTGCAGGGAGACA
CUCUGCUCCAUCCCCCUGCCUCCCAUUCUG
TCTGCACCCCTGAGTTGGGCAGCCAGGAGTGCCC
CACCACACCUGGCAUGGUGCAGGGAGACA
CCGGGAATGGATAA
UCUGCACCCCUGAGUUGGGCAGCCAGGAG
UGCCCCCGGGAAUGGAUAA
16
238/537
CTGTCTGACTTCCTCCCAGCACATTCCTGCACTCTG
CUGUCUGACUUCCUCCCAGCACAUUCCUGC
LSDFLPAHSCTLPCPHCPTDPVLQACCQL
CCGTGTCCACACTGCCCCACAGACCCAGTCCTCCA
ACUCUGCCGUGUCCACACUGCCCCACAGAC
PASPSGWALPRCQQAALGWG
AGCCTGCTGCCAGCTCCCTGCAAGCCCCTCAGGTT
CCAGUCCUCCAAGCCUGCUGCCAGCUCCCU
GGGCCTTGCCACGGTGCCAGCAGGCAGCCCTGGG
GCAAGCCCCUCAGGUUGGGCCUUGCCACG
CTGGGGGTAG
GUGCCAGCAGGCAGCCCUGGGCUGGGGGU
AG
17
239/538
GTGCCTGGCCCAGGGCCTGAAACCCGCCCTCAGG
GUGCCUGGCCCAGGGCCUGAAACCCGCCCU
VPGPGPETRPQGPLSCLPCLSQALRWGK
GGCCTCTCTCCTGCCTGCCCTGCCTCAGCCAGGCC
CAGGGGCCUCUCUCCUGCCUGCCCUGCCUC
TGSLLCCSDLPEPKVREGQPTAPGGYGSS
CTGAGATGGGGGAAAACGGGCAGTCTGCTCTGCT
AGCCAGGCCCUGAGAUGGGGGAAAACGGG
GAFGTLGQGCFLLRPSCSPGGQRETGLLP
GCTCTGACCTTCCAGAGCCCAAGGTCAGGGAGGG
CAGUCUGCUCUGCUGCUCUGACCUUCCAG
NTCL
GCAACCAACTGCCCCAGGGGGATATGGGTCCTCT
AGCCCAAGGUCAGGGAGGGGCAACCAACU
GGGGCCTTCGGGACCCTGGGGCAGGGGTGCTTCC
GCCCCAGGGGGAUAUGGGUCCUCUGGGGC
TCCTCAGGCCCAGCTGCTCCCCTGGAGGACAGAG
CUUCGGGACCCUGGGGCAGGGGUGCUUCC
GGAGACAGGGCTGCTCCCCAACACCTGCCTCTGA
UCCUCAGGCCCAGCUGCUCCCCUGGAGGAC
AGAGGGAGACAGGGCUGCUCCCCAACACC
UGCCUCUGA
18
240/539
GTGACTCGACAAAGGCCACAGGCAGTCCAGGCCT
GUGACUCGACAAAGGCCACAGGCAGUCCA
VTRQRPQAVQASLCSIPLPPILHHTWHG
CTCTCTGCTCCATCCCCCTGCCTCCCATTCTGCACC
GGCCUCUCUCUGCUCCAUCCCCCUGCCUCC
AGRHLHP
ACACCTGGCATGGTGCAGGGAGACATCTGCACCC
CAUUCUGCACCACACCUGGCAUGGUGCAG
CTGA
GGAGACAUCUGCACCCCUGA
TP53 neo-
antigen
DNA sequence
RNA sequence
Protein sequence
1
241/540
CTGACATTCTCCACTTCTTGTTCCCCACTGACAGCC
CUGACAUUCUCCACUUCUUGUUCCCCACU
LTFSTSCSPLTASHPHLSLPCHFGFWVFEP
TCCCACCCCCATCTCTCCCTCCCCTGCCATTTTGGG
GACAGCCUCCCACCCCCAUCUCUCCCUCCCC
LLAIGVRQKHPGLPFALSRGSTEQVGLH
TTTTGGGTCTTTGAACCCTTGCTTGCAATAGGTGT
UGCCAUUUUGGGUUUUGGGUCUUUGAAC
WCFVVGRRMGSRTYQLRF
GCGTCAGAAGCACCCAGGACTTCCATTTGCTTTGT
CCUUGCUUGCAAUAGGUGUGCGUCAGAAG
CCCGGGGCTCCACTGAACAAGTTGGCCTGCACTG
CACCCAGGACUUCCAUUUGCUUUGUCCCG
GTGTTTTGTTGTGGGGAGGAGGATGGGGAGTAG
GGGCUCCACUGAACAAGUUGGCCUGCACU
GACATACCAGCTTAGATTTTAA
GGUGUUUUGUUGUGGGGAGGAGGAUGG
GGAGUAGGACAUACCAGCUUAGAUUUUAA
2
242/541
TTGGGCAGCTGGTTAGGTAGAGGGAGTTGTCAAG
UUGGGCAGCUGGUUAGGUAGAGGGAGUU
LGSWLGRGSCQVSAGPAKPCLTTSW
TCTCTGCTGGCCCAGCCAAACCCTGTCTGACAACC
GUCAAGUCUCUGCUGGCCCAGCCAAACCCU
TCTTGGTGA
GUCUGACAACCUCUUGGUGA
3
243/542
ATGGGATTGGGGTTTTCCCCTCCCATGTGCTCAAG
AUGGGAUUGGGGUUUUCCCCUCCCAUGU
MGLGFSPPMCSRLALKVLSFSKV
ACTGGCGCTAAAAGTTTTGAGCTTCTCAAAAGTCT
GCUCAAGACUGGCGCUAAAAGUUUUGAGC
AG
UUCUCAAAAGUCUAG
4
244/543
TTGTTCCCTCTCCCTGTTGGTCGGTGGGTTGGTAG
UUGUUCCCUCUCCCUGUUGGUCGGUGGG
LFPLPVGRWVGSFYSWAAG
TTTCTACAGTTGGGCAGCTGGTTAG
UUGGUAGUUUCUACAGUUGGGCAGCUGG
UUAG
5
245/544
CTGACTCAGACTGACATTCTCCACTTCTTGTTCCCC
CUGACUCAGACUGACAUUCUCCACUUCUU
LTQTDILHFLFPTDSLPPPSLPPLPFWVLG
ACTGACAGCCTCCCACCCCCATCTCTCCCTCCCCTG
GUUCCCCACUGACAGCCUCCCACCCCCAUC
L
CCATTTTGGGTTTTGGGTCTTTGA
UCUCCCUCCCCUGCCAUUUUGGGUUUUGG
GUCUUUGA
6
246/545
CTGTCCCTCACTGTTGAATTTTCTCTAACTTCAAGG
CUGUCCCUCACUGUUGAAUUUUCUCUAAC
LSLTVEFSLTSRPISVKCWHLHLPHRVHCE
CCCATATCTGTGAAATGCTGGCATTTGCACCTACC
UUCAAGGCCCAUAUCUGUGAAAUGCUGGC
G
TCACAGAGTGCATTGTGAGGGTTAA
AUUUGCACCUACCUCACAGAGUGCAUUGU
GAGGGUUAA
7
247/546
CTGGCCTTGAAACCACCTTTTATTACATGGGGTCT
CUGGCCUUGAAACCACCUUUUAUUACAUG
LALKPPFITWGLELDPLEGACSLSLLVGGL
AGAACTTGACCCCCTTGAGGGTGCTTGTTCCCTCT
GGGUCUAGAACUUGACCCCCUUGAGGGUG
VVSTVGQLVR
CCCTGTTGGTCGGTGGGTTGGTAGTTTCTACAGTT
CUUGUUCCCUCUCCCUGUUGGUCGGUGGG
GGGCAGCTGGTTAGGTAG
UUGGUAGUUUCUACAGUUGGGCAGCUGG
UUAGGUAG
8
248/547
TTGTCAAGTCTCTGCTGGCCCAGCCAAACCCTGTC
UUGUCAAGUCUCUGCUGGCCCAGCCAAAC
LSSLCWPSQTLSDNLLVNLST
TGACAACCTCTTGGTGAACCTTAGTACCTAA
CCUGUCUGACAACCUCUUGGUGAACCUUA
GUACCUAA
9
249/548
CTGGATCCACCAAGACTTGTTTTATGCTCAGGGTC
CUGGAUCCACCAAGACUUGUUUUAUGCUC
LDPPRLVLCSGSISFFFFFFFFLFL
AATTTCTTTTTTCTTTTTTTTTTTTTTTTTTCTTTTTCT
AGGGUCAAUUUCUUUUUUCUUUUUUUUU
TTGA
UUUUUUUUUCUUUUUCUUUGA
10
250/549
GTGTTGCCCAGGCTGGTCTCAAACTCCTGGGCTCA
GUGUUGCCCAGGCUGGUCUCAAACUCCU
GVLPRLVSNSWAQAIHLSQPPRVLGLQL
GGCGATCCACCTGTCTCAGCCTCCCAGAGTGCTGG
GGCUCAGGCGAUCCACCUGUCUCAGCCUCC
GATTACAATTGTGA
CAGAGUGCUGGGAUUACAAUUGUGA
ERG neo-
antigen
DNA sequences
RNA sequences
Protein sequences
1
251/550
CTGGACCGGTCACGGCCACCCCACGCCCCAGTCG
CUGGACCGGUCACGGCCACCCCACGCCCCA
LDRSRPPHAPVESCSTISFHSAQN
AAAGCTGCTCAACCATCTCCTTCCACAGTGCCCAA
GUCGAAAGCUGCUCAACCAUCUCCUUCCAC
AACTGA
AGUGCCCAAAACUGA
2
252/551
TTGGCAGTTCCTCCTGGAGCTCCTGTCGGACAGCT
UUGGCAGUUCCUCCUGGAGCUCCUGUCGG
LAVPPGAPVGQLQLQLHHLGRHQRGVQ
CCAACTCCAGCTGCATCACCTGGGAAGGCACCAA
ACAGCUCCAACUCCAGCUGCAUCACCUGGG
DDGSRRGGPALGRAEEQTQHELR
CGGGGAGTTCAAGATGACGGATCCCGACGAGGT
AAGGCACCAACGGGGAGUUCAAGAUGACG
GGCCCGGCGCTGGGGAGAGCGGAAGAGCAAACC
GAUCCCGACGAGGUGGCCCGGCGCUGGGG
CAACATGAACTACGATAA
AGAGCGGAAGAGCAAACCCAACAUGAACUA
CGAUAA
3
253/552
GTGGCCCATCAACAGACGTTGATATGCAACTGCAT
GUGGCCCAUCAACAGACGUUGAUAUGCAA
VAHQQTLICNCMACAVLVEIKYIPFDGQL
GGCATGTGCTGTTTTGGTTGAAATCAAATACATTC
CUGCAUGGCAUGUGCUGUUUUGGUUGAA
SAFSNCEDDPKFPTPLQYYRDYELKGGTE
CGTTTGATGGACAGCTGTCAGCTTTCTCAAACTGT
AUCAAAUACAUUCCGUUUGAUGGACAGCU
DVYRVSV
GAAGATGACCCAAAGTTTCCAACTCCTTTACAGTA
GUCAGCUUUCUCAAACUGUGAAGAUGACC
TTACCGGGACTATGAACTAAAAGGTGGGACTGAG
CAAAGUUUCCAACUCCUUUACAGUAUUAC
GATGTGTATAGAGTGAGCGTGTGA
CGGGACUAUGAACUAAAAGGUGGGACUGA
GGAUGUGUAUAGAGUGAGCGUGUGA
4
254/553
CTGGACTCAGGACATTTGGGGACTGTGTACAATG
CUGGACUCAGGACAUUUGGGGACUGUGU
LDSGHLGTVYNELWRLEGSCSQCYTKPS
AGTTATGGAGACTCGAGGGTTCATGCAGTCAGTG
ACAAUGAGUUAUGGAGACUCGAGGGUUCA
VRRKDTA
TTATACCAAACCCAGTGTTAGGAGAAAGGACACA
UGCAGUCAGUGUUAUACCAAACCCAGUGU
GCGTAA
UAGGAGAAAGGACACAGCGUAA
5
255/554
ATGGCCTTCTTGGCTGCCACAATCAGAAATCACGC
AUGGCCUUCUUGGCUGCCACAAUCAGAAA
MAFLAATIRNHAGILGRRPPVFL
AGGCATTTTGGGTAGGCGGCCTCCAGTTTTCCTTT
UCACGCAGGCAUUUUGGGUAGGCGGCCUC
GA
CAGUUUUCCUUUGA
6
256/555
ATGCTTAATGTTTACAATATGAAGTTATTAGTTCTT
AUGCUUAAUGUUUACAAUAUGAAGUUAU
MLNVYNMKLLVLRMQNVCNKISLA
AGAATGCAGAATGTATGTAATAAAATAAGCTTGG
UAGUUCUUAGAAUGCAGAAUGUAUGUAA
CCTAG
UAAAAUAAGCUUGGCCUAG
7
257/556
TTGGGATTCACTAGCCCTGAGCCTGATGTTGCTGG
UUGGGAUUCACUAGCCCUGAGCCUGAUGU
LGFTSPEPDVAGYPLKTMFISII
CTATCCCTTGAAGACAATGTTTATTTCCATAATCTA
UGCUGGCUAUCCCUUGAAGACAAUGUUUA
G
UUUCCAUAAUCUAG
8
258/557
ATGTTTTCCATTAGATCCCTCAACCCTCCACCCCCA
AUGUUUUCCAUUAGAUCCCUCAACCCUCC
MFSIRSLNPPPPVQVISKSYEQLG
GTCCAGGTTATTAGCAAGTCTTATGAGCAACTGG
ACCCCCAGUCCAGGUUAUUAGCAAGUCUU
GATAA
AUGAGCAACUGGGAUAA
9
259/558
GTGGCACCTGCAACCCAGGAGCAGGAGCCGGAG
GUGGCACCUGCAACCCAGGAGCAGGAGCC
VAPATQEQEPEELPSDSRCSRDGYSSGA
GAGCTGCCCTCTGACAGCAGGTGCAGCAGAGATG
GGAGGAGCUGCCCUCUGACAGCAGGUGCA
GKVMGHRESTDVLQRPKVTHCLEKEKKI
GCTACAGCTCAGGAGCTGGGAAGGTGATGGGGC
GCAGAGAUGGCUACAGCUCAGGAGCUGGG
FFKKLVYLASH
ACAGGGAAAGCACAGATGTTCTGCAGCGCCCCAA
AAGGUGAUGGGGCACAGGGAAAGCACAGA
AGTGACCCATTGCCTGGAGAAAGAGAAGAAAATA
UGUUCUGCAGCGCCCCAAAGUGACCCAUU
TTTTTTAAAAAGCTAGTTTATTTAGCTTCTCATTAA
GCCUGGAGAAAGAGAAGAAAAUAUUUUUU
AAAAAGCUAGUUUAUUUAGCUUCUCAUUA
A
10
260/559
ATGAATCAAAAGTGCCTCAAGAGGAATGAAAAAA
AUGAAUCAAAAGUGCCUCAAGAGGAAUGA
MNQKCLKRNEKSFTGAGEGSRGRDPKTL
GCTTTACTGGGGCTGGGGAAGGAAGCCGGGGAA
AAAAAGCUUUACUGGGGCUGGGGAAGGAA
GRELLKSYYRNEEDAKNVTNMDISSVD
GAGATCCAAAGACTCTTGGGAGGGAGTTACTGAA
GCCGGGGAAGAGAUCCAAAGACUCUUGGG
GTCTTACTACAGAAATGAGGAGGATGCTAAAAAT
AGGGAGUUACUGAAGUCUUACUACAGAAA
GTCACGAATATGGACATATCATCTGTGGACTGA
UGAGGAGGAUGCUAAAAAUGUCACGAAUA
UGGACAUAUCAUCUGUGGACUGA
11
261/560
GTGCTGTTTTGGTTGAAATCAAATACATTCCGTTT
GUGCUGUUUUGGUUGAAAUCAAAUACAU
VLFWLKSNTFRLMDSCQLSQTVKMTQSF
GATGGACAGCTGTCAGCTTTCTCAAACTGTGAAG
UCCGUUUGAUGGACAGCUGUCAGCUUUCU
QLLYSITGTMN
ATGACCCAAAGTTTCCAACTCCTTTACAGTATTACC
CAAACUGUGAAGAUGACCCAAAGUUUCCA
GGGACTATGAACTAA
ACUCCUUUACAGUAUUACCGGGACUAUGA
ACUAA
12
262/561
CTGTGTACAATGAGTTATGGAGACTCGAGGGTTC
CUGUGUACAAUGAGUUAUGGAGACUCGA
LCTMSYGDSRVHAVSVIPNPVLGERTQR
ATGCAGTCAGTGTTATACCAAACCCAGTGTTAGGA
GGGUUCAUGCAGUCAGUGUUAUACCAAAC
NGERGSSRIQKQKCASLSLFVK
GAAAGGACACAGCGTAATGGAGAAAGGGGAAGT
CCAGUGUUAGGAGAAAGGACACAGCGUAA
AGTAGAATTCAGAAACAAAAATGCGCATCTCTTTC
UGGAGAAAGGGGAAGUAGUAGAAUUCAG
TTTGTTTGTCAAATGA
AAACAAAAAUGCGCAUCUCUUUCUUUGUU
UGUCAAAUGA
13
263/562
GTGGGGGCTTTGTTCTCCACAGGGTCAGGTAAGA
GUGGGGGCUUUGUUCUCCACAGGGUCAG
VGALFSTGSGKRWPSWLPQSEITQAFW
GATGGCCTTCTTGGCTGCCACAATCAGAAATCACG
GUAAGAGAUGGCCUUCUUGGCUGCCACAA
VGGLQFSFESRTLCVCQNEVYKSMFFPLF
CAGGCATTTTGGGTAGGCGGCCTCCAGTTTTCCTT
UCAGAAAUCACGCAGGCAUUUUGGGUAGG
I
TGAGTCGCGAACGCTGTGCGTTTGTCAGAATGAA
CGGCCUCCAGUUUUCCUUUGAGUCGCGAA
GTATACAAGTCAATGTTTTTCCCCCTTTTTATATAA
CGCUGUGCGUUUGUCAGAAUGAAGUAUAC
AAGUCAAUGUUUUUCCCCCUUUUUAUAUA
A
14
264/563
ATGAAAACATGTGCTGAATGTTGTGGATTTTGTGT
AUGAAAACAUGUGCUGAAUGUUGUGGAU
MKTCAECCGFCVIIYFVQELVQGRAKEIG
TATAATTTACTTTGTCCAGGAACTTGTGCAAGGGA
UUUGUGUUAUAAUUUACUUUGUCCAGGA
CLAPKWRQPLQVLLASPPVFYF
GAGCCAAGGAAATAGGATGTTTGGCACCCAAATG
ACUUGUGCAAGGGAGAGCCAAGGAAAUAG
GCGTCAGCCTCTCCAGGTCCTTCTTGCCTCCCCTCC
GAUGUUUGGCACCCAAAUGGCGUCAGCCU
TGTCTTTTATTTCTAG
CUCCAGGUCCUUCUUGCCUCCCCUCCUGUC
UUUUAUUUCUAG
15
265/564
TTGGAACAGAAGGACCCCGGGTTTCACATTGGAG
UUGGAACAGAAGGACCCCGGGUUUCACAU
LEQKDPGFHIGASIFMPGMERGL
CCTCCATATTTATGCCTGGAATGGAAAGAGGCCTA
UGGAGCCUCCAUAUUUAUGCCUGGAAUGG
TGA
AAAGAGGCCUAUGA
16
266/565
GTGAACAAGCTACCACTCGTAAGGCAAACTGTATT
GUGAACAAGCUACCACUCGUAAGGCAAAC
VNKLPLVRQTVLLLANKASWIAAISHFTET
ATTACTGGCAAATAAAGCGTCATGGATAGCTGCA
UGUAUUAUUACUGGCAAAUAAAGCGUCAU
RDNV
ATTTCTCACTTTACAGAAACAAGGGATAACGTCTA
GGAUAGCUGCAAUUUCUCACUUUACAGAA
G
ACAAGGGAUAACGUCUAG
17
267/566
CTGCTACATCAGAGTTACCTGGGCACTGTGGCTTG
CUGCUACAUCAGAGUUACCUGGGCACUGU
LLHQSYLGTVAWDSLALSLMLLAIP
GGATTCACTAGCCCTGAGCCTGATGTTGCTGGCTA
GGCUUGGGAUUCACUAGCCCUGAGCCUGA
TCCCTTGA
UGUUGCUGGCUAUCCCUUGA
18
268/567
ATGCTGCCAACCTTGGTCCAGGTGAAGGCAACTC
AUGCUGCCAACCUUGGUCCAGGUGAAGGC
MLPTLVQVKATQKVKIQGDRAKALAETS
AAAAGGTGAAAATACAAGGTGACCGTGCGAAGG
AACUCAAAAGGUGAAAAUACAAGGUGACC
CGCTAGCCGAAACATCTTAG
GUGCGAAGGCGCUAGCCGAAACAUCUUAG
19
269/568
GTGCTGTGGATGTTTTGGCAGTGGGCTTTGAAGT
GUGCUGUGGAUGUUUUGGCAGUGGGCUU
VLWMFWQWALKSAGHDYQCS
CGGCAGGACACGATTACCAATGCTCCTGA
UGAAGUCGGCAGGACACGAUUACCAAUGC
UCCUGA
20
270/569
GTGTCATTTGGATTAGACGGAGCCCAACCATCCAT
GUGUCAUUUGGAUUAGACGGAGCCCAACC
VSFGLDGAQPSIILQQPGKAHKVPVSP
CATTTTGCAGCAGCCTGGGAAGGCCCACAAAGTG
AUCCAUCAUUUUGCAGCAGCCUGGGAAGG
CCCGTATCTCCTTAG
CCCACAAAGUGCCCGUAUCUCCUUAG
21
271/570
ATGATTATCATATTCAATTTTAACAGATGTTTTCCA
AUGAUUAUCAUAUUCAAUUUUAACAGAUG
MIIIFNFNRCFPLDPSTLHPQSRLLASLMS
TTAGATCCCTCAACCCTCCACCCCCAGTCCAGGTT
UUUUCCAUUAGAUCCCUCAACCCUCCACCC
NWDNFG
ATTAGCAAGTCTTATGAGCAACTGGGATAATTTTG
CCAGUCCAGGUUAUUAGCAAGUCUUAUGA
GATAA
GCAACUGGGAUAAUUUUGGAUAA
22
272/571
CTGTACAATCTTACTCCTGCTGGCAAGAGATTTGT
CUGUACAAUCUUACUCCUGCUGGCAAGAG
LYNLTPAGKRFVFFSCLQLAFGPCMW
CTTCTTTTCTTGTCTTCAATTGGCTTTCGGGCCTTG
AUUUGUCUUCUUUUCUUGUCUUCAAUUG
TATGTGGTAA
GCUUUCGGGCCUUGUAUGUGGUAA
23
273/572
ATGTGGCACCTGCAACCCAGGAGCAGGAGCCGGA
AUGUGGCACCUGCAACCCAGGAGCAGGAG
MWHLQPRSRSRRSCPLTAGAAEMATAQ
GGAGCTGCCCTCTGACAGCAGGTGCAGCAGAGAT
CCGGAGGAGCUGCCCUCUGACAGCAGGUG
ELGR
GGCTACAGCTCAGGAGCTGGGAAGGTGA
CAGCAGAGAUGGCUACAGCUCAGGAGCUG
GGAAGGUGA
24
274/573
GTGAGGACCAGTCGTTGTTTGAGTGTGCCTACGG
GUGAGGACCAGUCGUUGUUUGAGUGUGC
VRTSRCLSVPTERHTWLRQR
AACGCCACACCTGGCTAAGACAGAGATGA
CUACGGAACGCCACACCUGGCUAAGACAGA
GAUGA
25
275/574
ATGGCTCAAGGAACTCTCCTGATGAATGCAGTGT
AUGGCUCAAGGAACUCUCCUGAUGAAUGC
MAQGTLLMNAVWPKAGRWWAAQTPL
GGCCAAAGGCGGGAAGATGGTGGGCAGCCCAGA
AGUGUGGCCAAAGGCGGGAAGAUGGUGG
G
CACCGTTGGGATGA
GCAGCCCAGACACCGUUGGGAUGA
26
276/575
ATGGCCTTCCAGACGTCAACATCTTGTTATTCCAG
AUGGCCUUCCAGACGUCAACAUCUUGUUA
MAFQTSTSCYSRTSMGRNCAR
AACATCGATGGGAAGGAACTGTGCAAGATGA
UUCCAGAACAUCGAUGGGAAGGAACUGUG
CAAGAUGA
27
277/576
ATGCTAGAAACACAGGGGGTGCAGCTTTTATTTTC
AUGCUAGAAACACAGGGGGUGCAGCUUUU
MLETQGVQLLFSQ1LQYILKLRKELQLGQI
CCAAATACTTCAGTATATCCTGAAGCTACGCAAAG
AUUUUCCCAAAUACUUCAGUAUAUCCUGA
YHMSPPGDQPGPVTATPRPSRKLLNHLL
AATTACAACTAGGCCAGATTTACCATATGAGCCCC
AGCUACGCAAAGAAUUACAACUAGGCCAG
PQCPKLKTSVLS
CCAGGAGATCAGCCTGGACCGGTCACGGCCACCC
AUUUACCAUAUGAGCCCCCCAGGAGAUCA
CACGCCCCAGTCGAAAGCTGCTCAACCATCTCCTT
GCCUGGACCGGUCACGGCCACCCCACGCCC
CCACAGTGCCCAAAACTGAAGACCAGCGTCCTCA
CAGUCGAAAGCUGCUCAACCAUCUCCUUCC
GTTAG
ACAGUGCCCAAAACUGAAGACCAGCGUCCU
CAGUUAG
28
278/577
TTGGACCAACAAGTAGCCGCCTTGCAAATCCAGG
UUGGACCAACAAGUAGCCGCCUUGCAAAU
LDQQVAALQIQAVARSSFGSSSWSSCRT
CAGTGGCCAGATCCAGCTTTGGCAGTTCCTCCTGG
CCAGGCAGUGGCCAGAUCCAGCUUUGGCA
APTPAASPGKAPTGSSR
AGCTCCTGTCGGACAGCTCCAACTCCAGCTGCATC
GUUCCUCCUGGAGCUCCUGUCGGACAGCU
ACCTGGGAAGGCACCAACGGGGAGTTCAAGATG
CCAACUCCAGCUGCAUCACCUGGGAAGGCA
A
CCAACGGGGAGUUCAAGAUGA
29
279/578
ATGGGAAGCGCTACGCCTACAAGTTCGACTTCCAC
AUGGGAAGCGCUACGCCUACAAGUUCGAC
MGSATPTSSTSTGSPRPSSPTPRSHLCTST
GGGATCGCCCAGGCCCTCCAGCCCCACCCCCCGG
UUCCACGGGAUCGCCCAGGCCCUCCAGCCC
PQTSRTWAPITPTHRR
AGTCATCTCTGTACAAGTACCCCTCAGACCTCCCG
CACCCCCCGGAGUCAUCUCUGUACAAGUAC
TACATGGGCTCCTATCACGCCCACCCACAGAAGAT
CCCUCAGACCUCCCGUACAUGGGCUCCUAU
GA
CACGCCCACCCACAGAAGAUGA
30
280/579
TTGCTGCCCCAAACCCATACTGGAATTCACCAACT
UUGCUGCCCCAAACCCAUACUGGAAUUCAC
LLPQTHTGIHQLGVYTPTLGSPPAICLLIW
GGGGGTATATACCCCAACACTAGGCTCCCCACCA
CAACUGGGGGUAUAUACCCCAACACUAGG
ALTTKDLAEAFPISVHSPAHRHKLYRRT
GCCATATGCCTTCTCATCTGGGCACTTACTACTAA
CUCCCCACCAGCCAUAUGCCUUCUCAUCUG
AGACCTGGCGGAGGCTTTTCCCATCAGCGTGCATT
GGCACUUACUACUAAAGACCUGGCGGAGG
CACCAGCCCATCGCCACAAACTCTATCGGAGAACA
CUUUUCCCAUCAGCGUGCAUUCACCAGCCC
TGA
AUCGCCACAAACUCUAUCGGAGAACAUGA
31
281/580
ATGAAAAAAGCTTTACTGGGGCTGGGGAAGGAA
AUGAAAAAAGCUUUACUGGGGCUGGGGAA
MKKALLGLGKEAGEEIQRLLGGSY
GCCGGGGAAGAGATCCAAAGACTCTTGGGAGGG
GGAAGCCGGGGAAGAGAUCCAAAGACUCU
AGTTACTGA
UGGGAGGGAGUUACUGA
32
282/581
ATGAGGAGGATGCTAAAAATGTCACGAATATGGA
AUGAGGAGGAUGCUAAAAAUGUCACGAAU
MRRMLKMSRIWTYHLWTDLVKDSVCRS
CATATCATCTGTGGACTGACCTTGTAAAAGACAGT
AUGGACAUAUCAUCUGUGGACUGACCUUG
MKS
GTATGTAGAAGCATGAAGTCTTAA
UAAAAGACAGUGUAUGUAGAAGCAUGAAG
UCUUAA
33
283/582
CTGTATTTAAAAATAGAAACATATCAAAAACAAGA
CUGUAUUUAAAAAUAGAAACAUAUCAAAA
LYLKIETYQKQEKRHERDCGPSTDVDMQ
GAAAAGACACGAGAGAGACTGTGGCCCATCAACA
ACAAGAGAAAAGACACGAGAGAGACUGUG
LHGMCCFG
GACGTTGATATGCAACTGCATGGCATGTGCTGTTT
GCCCAUCAACAGACGUUGAUAUGCAACUG
TGGTTGA
CAUGGCAUGUGCUGUUUUGGUUGA
34
284/583
GTGATTGTAGACAGAGGGGTGAAGAAGGAGGAG
GUGAUUGUAGACAGAGGGGUGAAGAAGG
VIVDRGVKKEEEEAEKEETRAGKETSQA
GAAGAGGCAGAGAAGGAGGAGACCAGGGCTGG
AGGAGGAAGAGGCAGAGAAGGAGGAGACC
MKTGLRTFGDCVQ
GAAAGAAACTTCTCAAGCAATGAAGACTGGACTC
AGGGCUGGGAAAGAAACUUCUCAAGCAAU
AGGACATTTGGGGACTGTGTACAATGA
GAAGACUGGACUCAGGACAUUUGGGGACU
GUGUACAAUGA
35
285/584
ATGGAGAAAGGGGAAGTAGTAGAATTCAGAAAC
AUGGAGAAAGGGGAAGUAGUAGAAUUCA
MEKGEVVEFRNKNAHLFLCLSNENFNW
AAAAATGCGCATCTCTTTCTTTGTTTGTCAAATGAA
GAAACAAAAAUGCGCAUCUCUUUCUUUGU
NCLIFKRNIQDLIIMWGLCSPQGQVRDG
AATTTTAACTGGAATTGTCTGATATTTAAGAGAAA
UUGUCAAAUGAAAAUUUUAACUGGAAUU
LLGCHNQKSRRHFG
CATTCAGGACCTCATCATTATGTGGGGGCTTTGTT
GUCUGAUAUUUAAGAGAAACAUUCAGGAC
CTCCACAGGGTCAGGTAAGAGATGGCCTTCTTGG
CUCAUCAUUAUGUGGGGGCUUUGUUCUC
CTGCCACAATCAGAAATCACGCAGGCATTTTGGGT
CACAGGGUCAGGUAAGAGAUGGCCUUCUU
AG
GGCUGCCACAAUCAGAAAUCACGCAGGCA
UUUUGGGUAG
36
286/585
TTGAGTCGCGAACGCTGTGCGTTTGTCAGAATGA
UUGAGUCGCGAACGCUGUGCGUUUGUCA
LSRERCAFVRMKYTSQCFSPFLYNNYITY
AGTATACAAGTCAATGTTTTTCCCCCTTTTTATATA
GAAUGAAGUAUACAAGUCAAUGUUUUUCC
AFIHYELISASQRHTTKETIDIMWP
ATAATTATATAACTTATGCATTTATACACTACGAGT
CCCUUUUUAUAUAAUAAUUAUAUAACUUA
TGATCTCGGCCAGCCAAAGACACACGACAAAAGA
UGCAUUUAUACACUACGAGUUGAUCUCGG
GACAATCGATATAATGTGGCCTTGA
CCAGCCAAAGACACACGACAAAAGAGACAA
UCGAUAUAAUGUGGCCUUGA
37
287/586
TTGCCTCCCCTCCTGTCTTTTATTTCTAGCCCCTTTT
UUGCCUCCCCUCCUGUCUUUUAUUUCUAG
LPPLLSFISSPFWNRRTPGFTLEPPYLCLE
GGAACAGAAGGACCCCGGGTTTCACATTGGAGCC
CCCCUUUUGGAACAGAAGGACCCCGGGUU
WKEAYEAGVVIEKF
TCCATATTTATGCCTGGAATGGAAAGAGGCCTATG
UCACAUUGGAGCCUCCAUAUUUAUGCCUG
AAGCTGGGGTTGTCATTGAGAAATTCTAG
GAAUGGAAAGAGGCCUAUGAAGCUGGGG
UUGUCAUUGAGAAAUUCUAG
38
288/587
CTGCAATTTCTCACTTTACAGAAACAAGGGATAAC
CUGCAAUUUCUCACUUUACAGAAACAAGG
LQFLTLQKQGITSRFAAGFLFQELSLGRQL
GTCTAGATTTGCTGCGGGGTTTCTCTTTCAGGAGC
GAUAACGUCUAGAUUUGCUGCGGGGUUU
TCTCACTAGGTAGACAGCTTTAG
CUCUUUCAGGAGCUCUCACUAGGUAGACA
GCUUUAG
39
289/588
CTGGCAGTTAGGCTGACCCAAACTGTGCTAATGG
CUGGCAGUUAGGCUGACCCAAACUGUGCU
LAVRLTQTVLMEKISHFYFGMQSQDTPT
AAAAGATCAGTCATTTTTATTTTGGAATGCAAAGT
AAUGGAAAAGAUCAGUCAUUUUUAUUUU
FFIEIHIYLDNHSVLSSRLSHEQDHKNKT
CAAGACACACCTACATTCTTCATAGAAATACACAT
GGAAUGCAAAGUCAAGACACACCUACAUU
TTACTTGGATAATCACTCAGTTCTCTCTTCAAGACT
CUUCAUAGAAAUACACAUUUACUUGGAUA
GTCTCATGAGCAAGATCATAAAAACAAGACATGA
AUCACUCAGUUCUCUCUUCAAGACUGUCU
CAUGAGCAAGAUCAUAAAAACAAGACAUG
A
40
290/589
TTGGATAACATGATAATACTGAGTTCCTTCAAATA
UUGGAUAACAUGAUAAUACUGAGUUCCUU
LDNMIILSSFKYIILKLFQNGINSLLLL
CATAATTCTTAAATTGTTTCAAAATGGCATTAACTC
CAAAUACAUAAUUCUUAAAUUGUUUCAAA
TCTGTTACTGTTGTAA
AUGGCAUUAACUCUCUGUUACUGUUGUA
A
41
291/590
TTGCATGAACCTTTTCTCTCTGTTTGTCCCTGTCTCT
UUGCAUGAACCUUUUCUCUCUGUUUGUCC
LHEPFLSVCPCLLACPDVYSDSCTILLLLAR
TGGCTTGCCCTGATGTATACTCAGACTCCTGTACA
CUGUCUCUUGGCUUGCCCUGAUGUAUACU
DLSSFLVFNWLSGLVCGKITKSQSRLCFCS
ATCTTACTCCTGCTGGCAAGAGATTTGTCTTCTTTT
CAGACUCCUGUACAAUCUUACUCCUGCUG
CTTGTCTTCAATTGGCTTTCGGGCCTTGTATGTGGT
GCAAGAGAUUUGUCUUCUUUUCUUGUCU
AAAATCACCAAATCACAGTCAAGACTGTGTTTTTG
UCAAUUGGCUUUCGGGCCUUGUAUGUGG
TTCCTAG
UAAAAUCACCAAAUCACAGUCAAGACUGU
GUUUUUGUUCCUAG
42
292/591
ATGTCCCGGAGGGGTTCACAAAGTGCTTTGTCAG
AUGUCCCGGAGGGGUUCACAAAGUGCUUU
MSRRGSQSALSGLLQLEGSLLLLSLLHRC
GACTGCTGCAGTTAGAAGGCTCACTGCTTCTCCTA
GUCAGGACUGCUGCAGUUAGAAGGCUCAC
GTCNPGAGAGGAAL
AGCCTTCTGCACAGATGTGGCACCTGCAACCCAG
UGCUUCUCCUAAGCCUUCUGCACAGAUGU
GAGCAGGAGCCGGAGGAGCTGCCCTCTGA
GGCACCUGCAACCCAGGAGCAGGAGCCGGA
GGAGCUGCCCUCUGA
43
293/592
GTGCAGCAGAGATGGCTACAGCTCAGGAGCTGG
GUGCAGCAGAGAUGGCUACAGCUCAGGAG
VQQRWLQLRSWEGDGAQGKHRCSAAP
GAAGGTGATGGGGCACAGGGAAAGCACAGATGT
CUGGGAAGGUGAUGGGGCACAGGGAAAGC
QSDPLPGEREENIF
TCTGCAGCGCCCCAAAGTGACCCATTGCCTGGAG
ACAGAUGUUCUGCAGCGCCCCAAAGUGAC
AAAGAGAAGAAAATATTTTTTAA
CCAUUGCCUGGAGAAAGAGAAGAAAAUAU
UUUUUAA
44 (BSS)
294/593
ATGCAGTGTGGCCAAAGGCGGGAAGATGGTGGG
AUGCAGUGUGGCCAAAGGCGGGAAGAUGG
MQCGQRREDGGQPRHRWDELRQLHG
CAGCCCAGACACCGTTGGGATGAACTACGGCAGC
UGGGCAGCCCAGACACCGUUGGGAUGAAC
GEAHATPKHDHERAQSYRASRNSPDECS
TACATGGAGGAGAAGCACATGCCACCCCCAAACA
UACGGCAGCUACAUGGAGGAGAAGCACAU
VAKGGKMVGSPDTVGMNYGSYMEEKH
TGACCACGAACGAGCGCAGAGTTATCGTGCCAGC
GCCACCCCCAAACAUGACCACGAACGAGCG
MPPPNMTTNERRVIVPA
AGGAACTCTCCTGATGAATGCAGTGTGGCCAAAG
CAGAGUUAUCGUGCCAGCAGGAACUCUCC
GCGGGAAGATGGTGGGCAGCCCAGACACCGTTG
UGAUGAAUGCAGUGUGGCCAAAGGCGGGA
GGATGAACTACGGCAGCTACATGGAGGAGAAGC
AGAUGGUGGGCAGCCCAGACACCGUUGGG
ACATGCCACCCCCAAACATGACCACGAACGAGCG
AUGAACUACGGCAGCUACAUGGAGGAGAA
CAGAGTTATCGTGCCAGCA
GCACAUGCCACCCCCAAACAUGACCACGAA
CGAGCGCAGAGUUAUCGUGCCAGCA
45 (BSS)
295/594
GTGGCCAAAGGCGGGAAGATGGTGGGCAGCCCA
GUGGCCAAAGGCGGGAAGAUGGUGGGCA
VAKGGKMVGSPDTVGMNYGSYMEEKH
GACACCGTTGGGATGAACTACGGCAGCTACATGG
GCCCAGACACCGUUGGGAUGAACUACGGC
MPPPNMTTNERRVIVPAGTLLMNAVW
AGGAGAAGCACATGCCACCCCCAAACATGACCAC
AGCUACAUGGAGGAGAAGCACAUGCCACCC
PKAGRWWAAQTPLG
GAACGAGCGCAGAGTTATCGTGCCAGCAGGAACT
CCAAACAUGACCACGAACGAGCGCAGAGUU
CTCCTGATGAATGCAGTGTGGCCAAAGGCGGGAA
AUCGUGCCAGCAGGAACUCUCCUGAUGAA
GATGGTGGGCAGCCCAGACACCGTTGGGATGA
UGCAGUGUGGCCAAAGGCGGGAAGAUGG
UGGGCAGCCCAGACACCGUUGGGAUGA
46
296/595
ATGGAGTACAGACCATGTGCGGCAGTGGCTGGA
AUGGAGUACAGACCAUGUGCGGCAGUGGC
MEYRPCAAVAGVGGERIWPSRRQHLVIP
(BSS + RCA)
GTGGGCGGTGAAAGAATATGGCCTTCCAGACGTC
UGGAGUGGGCGGUGAAAGAAUAUGGCCU
EHRWEGTVQDDQGRLPEAHPQLQRRH
AACATCTTGTTATTCCAGAACATCGATGGGAAGGA
UCCAGACGUCAACAUCUUGUUAUUCCAGA
PSLTSPLPQRESYAMEYRPCAAVAGVGG
ACTGTGCAAGATGACCAAGGACGACTTCCAGAGG
ACAUCGAUGGGAAGGAACUGUGCAAGAUG
ERIWPSRRQHLVIPEHRWEGTVQDDQG
CTCACCCCCAGCTACAACGCCGACATCCTTCTCTCA
ACCAAGGACGACUUCCAGAGGCUCACCCCC
RLPEAHPQLQRRHPSLTSPLPQR
CATCTCCACTACCTCAGAGAGAATCCTACGCTATG
AGCUACAACGCCGACAUCCUUCUCUCACAU
GAGTACAGACCATGTGCGGCAGTGGCTGGAGTG
CUCCACUACCUCAGAGAGAAUCCUACGCUA
GGCGGTGAAAGAATATGGCCTTCCAGACGTCAAC
UGGAGUACAGACCAUGUGCGGCAGUGGCU
ATCTTGTTATTCCAGAACATCGATGGGAAGGAACT
GGAGUGGGCGGUGAAAGAAUAUGGCCUU
GTGCAAGATGACCAAGGACGACTTCCAGAGGCTC
CCAGACGUCAACAUCUUGUUAUUCCAGAA
ACCCCCAGCTACAACGCCGACATCCTTCTCTCACAT
CAUCGAUGGGAAGGAACUGUGCAAGAUGA
CTCCACTACCTCAGAGA
CCAAGGACGACUUCCAGAGGCUCACCCCCA
GCUACAACGCCGACAUCCUUCUCUCACAUC
UCCACUACCUCAGAGA
47
297/596
GTGCGGCAGTGGCTGGAGTGGGCGGTGAAAGAA
GUGCGGCAGUGGCUGGAGUGGGCGGUGA
VRQWLEWAVKEYGLPDVNILLFQNIDGK
(BSS + RCA)
TATGGCCTTCCAGACGTCAACATCTTGTTATTCCA
AAGAAUAUGGCCUUCCAGACGUCAACAUC
ELCKMTKDDFQRLTPSYNADILLSHLHYL
GAACATCGATGGGAAGGAACTGTGCAAGATGACC
UUGUUAUUCCAGAACAUCGAUGGGAAGGA
RENPTLWSTDHVRQWLEWAVKEYGLPD
AAGGACGACTTCCAGAGGCTCACCCCCAGCTACA
ACUGUGCAAGAUGACCAAGGACGACUUCC
VNILLFQNIDGKELCKMTKDDFQRLTPSY
ACGCCGACATCCTTCTCTCACATCTCCACTACCTCA
AGAGGCUCACCCCCAGCUACAACGCCGACA
NADILLSHLHYLRE
GAGAGAATCCTACGCTATGGAGTACAGACCATGT
UCCUUCUCUCACAUCUCCACUACCUCAGAG
GCGGCAGTGGCTGGAGTGGGCGGTGAAAGAATA
AGAAUCCUACGCUAUGGAGUACAGACCAU
TGGCCTTCCAGACGTCAACATCTTGTTATTCCAGA
GUGCGGCAGUGGCUGGAGUGGGCGGUGA
ACATCGATGGGAAGGAACTGTGCAAGATGACCAA
AAGAAUAUGGCCUUCCAGACGUCAACAUC
GGACGACTTCCAGAGGCTCACCCCCAGCTACAAC
UUGUUAUUCCAGAACAUCGAUGGGAAGGA
GCCGACATCCTTCTCTCACATCTCCACTACCTCAGA
ACUGUGCAAGAUGACCAAGGACGACUUCC
GAG
AGAGGCUCACCCCCAGCUACAACGCCGACA
UCCUUCUCUCACAUCUCCACUACCUCAGAG
AG
48
298/597
TTGACTTCAGATGATGTTGATAAAGCCTTACAAAA
UUGACUUCAGAUGAUGUUGAUAAAGCCU
LTSDDVDKALQNSPRLMHARNTAPLPHL
(BSS + RCA)
CTCTCCACGGTTAATGCATGCTAGAAACACAGCTC
UACAAAACUCUCCACGGUUAAUGCAUGCU
TSDDVDKALQNSPRLMHARNT
CTCTTCCACATTTGACTTCAGATGATGTTGATAAA
AGAAACACAGCUCCUCUUCCACAUUUGAC
GCCTTACAAAACTCTCCACGGTTAATGCATGCTAG
UUCAGAUGAUGUUGAUAAAGCCUUACAAA
AAACACA
ACUCUCCACGGUUAAUGCAUGCUAGAAAC
ACA
49
299/598
GTGCAGCTTTTATTTTCCCAAATACTTCAGTATATC
GUGCAGCUUUUAUUUUCCCAAAUACUUCA
VQLLFSQILQYILKLRKELQLGQGVQLLFS
(BSS + RCA)
CTGAAGCTACGCAAAGAATTACAACTAGGCCAGG
GUAUAUCCUGAAGCUACGCAAAGAAUUAC
QILQYILKLRKELQLGQ
GGGTGCAGCTTTTATTTTCCCAAATACTTCAGTATA
AACUAGGCCAGGGGGUGCAGCUUUUAUU
TCCTGAAGCTACGCAAAGAATTACAACTAGGCCA
UUCCCAAAUACUUCAGUAUAUCCUGAAGC
G
UACGCAAAGAAUUACAACUAGGCCAG
REFERENCES
1. http://www.who.int/cancer/en/
2. Ott P A, et al., An immunogenic personal neoantigen vaccine for patients with melanoma. Nature 2017; 547(7662):217-221.
3. Sahin U, et al., Personalized RNA mutanome vaccines mobilize poly-specific therapeutic immunity against cancer. Nature 2017; 547(7662):222-226.
4. Guarnerio J, et al., Oncogenic Role of Fusion-circRNAs Derived from Cancer-Associated
Chromosomal Translocations. Cell 2016; 165(2):289-302.
5. Capel B, et al., Circular transcripts of the testis-determining gene Sry in adult mouse testis. Cell 1993; 73(5):1019-1030.
6. Salzman J, et al., Circular RNAs are the predominant transcript isoform from hundreds of human genes in diverse cell types. PLoS One 2012; 7(2):e30733.
7. Abe N, et al., Rolling Circle Translation of Circular RNA in Living Human Cells. Sci Rep 2015; 5:16435.
8. Legnini I, et al., Circ-ZNF609 Is a Circular RNA that Can Be Translated and Functions in Myogenesis. Mol Cell 2017; 66(1):22-37 e29.
9. Rowe R C et al., Handbook of Pharmaceutical Excipients. Pharmaceutical Press, 2012
10. Wen E P et al., Vaccine Development and Manufacturing, John Wiley & Sons, 17 Nov. 2014
11. http://en.wikipedia.org/wiki/Peptide_synthesis
12. http://en.wikipedia.org/wiki/Expression_vector
13. Creighton T E, Proteins—Structures and Molecular properties—Second edition. WH Freeman and Company.Source: ipg260217.zip (2026-02-17)