Gene YpAngola_A2097 details

Gene Information

Locus tag: YpAngola_A2097
Symbol: irp1
ID: 5800567
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Yersinia pestis Angola
Kingdom: Bacteria
Replicon accession: NC_010159
Strand:
Start bp: 2173181
End bp: 2182672
Gene Length: 9492 bp
Protein Length: 3163 aa
Translation table: 11
GC content: 61%
IMG OID: 641340009
Product: yersiniabactin synthetase, HMWP1 component
Protein accession: YP_001606555
Protein GI: 162419334
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3321] Polyketide synthase modules and related proteins
TIGRFAM ID:
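
The coordinate and length fields in this record are consistent with one another, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2173181
end_bp = 2182672

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, with the final stop codon not translated.
protein_length = gene_length // 3 - 1

print(gene_length)     # 9492, matching "Gene Length"
print(protein_length)  # 3163, matching "Protein Length"
```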


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATAACT TGCGCTTCTC TTCTGCGCCG ACAGCAGATT CCATTGATGC ATCGATCGCT 
CAACACTACC CGGACTGCGA ACCTGTCGCG GTTATCGGCT ACGCCTGCCA TTTTCCTGAA
TCGCCGGATG GCGAAACGTT CTGGCAAAAT CTGCTGGAAG GTCGTGAATG CAGCCGACGC
TTTACGCGCG AAGAGCTTCT GGCCGTCGGT CTGGATGCCG CCATCATTGA CGATCCTCAT
TATGTCAATA TCGGTACGGT GTTAGACAAC GCCGACTGCT TCGACGCCAC CCTGTTTGGC
TATTCGCGAC AGGAAGCGGA GTCGATGGAC CCGCAGCAGC GCCTATTTTT GCAGGCGGTC
TGGCATGCGC TGGAACATGC CGGTTATGCC CCCGGCGCCG TCCCCCATAA GACCGGCGTT
TTCGCCTCTT CCCGGATGAG TACCTACCCC GGTCGCGAAG CATTGAACGT GACAGAAGTC
GCGCAGGTAA AAGGTCTGCA ATCTCTGATG GGCAATGATA AAGACTATAT TGCCACCCGC
GCCGCGTACA AACTCAACCT GCACGGCCCG GCGTTATCGG TACAGACCGC CTGCTCCAGC
TCGCTGGTTG CCGTGCATCT GGCCTGTGAA AGCCTGCGCG CAGGCGAATC CGATATGGCG
GTTGCCGGCG GCGTGGCGCT CTCTTTCCCC CAGCAGGCAG GCTACCGCTA CCAGCCCGGA
ATGATTTTCT CTCCTGATGG TCACTGTCGT CCCTTTGACG CCTCGGCTGA GGGCACCTGG
GCCGGTAACG GTCTCGGCTG CGTGGTGCTG CGTCGCCTGA GAGACGCGCT GCTGTCAGGC
GATCCGATTA TCTCGGTGAT CCTCTCCAGC GCGGTCAACA ACGACGGCAA CAGAAAGGTC
GGCTATACCG CCCCTTCCGT CGCAGGGCAA CAGGCAGTCA TCGAAGAGGC GTTAATGCTG
GCGGCCATCG ACGACAGGCA GGTAGGTTAC ATTGAAACCC ACGGCACCGG CACACCGCTG
GGCGACGCGA TTGAAATTGA AGCGTTACGC AACGTCTATG CGCCTCGCCC GCAGGATCAG
CGCTGTGCGC TCGGTTCCGT GAAAAGTAAC ATGGGCCATC TGGATACCGC GGCGGGCATT
GCCGGACTGC TGAAAACCGT TCTGGCAGTC AGTCGCGGGC AAATTCCTCC CTTACTGAAT
TTTCACACCC CCAACCCGGC GCTGAAACTT GAAGAGAGCC CCTTTACCAT ACCGGTGTCG
GCACAGGCAT GGCAGGACGA AATGCGCTAT GCTGGCGTCT CCTCCTTTGG TATTGGCGGC
ACCAACTGCC ATATGATCGT CGCCTCGCTG CCCGACGCGC TCAACGCGCG CCTCCCCAAT
ACGGATAGCG GCAGAAAAAG TACCGCGCTG CTGCTCAGCG CCGCCAGCGA CAGCGCGTTG
CGGCGGCTGG CGACGGATTA TGCCGGGGCG CTGAGAGAGA ATGCGGATGC CAGCTCTCTG
GCCTTCACAG CCCTGCACGC GCGCCGTCTC GATCTCCCCT TCCGCCTGGC GGCGCCATTA
AACCGTGAAA CCGCCGAGGC GCTCAGCGCC TGGGCCGGTG AGAAATCGGG GGCGCTGGTT
TACAGCGGCC ACGGCGCCAG CGGCAAGCAG GTGTGGCTGT TTACCGGCCA GGGCTCGCAC
TGGCGCACTA TGGGTCAAAC GATGTACCAG CACTCAACGG CGTTTGCCGA CACGCTGGAT
CGCTGTTTTT CCGCCTGTAG CGAAATGCTC ACGCCGTCAC TGCGCGAAGC GATGTTTAAC
CCCGATTCGG CGCAGCTGGA CAATATGGCC TGGGCGCAGC CGGCGATTGT CGCGTTTGAA
ATCGCGATGG CGGCGCACTG GCGTGCTGAA GGACTGAAGC CAGACTTCGC CATTGGGCAT
TCCGTCGGTG AATTTGCCGC TGCCGTTGTC TGCGGACACT ATACGATTGA ACAGGTCATG
CCACTGGTTT GTCGGCGCGG CGCGCTAATG CAGCAGTGCG CAAGCGGCGC GATGGTAGCG
GTATTTGCAG ACGAAGACAC GCTGATGCCG CTGGCTCGCC AGTTTGAGCT GGATCTCGCC
GCCAACAACG GTACGCAACA TACGGTATTT TCCGGGCCGG AAGCCCGTCT CGCGGTATTT
TGCGCCACGC TCTCGCAGCA TGACATTAAC TATCGTCGCC TGAGCGTAAC CGGTGCGGCG
CACTCCGCTT TACTGGAGCC GATACTCGAT CGGTTCCAGG ACGCCTGCGC GGGACTGCAC
GCGGAGCCGG GGCAAATACC GATTATTTCC ACGCTCACCG CCGACGTCAT TGATGAGTCA
ACGCTCAACC AGGCGGATTA CTGGCGCCGA CACATGCGCC AGCCGGTGCG TTTTATCCAG
AGTATTCAGG TGGCGCATCA GCTCGGCGCC CGCGTTTTTC TGGAGATGGG GCCCGATGCC
CAGTTGGTTG CTTGCGGGCA GCGCGAATAC CGCGATAACG CATACTGGAT AGCCAGCGCC
CGGCGTAACA AAGAGGCGAG CGATGTCCTC AATCAGGCCC TGCTCCAGCT TTACGCTGCC
GGCGTCGCCC TACCGTGGGC CGACCTGCTG GCGGGCGATG GACAACGTAT CGCTGCGCCA
TGTTATCCGT TTGATACTGA GCGTTACTGG AAAGAGCGCG TCTCCCCGGC CTGCGAGCCT
GCCGACGCAG CGCTGTCTGC CGGGCTGGAG GTGGCGAGTC GCGCCGCGAC AGCGCTCGAT
CTCCCTCGCC TGGAAGCGCT TAAACAGTGC GCCACGCGAC TGCACGCCAT CTACGTCGAT
CAACTGGTAC AACGCTGTAC CGGCGATGCC ATTGAGAACG GCGTGGACGC CATGACCATC
ATGCGCCGTG GACGTCTGCT GCCCCGCTAC CAGCAGCTAC TCCAGCGCCT GCTGAATAAC
TGCGTGGTCG ACGGCGATTA CCGCTGCACC GACGGGCGAT ACGTCCGCGC CCGCCCCATT
GAACATCAAC AGCGGGAATC ACTGCTGACG GAACTTGCCG GTTATTGTGA AGGTTTTCAG
GCTATTCCCG ACACCATCGC CCGTGCCGGC GATCGGTTAT ATGAAATGAT GAGCGGCGCG
GAAGAACCGG TGGCGATTAT CTTCCCGCAA AGCGCCTCCG ACGGCGTGGA AGTGCTGTAT
CAGGAATTCA GCTTTGGCCG CTATTTCAAC CAAATCGCCG CCGGGGTATT ACGCGGCATT
GTCCAGACGC GTCAGCCCCG CCAGCCGTTG CGTATTCTTG AAGTTGGCGG CGGAACCGGC
GGCACCACCG CGTGGCTGCT GCCGGAACTC AACGGCGTTC CGGCACTGGA GTACCATTTC
ACCGATATCT CGGCGCTGTT CACCCGTCGC GCCCAGCAGA AATTCGCCGA CTATGATTTT
GTGAAGTATA GCGAGCTGGA TCTCGAAAAA GAGGCGCAGT CTCAGGGTTT CCAGGCACAG
TCTTACGATC TTATCGTGGC AGCGAACGTG ATTCACGCCA CCCGCCATAT TGGCCGCACG
CTCGATAATC TGCGCCCCCT GCTCAAGCCG GGCGGGCGCC TGCTGATGCG CGAAATCACC
CAGCCAATGC GTCTGTTTGA CTTCGTTTTC GGCCCGCTGG TTCTTCCGCT ACAGGATCTC
GACGCCCGCG AAGGTGAGTT ATTCCTCACC ACCGCTCAGT GGCAACAACA GTGCCGCCAC
GCCGGATTCA GCAAAGTGGC GTGGCTACCG CAGGATGGCA GCCCGACCGC CGGGATGAGC
GAACATATCA TTCTCGCCAC GCTGCCCGGT CAGGCGGTTA GCGCCGTAAC ATTCACCGCG
CCATCAGAAC CCGTGTTGGG GCAGGCGCTG ACGGATAACG GTGATTATCT CGCCGACTGG
TCTGATTGCG CAGGTCAGCC CGAACGGTTT AACGCCCGCT GGCAGGAGGC CTGGCGTCTG
CTTTCACAGC GTCATGGCGA CGCTCTGCCT GTGGAACCGC CCCCCGTCGC CGCCCCGGAG
TGGCTGGGGA AGGTTCGCTT AAGCTGGCAA AACGAAGCCT TTTCCCGCGG TCAGATGCGC
GTTGAAGCCC GTCATCCTAC TGGCGAGTGG CTGCCGCTAT CGCCCGCCGC GCCTCTTCCT
GCGCCGCAGA CGCATTATCA ATGGCGCTGG ACGCCCCTCA ACGTCGCCAG CATTGACCAT
CCGCTTACCT TTAGCTTCAG CGCCGGTACG CTTGCGCGCA GCGACGAGCT GGCGCAATAC
GGCATCATTC ACGATCCGCA CGCCTCTTCG CGACTGATGA TTGTTGAGGA GAGCGAGGAT
ACGCTGGCCT TAGCGGAGAA AGTGATAGCA GCGCTCACCG CCAGCGCAGC CGGATTGATT
GTGGTTACTC GCCGCGCGTG GCGAGTCGAG GAAAATGAAG CACTCTCTGC ATCCCATCAC
GCGCTATGGG CCTTGCTTCG CGTCGCGGCC AACGAACAGC CGGAACGGTT GCTTGCCGCC
ATCGATCTCG CCGAAAACAC CCCGTGGGAA ACGCTGCATC AAGGGTTGAG CGCAGTCTCA
CTATCACAGC GCTGGCTCGC CGCACGGGGT GACACCCTTT GGCTCCCTTC ACTGGCGCCC
AATACGGGAT GCGCCGCTGA ATTACCGGCA AACGTGTTTA CCGGCGATAG CCGCTGGCAT
CTGGTGACCG GAGCGTTTGG CGGATTAGGC CGCCTTGCCG TGAACTGGCT CAGAGAAAAA
GGGGCGCGAC GCATCGCCCT GCTGGCGCCG CGCGTGGATG AGTCATGGCT ACGCGACGTG
GAGGGCGGGC AGACGCGCGT CTGCCGTTGT GATGTGGGCG ATGCCGGGCA ACTGGCCACG
GTTCTTGACG ATCTGGCGGC CAACGGCGGC ATTGCCGGAG CGATTCATGC CGCTGGCGTC
TTGGCTGACG CGCCCTTGCA GGAGCTTGAT GACCACCAGC TGGCTGCCGT TTTCGCGGTA
AAAGCGCAGG CGGCAAGCCA GCTGTTGCAA ACCCTGCGCA ACCACGACGG ACGCTATCTT
ATTCTCTACT CTTCCGCTGC CGCCACCCTC GGCGCGCCGG GTCAGAGCGC CCATGCGCTG
GCCTGCGGCT ACCTGGACGG GCTGGCCCAG CAGTTTTCCA CCCTTGATGC GCCGAAAACG
CTCTCTGTCG CCTGGGGCGC ATGGGGAGAA AGCGGTCGGG CGGCCACGCC GGAAATGCTG
GCGACGCTCG CCAGCCGAGG TATGGGCGCG TTAAGCGATG CCGAAGGCTG CTGGCACCTG
GAACAGGCGG TGATGCGCGG CGCCCCGTGG CGACTGGCGA TGCGCGTTTT TACCGACAAA
ATGCCCCCGT TACAACAGGC TCTGTTTAAC ATCAGCGCCA CAGAAAAAGC CGCAACGCCG
GTCATTCCTC CTGCTGATGA CAACGCCTTT AACGGCAGCC TGAGCGATGA AACAGCGGTG
ATGGCATGGC TGAAAAAGCG GATTGCGGTT CAGCTAAGGC TGAGCGATCC GGCGTCACTG
CATCCAAACC AGGATCTGTT GCAACTCGGC ATGGACTCGC TGCTCTTCCT TGAACTCAGT
AGCGATATTC AGCACTACCT GGGCGTACGC ATCAATGCGG AACGGGCGTG GCAGGATCTG
TCTCCTCATG GACTCACGCA GCTTATCTGT TCTAAGCCAG AGGCGACGCC TGCCGCTTCG
CAGCCGGAAG TGTTGCGGCA CGACGCCGAC GAGCGTTATG CGCCCTTCCC TTTGACGCCC
ATTCAGCACG CCTACTGGCT GGGGCGAACC CACCTCATTG GCTATGGCGG CGTCGCCTGT
CACGTCCTGT TTGAGTGGGA TAAACGCCAC GATGAGTTCG ATCTCGCCAT ACTGGAGAAA
GCATGGAACC AGCTCATCGC ACGCCACGAT ATGTTGCGTA TGGTGGTTGA TGCCGACGGG
CAGCAGCGAA TCCTGGCGAC AACGCCGGAG TATCACATCC CGCGTGACGA TCTGCGCGCG
CTTTCCCCGG AAGAACAGCG CATCGCGCTG GAAAAACGGC GGCATGAACT GAGCTATCGC
GTTTTGCCTG CCGACCAGTG GCCTCTTTTT GAGCTGGTGG TCAGCGAAAT CGACGATTGC
CATTACCGTC TGCATATGAA CCTCGACCTT TTGCAGTTTG ATGTGCAGAG TTTTAAAGTC
ATGATGGACG ACCTGGCGCA GGTCTGGCGC GGTGAAACGC TGGCGCCGCT CGCTATTACC
TTCCGTGATT ATGTGATGGC TGAACAGGCG CGCCGACAGA CATCGGCATG GCACGATGCC
TGGGATTACT GGCAGGAAAA ACTGCCGCAA CTGCCCTTAG CGCCAGAGCT GCCGGTGGTT
GAGACGCCCC CGGAAACGCC ACACTTCACC ACCTTCAAAT CGACGATCGG CAAGACAGAA
TGGCAGGCCG TGAAACAGCG CTGGCAGCAG CAAGGCGTCA CACCGTCTGC CGCGCTGCTC
ACGCTGTTTG CCGCCACCCT TGAGCGCTGG AGCCGTACCA CAACATTTAC GCTGAACCTG
ACGTTCTTCA ATCGCCAGCC GATCCATCCG CAAATCAACC AGTTGATTGG TGATTTTACC
TCCGTCACGC TGGTTGATTT TAACTTCTCA GCGCCGGTGA CGTTGCAAGA GCAGATGCAA
CAGACCCAAC AGCGCCTCTG GCAAAACATG GCGCACAGTG AAATGAACGG TGTTGAGGTG
ATCCGTGAGC TGGGCCGCCT GCGCGGATCA CAACGTCAAC CGCTGATGCC GGTAGTGTTT
ACCAGTATGC TGGGGATGAC GCTGGAAGGC ATGACTATCG ATCAGGCGAT GAGCCATCTG
TTCGGCGAAC CCTGCTATGT ATTCACGCAA ACGCCGCAGG TCTGGCTGGA TCATCAGGTC
ATGGAGAGCG ACGGCGAGTT GATGTTTAGC TGGTACTGCA TGGACAACGT GCTGGAACCC
GGCGCTGCCG AGGCGATGTT TAATGACTAT TGCGCCATCC TGCAAGCCGT CATCGCCGCC
CCTGAAAGCC TGAAGACTCT CGCCAGCGGC ATCGCCGGGC ACATTCCCCG CCGACGCTGG
CCGCTGAACG CGCAGGCGGA CTACGACCTG CGGGATATTG AGCAGGCGAC GCTCGAATAC
CCCGGCATCC GGCAGGCCAG AGCGGAAATA ACCGAACAGG GCGCGTTGAC GCTGGATATC
GTGATGGCCG ACGATCCGTC GCCATCAGCG GCGATGCCTG ATGAGCACGA ACTTACCCAA
CTGGCGCTGC CGTTGCCTGA GCAGGCGCAG CTTGATGAGC TGGAGGCGAC CTGGCGCTGG
CTGGAGGCGC GTGCGCTACA GGGGATCGCG GCTACGCTAA ATCGTCACGG CCTGTTTACC
ACGCCGGAGA TCGCCCATCG CTTTAGCGCA ATAGTACAGG CGCTGTCCGC GCAAGCGTCT
CACCAGCGTC TGCTGCGCCA GTGGCTACAG TGTCTGACGG AAAGAGAGTG GTTAATCCGC
GAAGGTGAAA GCTGGCGCTG CCGCATTCCG CTCAGCGAGA TTCCTGAGCC TCAGGAAGCG
TGCCCGCAAA GCCAATGGAG CCAGGCGCTG GCGCAGTATC TGGAAACCTG CATCGCCCGG
CACGACGCCC TCTTCTCCGG GCAGTGTTCT CCGCTGGAAT TGCTGTTCAA CGAGCAGCAT
CGCGTTACCG ACGCGCTGTA TCGCGACAAC CCCGCCAGCG CCTGTCTGAA TCGCTATACC
GCGCAGATTG CCGCCTTGTG CAGCGCAGAA CGGATTCTGG AGGTTGGCGC CGGAACCGCA
GCCACTACCG CGCCGGTGCT GAAGGCCACG CGGAACACGC GGCAGTCGTA CCACTTCACG
GACGTCTCCG CGCAGTTCCT CAATGACGCC AGAGCCCGTT TCCATGATGA ATCGCAGGTG
TCTTATGCCT TGTTCGACAT CAACCAGCCG CTGGATTTCA CCGCCCACCC GGAGGCGGGT
TACGACCTGA TCGTTGCCGT CAATGTGCTC CACGACGCCA GCCATGTCGT CCAGACGTTG
CGCAGATTAA AACTGTTGCT GAAAGCCGGC GGACGTTTGC TGATCGTTGA AGCGACGGAG
CGAAACAGCG TATTCCAGCT GGCGAGCGTG GGCTTTATTG AGGGATTAAG CGGATACCGC
GATTTCCGCC GCCGGGATGA GAAACCGATG CTCACCCGCT CCGCATGGCA GGAGGTTCTC
GTTCAGGCCG GGTTTGCAAA CGAGCTGGCG TGGCCCGCGC AGGAATCGTC GCCGCTGCGC
CAGCATCTGC TGGTGGCGCG TTCGCCTGGC GTAAATCGCC CGGATAAAAA AGCCGTGAGC
CGCTATTTAC AGCAGCGCTT TGGCACCGGT CTGCCCATTT TACAGATCCG GCAAAGAGAA
GCGTTATTTA CGCCGCTGCA TGCCCCGTCT GATGCGCCGA CTGAGCCAGC CAAACCCACG
CCAGTTGCCG GGGGGAATCC GGCGCTGGAA AAACAGGTGG CTGAACTCTG GCAATCGCTG
CTGTCTCGCC CCGTGGCAAG GCATCACGAC TTTTTCGAAC TGGGCGGCGA CAGCCTGATG
GCGACAAGGA TGGTCGCGCA GCTAAACCGG AGAGGGATTG CCAGGGCTAA CCTTCAGGAT
CTGTTCAGCC ATTCGACGCT GAGCGACTTC TGCGCCCATC TACAGGCGGC TACGTCAGGA
GAGGACAACC CGATACCCCT TTGCCAGGGC GACGGTGAGG AAACCCTGTT TGTCTTCCAC
GCTTCAGACG GCGATATCAG CGCCTGGCTG CCGCTCGCTA GCGCGTTGAA CAGGCGCGTT
TTCGGCCTGC AAGCAAAATC GCCGCAGCGC TTTGCCACGC TCGACCAGAT GATCGATGAG
TATGTCGGGT GCATCCGTCG TCAGCAGCCT CACGGCCCTT ATGTGCTGGC GGGTTGGTCG
TATGGCGCGT TTCTTGCGGC GGGCGCCGCA CAGCGCCTGT ACGCCAAAGG CGAGCAGGTT
CGGATGGTGT TAATCGATCC CGTGTGCCGA CAGGATTTCT GTTGCGAAAA CCGGGCGGCC
CTGCTGCGCC TGTTAGCCGA AGGACAAACG CCTCTGGCAC TGCCCGAACA TTTCGACCAG
CAGACGCCCG ACAGCCAGCT TGCCGACTTT ATCAGCCTCG CTAAAACGGC CGGTATGGTG
TCGCAAAACC TGACGCTGCA AGCGGCAGAA ACGTGGCTCG ACAACATCGC GCATCTGCTG
CGTTTACTGA CTGAGCATAC GCCGGGCGAA AGCGTTCCGG TCCCCTGTCT CATGGTGTAT
GCCGCCGGGA GACCCGCGCG CTGGACGCCA GCAGAAACCG AGTGGCAGGG CTGGATAAAC
AACGCCGACG ACGCTGTGAT TGAAGCCAGC CACTGGCAAA TCATGATGGA AGCCCCCCAC
GTTCAGGCTT GTGCGCAACA CATTACGCGC TGGCTTTGCG CAACCTCAAC GCAACCGGAG
AACACGTTAT GA
 
Protein sequence
MDNLRFSSAP TADSIDASIA QHYPDCEPVA VIGYACHFPE SPDGETFWQN LLEGRECSRR 
FTREELLAVG LDAAIIDDPH YVNIGTVLDN ADCFDATLFG YSRQEAESMD PQQRLFLQAV
WHALEHAGYA PGAVPHKTGV FASSRMSTYP GREALNVTEV AQVKGLQSLM GNDKDYIATR
AAYKLNLHGP ALSVQTACSS SLVAVHLACE SLRAGESDMA VAGGVALSFP QQAGYRYQPG
MIFSPDGHCR PFDASAEGTW AGNGLGCVVL RRLRDALLSG DPIISVILSS AVNNDGNRKV
GYTAPSVAGQ QAVIEEALML AAIDDRQVGY IETHGTGTPL GDAIEIEALR NVYAPRPQDQ
RCALGSVKSN MGHLDTAAGI AGLLKTVLAV SRGQIPPLLN FHTPNPALKL EESPFTIPVS
AQAWQDEMRY AGVSSFGIGG TNCHMIVASL PDALNARLPN TDSGRKSTAL LLSAASDSAL
RRLATDYAGA LRENADASSL AFTALHARRL DLPFRLAAPL NRETAEALSA WAGEKSGALV
YSGHGASGKQ VWLFTGQGSH WRTMGQTMYQ HSTAFADTLD RCFSACSEML TPSLREAMFN
PDSAQLDNMA WAQPAIVAFE IAMAAHWRAE GLKPDFAIGH SVGEFAAAVV CGHYTIEQVM
PLVCRRGALM QQCASGAMVA VFADEDTLMP LARQFELDLA ANNGTQHTVF SGPEARLAVF
CATLSQHDIN YRRLSVTGAA HSALLEPILD RFQDACAGLH AEPGQIPIIS TLTADVIDES
TLNQADYWRR HMRQPVRFIQ SIQVAHQLGA RVFLEMGPDA QLVACGQREY RDNAYWIASA
RRNKEASDVL NQALLQLYAA GVALPWADLL AGDGQRIAAP CYPFDTERYW KERVSPACEP
ADAALSAGLE VASRAATALD LPRLEALKQC ATRLHAIYVD QLVQRCTGDA IENGVDAMTI
MRRGRLLPRY QQLLQRLLNN CVVDGDYRCT DGRYVRARPI EHQQRESLLT ELAGYCEGFQ
AIPDTIARAG DRLYEMMSGA EEPVAIIFPQ SASDGVEVLY QEFSFGRYFN QIAAGVLRGI
VQTRQPRQPL RILEVGGGTG GTTAWLLPEL NGVPALEYHF TDISALFTRR AQQKFADYDF
VKYSELDLEK EAQSQGFQAQ SYDLIVAANV IHATRHIGRT LDNLRPLLKP GGRLLMREIT
QPMRLFDFVF GPLVLPLQDL DAREGELFLT TAQWQQQCRH AGFSKVAWLP QDGSPTAGMS
EHIILATLPG QAVSAVTFTA PSEPVLGQAL TDNGDYLADW SDCAGQPERF NARWQEAWRL
LSQRHGDALP VEPPPVAAPE WLGKVRLSWQ NEAFSRGQMR VEARHPTGEW LPLSPAAPLP
APQTHYQWRW TPLNVASIDH PLTFSFSAGT LARSDELAQY GIIHDPHASS RLMIVEESED
TLALAEKVIA ALTASAAGLI VVTRRAWRVE ENEALSASHH ALWALLRVAA NEQPERLLAA
IDLAENTPWE TLHQGLSAVS LSQRWLAARG DTLWLPSLAP NTGCAAELPA NVFTGDSRWH
LVTGAFGGLG RLAVNWLREK GARRIALLAP RVDESWLRDV EGGQTRVCRC DVGDAGQLAT
VLDDLAANGG IAGAIHAAGV LADAPLQELD DHQLAAVFAV KAQAASQLLQ TLRNHDGRYL
ILYSSAAATL GAPGQSAHAL ACGYLDGLAQ QFSTLDAPKT LSVAWGAWGE SGRAATPEML
ATLASRGMGA LSDAEGCWHL EQAVMRGAPW RLAMRVFTDK MPPLQQALFN ISATEKAATP
VIPPADDNAF NGSLSDETAV MAWLKKRIAV QLRLSDPASL HPNQDLLQLG MDSLLFLELS
SDIQHYLGVR INAERAWQDL SPHGLTQLIC SKPEATPAAS QPEVLRHDAD ERYAPFPLTP
IQHAYWLGRT HLIGYGGVAC HVLFEWDKRH DEFDLAILEK AWNQLIARHD MLRMVVDADG
QQRILATTPE YHIPRDDLRA LSPEEQRIAL EKRRHELSYR VLPADQWPLF ELVVSEIDDC
HYRLHMNLDL LQFDVQSFKV MMDDLAQVWR GETLAPLAIT FRDYVMAEQA RRQTSAWHDA
WDYWQEKLPQ LPLAPELPVV ETPPETPHFT TFKSTIGKTE WQAVKQRWQQ QGVTPSAALL
TLFAATLERW SRTTTFTLNL TFFNRQPIHP QINQLIGDFT SVTLVDFNFS APVTLQEQMQ
QTQQRLWQNM AHSEMNGVEV IRELGRLRGS QRQPLMPVVF TSMLGMTLEG MTIDQAMSHL
FGEPCYVFTQ TPQVWLDHQV MESDGELMFS WYCMDNVLEP GAAEAMFNDY CAILQAVIAA
PESLKTLASG IAGHIPRRRW PLNAQADYDL RDIEQATLEY PGIRQARAEI TEQGALTLDI
VMADDPSPSA AMPDEHELTQ LALPLPEQAQ LDELEATWRW LEARALQGIA ATLNRHGLFT
TPEIAHRFSA IVQALSAQAS HQRLLRQWLQ CLTEREWLIR EGESWRCRIP LSEIPEPQEA
CPQSQWSQAL AQYLETCIAR HDALFSGQCS PLELLFNEQH RVTDALYRDN PASACLNRYT
AQIAALCSAE RILEVGAGTA ATTAPVLKAT RNTRQSYHFT DVSAQFLNDA RARFHDESQV
SYALFDINQP LDFTAHPEAG YDLIVAVNVL HDASHVVQTL RRLKLLLKAG GRLLIVEATE
RNSVFQLASV GFIEGLSGYR DFRRRDEKPM LTRSAWQEVL VQAGFANELA WPAQESSPLR
QHLLVARSPG VNRPDKKAVS RYLQQRFGTG LPILQIRQRE ALFTPLHAPS DAPTEPAKPT
PVAGGNPALE KQVAELWQSL LSRPVARHHD FFELGGDSLM ATRMVAQLNR RGIARANLQD
LFSHSTLSDF CAHLQAATSG EDNPIPLCQG DGEETLFVFH ASDGDISAWL PLASALNRRV
FGLQAKSPQR FATLDQMIDE YVGCIRRQQP HGPYVLAGWS YGAFLAAGAA QRLYAKGEQV
RMVLIDPVCR QDFCCENRAA LLRLLAEGQT PLALPEHFDQ QTPDSQLADF ISLAKTAGMV
SQNLTLQAAE TWLDNIAHLL RLLTEHTPGE SVPVPCLMVY AAGRPARWTP AETEWQGWIN
NADDAVIEAS HWQIMMEAPH VQACAQHITR WLCATSTQPE NTL
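
Both sequences are printed in space-separated blocks of ten characters, so the spacing has to be stripped before any analysis. The following is a minimal sketch of that step, together with a simple translation and a GC-fraction helper. The function names `clean`, `translate` and `gc_fraction` are hypothetical, and the codon table is the standard one; translation table 11 uses the same codon assignments and differs only in the start codons it permits.

```python
from itertools import product

# Standard codon table in the usual T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(blocked.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    return (dna.count("G") + dna.count("C")) / len(dna) if dna else 0.0


# First three blocks of the gene sequence listed in this record.
dna = clean("ATGGATAACT TGCGCTTCTC TTCTGCGCCG")
print(translate(dna))  # MDNLRFSSAP, the first block of the protein sequence
```

Applied to the full gene sequence, `gc_fraction` should come out close to the 61% reported in the record, and `translate` should reproduce the listed protein sequence.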