Gene YpAngola_A4116 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagYpAngola_A4116 
Symbol 
ID5802596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameYersinia pestis Angola 
KingdomBacteria 
Replicon accessionNC_010159 
Strand
Start bp4387412 
End bp4396213 
Gene Length8802 bp 
Protein Length2933 aa 
Translation table11 
GC content51% 
IMG OID641341894 
Producthypothetical protein 
Protein accessionYP_001608399 
Protein GI162418664 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value0.863096 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCTCCTG CTGCACGTGC GACTGAACCT TATACGCTGG GGCCGGGTGA CTCTATTCAA 
TCGATAGCAA AAAAATATAA TATTACGGTT GATGAACTGA AAAAACTGAA TGCTTATCGT
ACCTTTTCCA AACCTTTCGC ATCACTGACA ACAGGGGATG AGATTGAAGT TCCCCGCAAA
GAGTCATCTT TCTTTAGCAA TAATCCTAAT GAAAATAATA AAAAAGATGT TGATGATTTG
TTAGCCAGAA ATGCTATGGG AGCCGGTAAG TTACTTTCTA ATGACAATAC CTCTGATGCC
GCAAGTAATA TGGCGCGTTC AGCAGTGACA AATGAAATTA ACGCATCTTC TCAGCAGTGG
TTGAACCAAT TTGGTACTGC GCGGGTACAA TTGAATGTAG ATAGCGATTT TAAGCTAGAT
AACAGCGCGC TGGACCTATT GGTACCACTC AAAGACAGTG AAAGTTCACT CCTGTTTACT
CAACTTGGTG TTCGCAATAA AGACAGCCGC AACACAGTAA ATATTGGTGC CGGAATACGT
CAGTACCAAG GTGACTGGAT GTACGGTGCT AATACCTTCT TTGACAATGA TCTTACTGGA
AAAAACCGGC GTGTGGGGGT GGGTGCTGAA GTTGCGACTG ACTACCTTAA ATTCTCCGCT
AACACCTATT TTGGTTTGAC CGGCTGGCAT CAGTCTCGTG ACTTCAGCAG TTACGATGAG
CGTCCGGCTG ATGGCTTCGA TATCCGTACC GAGGCTTATT TACCCGCCTA TCCACAATTA
GGCGGTAAAT TAATGTATGA AAAATACCGT GGTGATGAAG TCGCCTTATT TGGTAAGGAT
GATCGCCAGA AAGACCCACA TGCTGTGACG TTGGGGGTTA ATTACACCCC GGTTCCTTTG
GTCACTATTG GTGCAGAACA CCGTGAGGGG AAAGGTAACA ACAACAATAC CAGCGTTAAT
GTGCAACTGA ATTATCGCAT GGGGCAACCG TGGAATGATC AGATTGATCA GTCAGCGGTG
GCGGCTAACC GGACACTGGC GGGCAGCCGT TATGACCTGG TTGAACGTAA TAATAATATT
GTTCTGGATT ATAAAAAGCA GGAATTAATA CATCTGGTTC TGCCTGACAG AATCAGTGGT
TCGGGTGGCG GTGCCATTAC ATTGACCGCA CAGGTACGTG CAAAATATGG CTTTAGCCGT
ATTGAATGGG ATGCGACACC GTTAGAAAAT GCGGGCGGCA GTACCTCCCC ACTCACTCAG
AGTTCATTGT CGGTTACCTT ACCTTTCTAT CAACATATTC TCAGAACAAG CAATACGCAT
ACAATAAGTG CGGTTGCCTA TGATGCCCAA GGGAACGCCT CAAATCGTGC GGTGACATCT
ATTGAAGTCA CTCGTCCAGA GACCATGGTG ATCAGCCATC TGGCGACAAC GATTGATAAT
GCGACGGCTA ACGGTATTGC GACTAACACG GTACAAGCCA CAGTAACCGA TGGCGACGGC
CAACCGATTA TCGGGCAGCT CATCAACTTT GCGGTTAATA CTCAGGCAAC ATTAAGTACG
ACAGAGGCAA GAACAGGAGC TAATGGTACT GCCAGTACCA CACTGACGCA TACCGTCTCG
GGTGTAAGTA GGGTTAGCGT TACGTTGGGT TCTAGTAGCC GAAGTGTGGA TACGACGTTT
GTGGCTGATG AAAGCACGGC GGAGATCACC GCCGCAAATC TGACAGTGAC AACAAATGAC
TCAGTGGCTA ATGGCAGTGA CACTAACGTT GTTCGGGCGA AGGTTACCGA TGCCTATACT
AATGCTGTTG CTAATCAATC CGTGATATTC AGTGCCAGTA ATGGTGCAAC TGTCATCGAT
CAAACAGTGA TAACCAATGC CGAGGGGATC GCCGACTCCA CGCTGACCAA TACCACCGCA
GGGGTTTCGG TAGTGACTGC AACGTTGGGA GGCCAATCTC AACAGGTTGA TACGACATTT
AAACCTGGGT CGACAGCGGC GATCAGTTTG GTGAAATTGG CTGACCGGGC GGTTGCCGAT
GGCATCGACC AGAATGAAAT CCAAGTCGTG TTACGGGATG GGACAGGCAA TGCCGTGCCA
AATGTGCCGA TGAGTATTCA GGCAGATAAT GGCGCGATAG TGGTTGCTTC AACACCGAAT
ACCGGTGTAG ATGGCACGAT TAATGCCACA TTCACGAACC TTCGGGCAGG AGAATCCGTT
GTTAGCGTGA CGTCTCCTGC ATTGGTGGGT ATGACGATGA CAATGACGTT CTCTGCTGAT
CCGAGGACGG CGGTTGTTTC TACGTTGGCC GCAATTGATA ACAATGCCAA AGCGGATGGA
ACTGACACCA ACGTGGTGCG TGCGTGGGTC GTTGATGCAA ATGGTAATTC AGTACCGGGT
GTTTCTGTAA CATTTGATGC TGGGAATGGT GCTGTTTTGG CACAGAATCC AGTGGTGACA
GACCGTAATG GCTATGCAGA AAATACACTC ACCAACCTGG CTATAGGTAC CACTACAGTC
AAAGCCACGA CGGTAACCGA CCCTGTTGGT CAGACCGTCA ATACCCACTT TGTGGCCGGT
GCAGTAGATA CCATCACCCT GACGGTGCCG GTTAACGGCG CGGTGGCTAA TGGTGTGAAT
ACTAACAGCG TGCAGGCGGT GGTCAGCGAC AGCGGGGGCA ACCCGGTTAC CGGTGCGACG
GTAGTCTTCA GCTCCACCAA TGCCACAGCG CAAGTCACTA CGGTGATCGG CACCACCGGT
GCGGACGGGA TCGCCACGGC GACCCTGACC AATACCGTGG CCGGGACCAG CAATGTGGTC
GCCACCATTG ATACGGTTAA CGCCAATATC GACACCGCCT TTGTGGCGGG TGCAGTTGCG
ACCATCACCC TGACTGCGCC AGTTAATGGC GCGGTGGCGG ATGGTGCAGA CACCAATCAG
GTGGACGCAT TGGTAGAGGA TGCTAACGGC AACCCGATCA CCGGTGCTGC GGTGGTCTTT
AGTTCGGCCA ACGGGGCAAC TATTCTTTCC TCGACCATGA ACACCGGTGT AAATGGAGTG
GCATCAACGC TCCTGACTCA TACCGTGGCC GGGACCAGCA ATGTGGTCGC CACCGTTGAT
ACGGTTAACG CCAATATTGA CACGACCTTT GTGGCCGGTG CGGTCGCGAC CATCACACTG
ACGACGCCGG TTAACGGCGC GGTGGCGGAT GGGGCAAACA GCAACAGCGT GCAGGCAGTG
GTCAGCGACA GCGACGGCAA TCCGGTTACC GGTGCGGCTG TAGTCTTCAG TTCTGCCAAC
GCCACAGCCC AAATTACCAC AGTGATCGGC ACCACCGGTG CGGACGGGAT CGCCACGGCG
ACCCTGACCA ATACCGTGGC CGGGACCAGC AATGTGGTCG CCACCATTGA TACGGTTAAC
GCCAATATCG ACACCGCCTT TGTGGCGGGT GCAGTTGCGA CCATCACCCT GACTGCGCCA
GTTAATGGCG CGGTGGCGGA TGGTGCAGAC ACCAATCAGG TGGACGCATT GGTACAGGAT
GCTAATGGCA ATGCGATCAC CGGTGCCGCC GTGGTCTTTA GTTCAGCCAA TGGAGCAGAT
ATTATTGCCC CGACCATGAA CACCGGTGTA AATGGAGTAG CATCAACACT CTTGACTCAT
ACCGTGGCCG GGACCAGTAA CGTAGTGGCC ACCATTGATA CGATCAGCGC CAATATCGAC
ACTGCCTTTG TGGCGGGTGC AGTTGCGACC ATCACCCTGA CTGCGCCAGT TAATGGCGCG
GTGGCGGATG GTGCAGACAC CAATCAGGTG GACGCATTGG TAGAGGATGC TAACGGCAAC
CCGATCACCG GTGCTGCGGT GGTCTTTAGT TCGGCCAACG GGGCAACTAT TCTTTCCTCG
ACCATGAACA CCGGTGTAAA TGGCGTGGCA TCAACGTTCC TGACCCATAC CGTGGCCGGG
ACCAGCAATG TGGTCGCCAC CATAGGCAGC GTTACTGAGA ATATCGACAC CGCCTTTGTG
GCCGGTGCAG TTGCGACCAT TACGCTGACT GCGCCAGTTA ATGGCGCGGT GGCGGATGGA
GTGAACACTA ACAGCGTGCA GGCGGTGGTC AGCGACAGCG ACGGCAATGC GGTCACCGGT
GCAACCGTAG TCTTTAGCTC TGCCAACGCC ACAGCACAAA TTACCACAGT GATCGGCACC
ACCGGTGCGG ACGGGATCGC CACAGCGACC CTGACCAATA CCGTGGCGGG GACCAGCAAT
GTGGTCGCCA CCATTGATAC GGTTAACGCC AATATCGACA CGACCTTTGT GGCCGGTGAG
CTGGAGAATA TCGTCGTCAG TATTATTAAC AATAATGCAC TGGCAAATGG CGCAGATACC
AATATTGTCG AAGCCTTTGT GACTGACCGT TTCGGTAATG GCGTGGCGAA TCAAAGCCTA
ATATTTGGCA CCAATGGGGC GTCCATTGTG GGTTCATCAA CAGTGACGAC TAATCTCGAT
GGTCGTGTTA GAGCGAGTGC TACGCATACT GTGGCGGGGA GCAGTAATAC GGTGATTGCA
ATAAGTGGCG CTCATCAAGG ATATGCCAGA GTAACCTTTG TTGCCGATGT TTCGACAGCC
CAGCTTAAGC TAACGTCGTT CTTGGATAAC CAGCTTGCGA ATGGTAAAGC CGGTAACATT
GCACAAGCGT TGGTTACCGA TGCTCATGAC AACCTATTAG CTAATCAATC CGTTAGCTTT
GCTCTCGATA ACGGTGCAGT CATTGAGTCT CAGGGCGATG CCAGTAGCGC CTCTGGAATT
GTCTTGATGA GATTCAACAA TACGCTTGCA GGTATGACAA CGGTGACGGC AACGCTCGAT
TCAACCGGGC AAACTGAAAC CCTCGAGACG CATTTTGTGG CGGGAAAAGC GGCATCGATT
GAAATGACGA TGACGAAAGA TAATGCCGTG GCTAACAATA TCGATACCAA CGAAGTCCAG
GTGTTAGTGA CGGATGTAGA CGGTAACGCG ATCAACGGCG CGGTGGTCAA CCTCACTTCT
AACAGTGGCA TGAACATTAC ACCAAACTCG GTAACGACAG GCAGCGATGG TACGGCGACG
GCGACCTTGA CGCATACCCT GGCAGGGAGC CTCCCGATCA ATGCGCGGAT CGATCAGGTG
AGTAAAACGA TTAATGCCAC CTTTATCGCT GATGCTTCGA CTGCGCAGAT TATTGCGGGT
GACATGTTCA TCATCGTTAA CGATCAAGTT GCTAATGGGC AGGCGGTTAA CGCGGTTCAG
GCAAGAGTCA CTGATAGCTA TGGTAACCCT ATTAAGGATC AAACGGTTGA ATTCGTGCTG
AGTAATAATG GCACCATTCA ATATGAGCTC GATGTGACAT CAGTTGAAGG TGGCGTTATG
GTGACATTCA CTAATACCCT GGCGGGTATT ACCAATGTGA CCGCGACCGT GGTATCCAGT
GGCAGCAGCC GCAATATTGA TACTACCTTT ATTGCCGATG TGACGACGGC ACACATTGCT
GCAAGTGATT TGATGGTTAT TGTTGATGAC GCGGTCGCGG ATAACCTGGA TAAAAATGAG
GTCCATGCAC GGGTCACCGA TGCGAAGGGC AACGTGTTAT CGGGTCAGAC GGTTATCTTC
ACCTCTGGCA ACGGTGCTGC TATCACGACA GTCAATGGTA TCAGTGATGG CGATGGTCTG
ACCAAGGCCA CCTTAACCCA TACCTTGGCG GGTACCAGTG TGGTGACTGC AAGGGTCGGT
AACCGGGTGC AGAGCAAAGA TACGACCTTT ATTGCGGATA GAACCACCGC AACTATTAGG
GCATCAGACC TGACCATTAC CCGGAACAAT GCGCTAGCTG ATGGGGTTGC TACTAATGCC
GCTCGTGTGA TTGTTACTGA TGCCAATGGG AACCCGGTGC CGAGTATGTT TGTGGGTTAT
ACCTCGGATA ATGGCGCACT ACTGACACCA ACATCAGGGA TGACGGATAG CAGTGGGACG
TTTAGCACAA CCTTTACACA TACGACAGCG GGTATCAGTA AGGTGACTGC GGCGATCGTA
ACGATGGGGA TAAGCCAAAC TAAAGACGCC GTCTTTATTG CAGACAGATC CACTGCCCAT
GTGTCGGAGT TGATCGTTGT GAAAAATGAT TCGCTTGCCA ACAATAGCGA TAGAAATATC
GTGCAGGCGC ACATTAAGGA TGCTCATGGC AACGTGGTTA CGGGAATGAA TGTGAACTTT
AGTGCCACGG AGAATGTGAC GTTGACCGCA AACACTGTCA CCACGAATTC TCAAGGGTAT
GCAGAAAATA CCTTAAGGCA TAACGCGCCG GTTACCAGTG CGGTGACTGC AACGGTCGCT
ACTGACTTGG TGGGTCTCAC CGAGGATGTC CGGTTTGTTG CTGGTGCCGG TGCCCGAATT
GAGCTATTCA GGTTGAATGA TGGGGCGGTG GCCGATGGCA TCCAAACTAA CAGAGTTGAA
GCCAGGGTCT ATGATGTCTC TGATAACCTG GTGCCGAATA GTAACGTGGT GTTCAGCGCA
GATAATGGTG GCCAATTAGT GCAGAACGAT GTGCAGACTG ATGCCTTGGG TAGTGCTTAT
GTCACGGTTA GCAATATTAA TACTGGCGTG ACTAAGGTCA CTGTAACTGC AGATGGTGTG
TCGGCCTCAA CCACGACGAC CTTTATCGCC GATAGGGATA CGGCCACATT GGTCACGGAT
CGCTTTTTGA TCACTCATGA TAATGCGGTA GCGAATGGGG TTGTAGAAAA TAGAGTGTTA
TTACACCTTG TGGATGCCAA TGATAATTCG GTTTCTGGGG TCGAAGTTAA CTTTAGTGCC
ACTAATGGTG CGTCAATCAA TGCATCAGCT ATCACTGATA TAAATGGGTT TGCTATCGGT
GTACTGACAA ACACTCTCTC AGGGCCAAGT GACGTTACGG TAACGCTGGT GACGCCAGGG
GGGACTGAGA GCCTGACGGT TACGCCTCAA TTTATTGCCG ATATAAACAC CGCCAATATT
GCCACTGGTG ACTTTGTCAT TATCGATGAT GGCGCCGTGG CCAATAGCGT GGACGCCAAT
GAAGTCCGCG CTAGGGTGAC TGACAATCAG GGGAATGCTA TTGCTGGCTA CAGCGTTGTT
TTCTCATCAC AAAATGGCGC GACCATCACC ACCAGTGGTA TTACTGGCGT CGATGGGTGG
GCTAGCGCGA AGCTGACCCA TATTAAAGCT GGAGAGAGCG GGATCTTAGC GCGACTCTCG
CGGCCTATGG CGACGGTGCA CACGCTGATG CCGTACTTTA TTGCGGATGT GAGCACGGCA
ACGTTACAAC TTTTTAATTT CAACCCTATT CCGATAATTG CCGATGGGGT AATGCAATTC
TTCGTGCTAG GAAGGGTTTT TGATGCCAAC CAGAACCCAG TAGGGGGGCA GCAAGTGGCC
TTCAGTGCAA CAAATGAGGT GACCCTAACT GAGAGTAATG GCTCGATCAG TACTCCGGAA
GGAAGTGTGC TCTTATCTGT CACGAGTACT CAGGCTGGGG TTCACCCTAT TACGGGGACC
TTGGTATCGA ATAACTATAC GGACACGTTT GGTGCCGCAT TTATCGCGAA CAAAAATACC
GCTCAATTGT CCACCTTGAT GGTCGTTGAT AACAATGCAC TGGCAGATGG TGTTACACGT
AACCAGGTCC GGGCGCATGT TGTCGATAGT ACGGGCAATT CGGTGGCCGA TATGGCCGTG
ACATTTACCG CCAACCGTGG TGCGCAACTG AGTAAGGTAA CGGTACTGAC CGATAATAAC
GGGGATGCCG TCAATACGCT GACCAACAGT TTGGTTGGCG TGACGGTTGT GACGGCCAAA
CTGGGTACGG CAGGAACGCC TTTGACTGTT GACACGGTCT TTACTGCCGG GCCGCTGGCG
ACACTGACAC TGGTGACAAC GGTCAATAAT GCCTTTGCGG ATAACAGTGC TACCAATACG
GTACAGGCGA CGCTTAAAGA TGTCAGCGGG AACCCAATCG TTGGGGAAGT GGTCGCCTTT
GCGGCAAGCA ATGGGGCGAC GATCACCGCC ACCGATGGTG GGGTAAGCAA TGCTAACGGT
ATTGTCTTGG CTACCTTAAC TAATGGAACA GCCGGGGTTA GCACGGTTAC GGCGACGATA
GAGACCTTGA CGGAGACAAC AGACACCACC TTTATTGCTA TGAAGAATCT GGATGTGACC
GTGAATGGTA CAACGTTTAA CGGGGATGCC GGGTTCCCAA CCACCGGTTT TGTGGGGGCC
ACCTTCAAGG TCAATTCGGG TGGAGATAAT AGCCTCTATG ACTGGAGCAG CAGTGCCCCA
GCGCTGGTAT CGGTCAGCGG TGATGGTGTG GTGACATTTA ATGCGGTATT CCCGACGGGT
ACACCGACAA TCACTATATC TGCCACCCCG AAAGGCGGCG GTAGCCCACT CTCGTACAGT
TTTAGAGTCA ACCAGTGGTT TATCAATAAT AATGGCGCTA CGTTAAATCG CGCTGATGCA
ATAACGCATT GTGAAAATGT GGGCTATACG ATGCCAACGT CTACGCAGGT CACCAATGCG
GCGACCTGGA TGTCAGGCAA GCGGGCTGTT GGTAACTTGT GGTCAGAATG GGGTGACTTC
AGTGCCTATA CTGCGCCGGG CTGGGTGCCT GCTGAGTTCT TCTGGCTCAG TAATAATCAT
GATGCCAGTA CGGCTCTGGC TATTGGTTTG TCAACGGGTA CGCTGACGAC GATGGGTGAT
TTTATGGCCA TAACTCATGT GATGTGTACC CGCCCAATCT AG
 
Protein sequence
MPPAARATEP YTLGPGDSIQ SIAKKYNITV DELKKLNAYR TFSKPFASLT TGDEIEVPRK 
ESSFFSNNPN ENNKKDVDDL LARNAMGAGK LLSNDNTSDA ASNMARSAVT NEINASSQQW
LNQFGTARVQ LNVDSDFKLD NSALDLLVPL KDSESSLLFT QLGVRNKDSR NTVNIGAGIR
QYQGDWMYGA NTFFDNDLTG KNRRVGVGAE VATDYLKFSA NTYFGLTGWH QSRDFSSYDE
RPADGFDIRT EAYLPAYPQL GGKLMYEKYR GDEVALFGKD DRQKDPHAVT LGVNYTPVPL
VTIGAEHREG KGNNNNTSVN VQLNYRMGQP WNDQIDQSAV AANRTLAGSR YDLVERNNNI
VLDYKKQELI HLVLPDRISG SGGGAITLTA QVRAKYGFSR IEWDATPLEN AGGSTSPLTQ
SSLSVTLPFY QHILRTSNTH TISAVAYDAQ GNASNRAVTS IEVTRPETMV ISHLATTIDN
ATANGIATNT VQATVTDGDG QPIIGQLINF AVNTQATLST TEARTGANGT ASTTLTHTVS
GVSRVSVTLG SSSRSVDTTF VADESTAEIT AANLTVTTND SVANGSDTNV VRAKVTDAYT
NAVANQSVIF SASNGATVID QTVITNAEGI ADSTLTNTTA GVSVVTATLG GQSQQVDTTF
KPGSTAAISL VKLADRAVAD GIDQNEIQVV LRDGTGNAVP NVPMSIQADN GAIVVASTPN
TGVDGTINAT FTNLRAGESV VSVTSPALVG MTMTMTFSAD PRTAVVSTLA AIDNNAKADG
TDTNVVRAWV VDANGNSVPG VSVTFDAGNG AVLAQNPVVT DRNGYAENTL TNLAIGTTTV
KATTVTDPVG QTVNTHFVAG AVDTITLTVP VNGAVANGVN TNSVQAVVSD SGGNPVTGAT
VVFSSTNATA QVTTVIGTTG ADGIATATLT NTVAGTSNVV ATIDTVNANI DTAFVAGAVA
TITLTAPVNG AVADGADTNQ VDALVEDANG NPITGAAVVF SSANGATILS STMNTGVNGV
ASTLLTHTVA GTSNVVATVD TVNANIDTTF VAGAVATITL TTPVNGAVAD GANSNSVQAV
VSDSDGNPVT GAAVVFSSAN ATAQITTVIG TTGADGIATA TLTNTVAGTS NVVATIDTVN
ANIDTAFVAG AVATITLTAP VNGAVADGAD TNQVDALVQD ANGNAITGAA VVFSSANGAD
IIAPTMNTGV NGVASTLLTH TVAGTSNVVA TIDTISANID TAFVAGAVAT ITLTAPVNGA
VADGADTNQV DALVEDANGN PITGAAVVFS SANGATILSS TMNTGVNGVA STFLTHTVAG
TSNVVATIGS VTENIDTAFV AGAVATITLT APVNGAVADG VNTNSVQAVV SDSDGNAVTG
ATVVFSSANA TAQITTVIGT TGADGIATAT LTNTVAGTSN VVATIDTVNA NIDTTFVAGE
LENIVVSIIN NNALANGADT NIVEAFVTDR FGNGVANQSL IFGTNGASIV GSSTVTTNLD
GRVRASATHT VAGSSNTVIA ISGAHQGYAR VTFVADVSTA QLKLTSFLDN QLANGKAGNI
AQALVTDAHD NLLANQSVSF ALDNGAVIES QGDASSASGI VLMRFNNTLA GMTTVTATLD
STGQTETLET HFVAGKAASI EMTMTKDNAV ANNIDTNEVQ VLVTDVDGNA INGAVVNLTS
NSGMNITPNS VTTGSDGTAT ATLTHTLAGS LPINARIDQV SKTINATFIA DASTAQIIAG
DMFIIVNDQV ANGQAVNAVQ ARVTDSYGNP IKDQTVEFVL SNNGTIQYEL DVTSVEGGVM
VTFTNTLAGI TNVTATVVSS GSSRNIDTTF IADVTTAHIA ASDLMVIVDD AVADNLDKNE
VHARVTDAKG NVLSGQTVIF TSGNGAAITT VNGISDGDGL TKATLTHTLA GTSVVTARVG
NRVQSKDTTF IADRTTATIR ASDLTITRNN ALADGVATNA ARVIVTDANG NPVPSMFVGY
TSDNGALLTP TSGMTDSSGT FSTTFTHTTA GISKVTAAIV TMGISQTKDA VFIADRSTAH
VSELIVVKND SLANNSDRNI VQAHIKDAHG NVVTGMNVNF SATENVTLTA NTVTTNSQGY
AENTLRHNAP VTSAVTATVA TDLVGLTEDV RFVAGAGARI ELFRLNDGAV ADGIQTNRVE
ARVYDVSDNL VPNSNVVFSA DNGGQLVQND VQTDALGSAY VTVSNINTGV TKVTVTADGV
SASTTTTFIA DRDTATLVTD RFLITHDNAV ANGVVENRVL LHLVDANDNS VSGVEVNFSA
TNGASINASA ITDINGFAIG VLTNTLSGPS DVTVTLVTPG GTESLTVTPQ FIADINTANI
ATGDFVIIDD GAVANSVDAN EVRARVTDNQ GNAIAGYSVV FSSQNGATIT TSGITGVDGW
ASAKLTHIKA GESGILARLS RPMATVHTLM PYFIADVSTA TLQLFNFNPI PIIADGVMQF
FVLGRVFDAN QNPVGGQQVA FSATNEVTLT ESNGSISTPE GSVLLSVTST QAGVHPITGT
LVSNNYTDTF GAAFIANKNT AQLSTLMVVD NNALADGVTR NQVRAHVVDS TGNSVADMAV
TFTANRGAQL SKVTVLTDNN GDAVNTLTNS LVGVTVVTAK LGTAGTPLTV DTVFTAGPLA
TLTLVTTVNN AFADNSATNT VQATLKDVSG NPIVGEVVAF AASNGATITA TDGGVSNANG
IVLATLTNGT AGVSTVTATI ETLTETTDTT FIAMKNLDVT VNGTTFNGDA GFPTTGFVGA
TFKVNSGGDN SLYDWSSSAP ALVSVSGDGV VTFNAVFPTG TPTITISATP KGGGSPLSYS
FRVNQWFINN NGATLNRADA ITHCENVGYT MPTSTQVTNA ATWMSGKRAV GNLWSEWGDF
SAYTAPGWVP AEFFWLSNNH DASTALAIGL STGTLTTMGD FMAITHVMCT RPI