Gene Mvan_5057 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMvan_5057 
Symbol 
ID4644794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMycobacterium vanbaalenii PYR-1 
KingdomBacteria 
Replicon accessionNC_008726 
Strand
Start bp5413374 
End bp5423510 
Gene Length10137 bp 
Protein Length3378 aa 
Translation table11 
GC content64% 
IMG OID639808527 
Producthypothetical protein 
Protein accessionYP_955834 
Protein GI120406005 
COG category 
COG ID 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTGCAA CCCCGGCTCG TCATCGCGCC CAGCGGCAGG GCGTGGTTCC TGAAGCCGGC 
TTCTGGGCGG CGAAAGACCT TCTGGAGCTG ACAGCGCAGG GGCGGTCCGG CAAGCACGCC
CGCAAGGGAA CCTCGAAGTA CACCCTTCAC ATCGGCCGCG TTGGCGCCCT GGCGGTGTCG
CTCGGTGTGG GGTTGGCCGT CGCGAACGCT GGAGTTGCAT ACGCAGACAG TGACACGGAC
TCGTCGGTGT CCGAGTCCCA AGCCGACGAC TCCGCAGCAC CGCCGAGCTC GCCGCAGGTC
GACGCGACCG ATCAGACCGG GGTCGACAAT GACGAGGAAC CCGGTACCGC AGACGATGTC
GATGATGAAG AGGTCGGCGA GGACCTTGCT CCGGAGGAAG CCGACCCGTC CGCAAGCGAG
GACACGGACG TCGATTCCGA CGAAGAGCTG CCTGGGTCCT CAGGCGGATC CGGTGAATCA
CCGGGCGGAT CGGACAGCAC GCCCGACGAG CAGCAGGCTG CCGATCCCGA TGTGACAGCG
CCGGACTCCG AGCAGGTCGA TTCCTCACCC GACGAGACGG TGGTCAGCAC GCCCGAGGAC
GACGGGGCGG AGGAAGGCCC TGCCCTCGAT GATGCCCATC CTCAGGGCGA CGTGGCCGTC
GTGGACGTGG TGAACACCAC GACCGAACCG GTGGAACGGG CCTCCGCTAC TCCGTCCGAG
AGCGTCGAGA CCGTCGACAT CGTCACCGCG CTGGTGTCCA CCGTGGTGTC CCCGTTCGCC
AGCCCGGATG CGCCGGCACA GGTGCCGTGG TTCGATGCGT TGCTGGCATG GGTACGTCGC
CAGATCACCC ACACGTTCTT CAACAAGACA CCCGTATACG GTCCCATCAC GGTTGAGCAG
ATCTTCACGG GTCAGTTGTT CATCGATCTG AACGCGACAG ACCCCAATGG TGATCCGCTG
ACCTACGAGA TAGTCCAGCC GAACCACGGG GTCGTGGTCC GAGATGTGAT CACCGGCAAT
TTCATCTACA CACCGACCAC CATCGTCACC GGTGAGCCGT TGCAGGACAC CTTCCAGGTC
GTCATCAGGG ACGACTCGGA ACACCTCACC GGCGCGTTGG GATCCTTCCA GAAGCTCCTG
CACGGTGTCG CAAGGCTTTT CGGCCTTGCG CAGCGAGACA ACATCACCGT CACCATCCCC
GTCACGATCG ACCCCCTCGT TCAGTTGCCG CCCACGGTGG TCACCGCCGG ACTGCCGATC
TTCAAGCTGG GAGACTCGCC GGTCAAGGTG CTGTCCTCTG CGATCATCGC CGATGCGGAC
TCGGATCAGA TCTTTCAAGC CACGATCAGG ATCTCGACCG CAGGACAGGA CGGCGACGTA
TTGAATTACG TTGCACCGGA TGGAAGTCCG ATAACGGCAA CGTGGGATGT GACGACCAAG
ACCTTGACAC TGTCCGGGCT GGCGACAGCC GAACAGTATG AACAGGCGCT GCTCGCCGTA
ACATTTTCCA CGACCAAGGG CGGACTGCCT CGCGGCGTGT CGATCTCCGT CACCGACGAG
ACCGGTGTCC AGAGTCTGGT TCCAGGTGCG GCGATTGTGA CTGTGATCGG GTTGCCGCCC
ACGGTGACGG TGTTCGGGTT GCCGATCTTC AAGCTGGGCG GGTCCCCGGT GAAGGTCGTG
TCGTCGGTGA CGTTGGGTGA TCTGGATTCG GAGAACCTGT CCGAGGCCTC CATTGTGATC
GCGACGGCGT ATCAGGACGG TGACGTGCTC AGTTACGTTG CTCCTAGCGG CAATCCGATC
ACCGCGTCCT GGGATGCCGC GACGCGCACA TTGACGTTGT CGGGTGTGGC GACTCTCGAC
CAGTACGAGG AAGCGATCAA GGCGGTCACG TTCTCGGCGA CTGAGGGTGG TTTGTCGCGC
GGGATTTCCT TCAGTGTCAC CGATGAGGCC GGCGTGCAGA GTCTGGTCCC AGGTGCGGCG
ATCGTGAATG TCATCGGTCT TCCGCCGACG GTCACCGTCT TTGGCATCCC GATCTTCAAA
CTGGGCGGCG CCCCGGTCAA CGTCGTGTCG TCGGTGACCC TCGGAGACCT GGATTCGGAC
AATCTGTCCG AGGCCACGAT CGTGATCGCG ACTGCATATC AGGACGGTGA CGTCCTTTCC
TACACGGCGC CGTCGGGCAG TCCGATCACC GCATCCTGGG ATGCCGCGAC GCGCACATTG
ACGTTGTCGG GTGTGGCGAC TCTCGACCAG TACGAGGAGG CGATCAAGGC GGTCACCTTC
AGTGCGACTG AGGGTGGTCT GTCGCGCGGG ATTTCCTTCA GTGTCACTGA CGATGCCGGC
GTGCAGAGTC TGGTGCCCGG CGCCGCCGTG GTGTCCGTCA TCGGGTTGCC GCCGACGGTG
ACGGTCTTTG GCGTCCCGAT CTTCAAACTG GGCGGCGCCC CGGTCAAGGT CGTGTCGTCG
GTGACGTTGG GTGACCTGGA TTCGGAGAAC CTGTCCGAGG CCTCCATTGT GATCGCGACG
GCGTATCAGG ACGGTGACGT CCTCAGTTAC GTTGCCCCGA GCGGCAATCC GATCACCGCG
TCCTGGGATG CCGCGACTCG CACATTGACG TTGTCGGGTG TGGCGACTCT CGACCAGTAC
GAGGAAGCGA TCAAGGCGGT CACGTTCTCG GCGACTGAGG GTGGTTTGTC GCGCGGGATT
TCCTTCAGTG TCACCGATGA GGCCGGCGTG CAGAGTCTGG TCCCAGGTGC GGCGATCGTG
AATGTCATCG GTCTTCCGCC GACGGTCACC GTCTTTGGCA TCCCGATCTT CAAACTGGGC
GGCGCCCCGG TCAACGTCGT GTCGTCGGTG ACCCTCGGAG ACCTGGATTC GGACAATCTG
TCCGAGGCCA CGATCGTGAT CGCGACTGCA TATCAGGACG GTGACGTCCT CAGTTACGTT
GCCCCGAGCG GCAATCCGAT CACCGCGAGC TGGGATGCGG CCACGCGCAC ATTGACGCTC
TCCGGTGTGG CGTCGTTGGA TCAGTACGAG GAGGCGATCA AGGCGGTCAC GTTCTCGGCG
ACTGAGGGTG GTTTGTCGCG CGGGATTTCC TTCAGTGTCA CCGATGAGGC CGGCGTGCAG
AGTTTGGTGC CCGGCGCCGC CGTGGTGTCC GTCATCGGGT TGCCGCCGAC GGTGACGGTC
TTTGGCGTCC CGATCTTCAA ACTGGGCGGC GCCCCGGTGA AGGTCGTGTC GTCGGTGACG
TTGGGTGATC TGGATTCGGA GAACCTGTCC GAGGCCTCCA TTGTGATCGC GACGGCGTAT
CAGGACGGTG ACGTCCTCAG TTACGTTGCT CCTAGCGGCA ATCCGATCAC CGCGTCCTGG
GATGCCGCGA CGCGCACATT GACGTTGTCG GGTGTGGCGA CTCTCGACCA GTACGAGGAA
GCGATCAAGG CGGTCACGTT CTCGGCGACT GAGGGTGGTT TGTCGCGCGG GATTTCCTTC
AGTGTCACTG ACGAGGCCGG CGTGCAGAGT CTGGTCCCAG GTGCGGCGAT CGTGAATGTC
ATCGGTCTTC CGCCGACGGT CACCGTCTTT GGCATCCCGA TCTTCAAACT GGGCGGCGCC
CCGGTCAAGG TCGTGTCGTC GGTGACCCTC GGAGACCTGG ATTCGGAGAA CCTGTCCGAA
GCCTCCATTG TGATCGCGAC CGCGTATCAG GACGGCGATG TCCTTTCCTA CACGGCGCCG
TCGGGCAGTC CGATCACCGC ATCCTGGGAT GCCGCGACGC GCACATTGAC GTTGTCGGGT
GTGGCGTCGT TGGATCAGTA CGAGGAGGCG ATCAAGGCGG TCACCTTCAG TGCGACTGAG
GGTGGTTTGT CGCGCGGGAT TTCCTTCAGT GTCACTGACG ATGCCGGCGT GCAGAGTCTG
GTGCCCGGCG CCGCCGTGGT GTCCGTCATC GGGTTGCCGC CGACGGTCAC CGTCTTCGGG
ATCCCGATCT TCAAACTGGG CGGCGCCCCG GTCAAGGTCG TGTCGTCGGT GACGTTGGGT
GATCTGGATT CGGAGAACCT GTCCGAAGCC TCCATTGTGA TCGCGACCGC GTATCAGGAC
GGTGACGTCC TCAGTTACGT TGCTCCTAGC GGCAATCCGA TCACCGCGTC CTGGGATGCC
GCGACTCGCA CATTGACGTT GTCGGGTGTG GCGACTCTCG ACCAGTACGA GGAAGCGATC
AAGGCGGTCA CCTTCAGTGC GACTGAGGGT GGTCTGTCGC GCGGGATTTC CTTCAGTGTC
ACTGACGAGG CCGGCGTGCA GAGTCTGGTC CCAGGTGCGG CGATCGTGAA TGTCATCGGT
CTTCCGCCGA CGGTCACCGT CTTTGGCATC CCGATCTTCA AACTGGGCGG CGCCCCGGTC
AAGGTCGTGT CGTCGGTGAC CCTCGGAGAC CTGGATTCGG AGAACCTGTC CGAAGCCTCC
ATTGTGATCG CGACCGCGTA TCAGGACGGC GATGTCCTTT CCTACACGGC GCCGTCGGGC
AGTCCGATCA CCGCATCCTG GGATGCCGCG ACGCGCACAT TGACGTTGTC GGGTGTGGCG
TCGTTGGATC AGTACGAGGA GGCGATCAAG GCGGTCACCT TCAGTGCGAC TGAGGGTGGT
CTGTCGCGCG GGATTTCCTT CAGTGTCACT GACGATGCCG GCGTGCAGAG TCTGGTGCCC
GGCGCCGCCG TGGTGTCCGT CATCGGGTTG CCGCCGACGG TCACCGTCTT CGGGATCCCG
ATCTTCAAGC TGGGTGGCGC CCCGGTCAAG GTCGTGTCGT CGGTGACGTT GGGTGATCTG
GATTCGGAGA ACCTGTCCGA AGCCACGATC GTGATCGCGA CCGCGTATCA GGACGGTGAC
GTCCTCAGTT ACGTTGCTCC TAGCGGCAAT CCGATCACCG CGTCCTGGGA TGCCGCGACT
CGCACATTGA CGTTGTCGGG TGTGGCCACA CTCGATCAGT ACGAAGAGGC GATCAAGGCG
GTCACGTTCA GCACGACCGA GGGCGGTATC GCGCGCGGCA TTTCCGTGAG CGTGACGGAT
GACGCACAGG TGCAGAGCCT GGTGCCCGGT GCGGCGATCG TCAGTGTGAT CGGGTTGCCG
CCGACGGTGA CGGTCTTTGG CGTCCCGATC TTCAAACTGG GCGGCGCCCC GGTGAAGGTC
GTGTCGTCGG TGACGTTGGG TGATCTGGAT TCGGAGAACC TGTCCGAGGC CTCCATTGTG
ATCGCGACGG CGTATCAGGA CGGTGACGTG CTCAGTTACG TTGCTCCTAG CGGCAATCCG
ATCACCGCGA GCTGGGATGC CGCGACTCGC ACATTGACGC TGTCGGGTGT GGCGTCGTTG
GATCAGTACG AGGAGGCGAT CAAGGCGGTC ACGTTCTCGG CGACTGAGGG TGGTTTGTCG
CGCGGGATTT CCTTCAGTGT CACCGATGAG GCCGGCGTGC AGAGTTTGGT GCCCGGCGCC
GCCGTGGTGT CCGTCATCGG GTTGCCGCCG ACGGTCACCG TCTTCGGGAT CCCGATCTTC
AAACTGGGCG GCGCCCCGGT GAAGGTCGTG TCGTCGGTGA CGTTGGGTGA TCTGGATTCG
GAGAACCTGT CCGAAGCCAC GATCGTGATC GCGACCGCGT ATCAGGACGG TGACGTGCTG
AGCTACGTTG CTCCTAGCGG CAATCCGATC ACCGCGAGCT GGGATGCCGC GACTCGCACA
TTGACGTTGT CGGGTGTGGC GACTCTCGAC CAGTACGAAG AGGCGATCAA GGCGGTCACC
TTCAGTGCGA CTGAGGGTGG TCTGTCGCGC GGGATTTCCT TCAGTGTCAC TGACGATGCC
GGCGTGCAGA GTCTGGTGCC CGGCGCCGCC GTGGTGTCCG TGATCGGGTT GCCGCCGACG
GTGACGGTCT TTGGCGTCCC GATCTTCAAA CTGGGCGGCG CCCCGGTCAA GGTCGTGTCG
TCGGTGACGT TGGGTGATCT GGATTCGGAG AACCTGTCCG AAGCCACGAT CGTGATCGCG
ACCGCGTATC AGGACGGTGA CGTGCTGAGC TACACGGCGC CGTCGGGGAA TCCGATCACC
GCGAGCTGGG ACGCGGCCAC CAGAACCCTG ACCTTGTCGG GCGTGGCCAC ACTCGATCAG
TACGAAGAGG CGATCAAGGC GGTCACGTTC AGCGCGACCG AGGGCGGTAT CGCGCGCGGG
ATTTCTGTGA GCGTGACGGA TGACGCGCAG GTGCAGAGCC TGGTGCCCGG TGCGGCGATC
GTCAGTGTGA TCGGGTTGCC GCCGACGGTC ACGGTCTTTG GCGTCCCGAT CTTCAAACTG
GGCGGCGCCC CGGTGAAGGT CGTGTCGTCG GTGACGTTGG GTGACCTGGA TTCGGAGAAC
CTGTCCGAGG CCTCCATTGT GATCGCGACG GCGTATCAGG ACGGTGACGT GCTGAGTTAC
ACCGCGCCGT CAGGGAATCC GATCACCGCA AGCTGGGATG CCGCGACGAG GACGTTGACC
CTCTCCGGTG TGGCCACACT CGATCAGTAC GAAGAGGCGA TCAAGGCGGT CACGTTCAGC
ACGACCGAGG GCGGTATCGC GCGCGGCATT TCCGTGAGCG TGACGGATGA CGCACAGGTG
CAGAGCCTGG TGCCCGGTGC GGCGATCGTC AGTGTGATCG GGTTGCCGCC GACGGTGACG
GTCTTTGGCG TCCCGATCTT CAAACTGGGC GGCGCCCCGG TGAAGGTCGT GTCGTCGGTG
ACGTTGGGTG ATCTGGATTC GGAGAACCTG TCCGAGGCCT CCATTGTGAT CGCGACGGCG
TATCAGGACG GTGACGTGCT CAGTTACGTT GCTCCTAGCG GCAATCCGAT CACCGCGAGC
TGGGATGCCG CGACTCGCAC ATTGACGTTG TCGGGTGTGG CGACTCTCGA CCAGTACGAA
GAGGCGATCA AGGCGGTCAC CTTCAGTGCG ACTGAGGGTG GTCTGTCGCG CGGGATTTCC
TTCAGTGTCA CCGATGAGGC CGGCGTGCAG AGTCTGGTCC CAGGTGCGGC GATCGTGAAT
GTCATCGGTC TTCCGCCGAC GGTCACCGTC TTTGGCATCC CGATCTTCAA ACTGGGCGGC
GCCCCGGTCA ACGTCGTGTC GTCGGTGACC CTCGGAGACC TGGATTCGGA CAATCTGTCC
GAGGCCACGA TCGTGATCGC GACTGCATAT CAGGACGGTG ACGTCCTTTC CTACACGGCG
CCGTCGGGCA GTCCGATCAC CGCATCCTGG GATGCCGCGA CGCGCACATT GACGTTGTCG
GGTGTGGCCA CACTCGATCA GTACGAAGAG GCGATCAAGG CGGTCACGTT CAGCGCGACC
GAGGGCGGTA TCGCGCGCGG GATTTCTGTG AGCGTGACGG ATGACGCGCA GGTGCAGAGC
CTGGTGCCCG GTGCGGCGAT CGTCAGTGTG ATCGGGTTGC CGCCGACGGT CACGGTCTTT
GGCGTCCCGA TCTTCAAACT GGGCGGCGCC CCGGTGAAGG TCGTGTCGTC GGTGACGTTG
GGTGACCTGG ATTCGGAGAA CCTGTCCGAG GCCTCCATTG TGATCGCGAC GGCGTATCAG
GACGGTGACG TGCTGAGTTA CACCGCGCCG TCAGGGAATC CGATCACCGC AAGCTGGGAT
GCCGCGACGA GGACGTTGAC CCTCTCCGGT GTGGCCACAC TCGACCAGTA CGAAGAAGCC
ATCAAGGCGG TCACCTTCAG CACGACCGAG GGCGGTATCG CGCGCGGTAT CTCCGTGAGC
GTGACTGATG ACGCGCAGGT GCAGAGTCTG GTCCCGGGTG CGGCGATCGT GACTGTGATC
GGGTTGCCGC CGACGGTGAC GGTGTTCGGG ACGCCGATCT TCAAACTGGG CGGCGCCCCG
GTGAAGGTCG TGTCGTCGGT GACGTTGGGT GATCTGGATT CGGAGAACCT GTCCGAAGCC
ACGATCGTGA TCGCGACCGC GTATCAGGAC GGTGACGTGC TGAGCTACAC GGCGCCGTCG
GGGAATCCGA TCACCGCGTC CTGGGACGCG GCCACCAGAA CCCTGACCTT GTCGGGCGTG
GCGACTCTTG ACCAGTACGA GGAAGCGATC AAGGCGGTCA CCTTCAGCAC GACCGAGGGT
GGTCTGGCCC GTGGTGTGTC GGTGTCGGTG ATTGACGATG CCGGCGTGCA GAGTCTGGTC
CCGGGTGCGG CGATCGTGAA TGTGATCGGG CTGCCGCCGA CGGTCACCGT GTTCGGGACG
CCGATCTTCA AACTGGGCGG CGCCCCGGTC AAGGTCGTGT CGTCGGTGAC GTTGGGTGAT
CTGGATTCCG AGAATCTGAC TGAAGCCACG CTGGTGATCA GCTCGGCCTA CCAGACAGGC
GATGTGCTGA GCTACACGGC GCCGTCGGGG AATCCGATCA CCGCAAGCTG GGACGCCGCG
ACGAGGACGT TGACCCTCTC CGGGGTAGCG ACGCTCGATC AGTACGAGGA AGCGATCAAG
GCGGTCACCT TCAGCACGAC CGAGGGCGGT ATCGCGCGCG GGATTTCTGT GAGCGTGACG
GATGACGCGC AGGTGCAGAG TCTGGTCCCG GGTGCGGCGA TCGTGAATGT GATCGGGTTG
CCGCCGACGG TGACGGTGTT CGGGACGCCG ATTTTCAAGC TGGGCGGGTC CCCGGTCAAG
GTGGTGTCCT CGGTGAGCCT CGGCGATCTG GACTCGGACA ACTTGTCGGA AGCCACGATC
GTCGTCGCGA CCGCGTATCA GGACGGCGAT GTGCTGAGTT ACACCGCGCC GTCGGGGAAT
CCGATCACCG CGAGCTGGAA CGCGGCCACC AGAACCCTGA CCTTGTCGGG CGTGGCCACA
CTCGACCAGT ACGAAGAGGC GATCAAGGCG GTCACCTTCA CCACTAACCA GGGTGGCCTG
TCGAGGGGCA TCCAGATTCA CGTCACCGAT GACTCCGCCG TCAAGAGCCT CGTTCCGGGT
TCCGCGATCG TGACCGTGGT GGGCCTGCCG CCGTCGGTGG CCACTATCGG CGCACCGACC
TACACAATCG GCACCGCTCC GGTGAAGCTC ATCGCGTCGG CCAGTATCGC CGATGCGGAC
TCCGACAGCA TGTCGAAGGC CACCGTGACG ATCGCCACCC TCGGACAAGA CGGCGATGTC
CTGGGATACA TTGCCCCCTC AGGGAATCCG ATCACGGCAA GCTGGAACGC GGCCACCCGC
ACGCTGACGC TGGCTGGAGT GGCCACCAAG GCGCAGTACG AGGAGGCACT TGAGGCCGTC
ACCTTCTCTG CGACCGGGGG CGCGCTCCTC GTCCGTGGGA TATCGATCAC GGTGACCGAC
GACACCAACG TGGACAGCCT GCTGCCCGGT GCGGCGACCG CCAATGTCAG GTACTCGTTA
CAGCCCTCGG TGGTGACGGT CGGCACGCCC ACACACACCA TCGGCACCGC GCCCGTAACG
CTGCTGTCGT CCGCGACAAT CACCGACGCA GACTCCGACA TGTTCTCGTC GGCGAGGGTG
ACGATCGAGA CCCTCGGTCA GTCCGGTGAT GTGCTGGGAT ACGTTCAGCC CTCCGGCAAC
CCGATTACCG CAACGTGGGA TGCCGGCAGC AAGACGCTCA CACTGACGGG AATCGGCACC
AAGGCCCAGT ACGAGGAGGC GCTCGAGGCC GTCACGTTCT CGGCCACGGG CGGAATCCTG
TTTGTTCGTG GGATTTCCGT GTCGGTGTCT GACGATACCG GGGTCAGCAG CTCTGGACTG
CTCAACGGCC TGGCCACTGC AACCGTCCGT GAGAACTCGG CGCCCGGGTT GTGGATCACC
GGTGGAAAGT CCTACGACCG CAACGATCCT CCGATGAATC CCGTTGTCAC GCTCGACATC
AGCGACGACC TCGGCTACCT GTCCGGCGCC ACGCTGAAGG TCACTTCCTT CGTGCAGTCG
AACGACACCC TCGGTTACGT GCAGCCTTCG GGTAATCCAG TGACGGCATC GTGGGACTCG
GGTTCCAAGA CCCTGACGCT GTCCGGGACG GCGACGGTGG AACAGTACGA GCAGGCGCTG
CGGGCGGTGA CGTTCTGGGC GAACCAGGGC GGATGGACGA CTCGGACGAT CGCCGTCACC
GTCACGGACA ACGGTGGAAA GAGTGCTTCG GGTTCGATGA CCGTCAGCGT GTGGTAA
 
Protein sequence
MCATPARHRA QRQGVVPEAG FWAAKDLLEL TAQGRSGKHA RKGTSKYTLH IGRVGALAVS 
LGVGLAVANA GVAYADSDTD SSVSESQADD SAAPPSSPQV DATDQTGVDN DEEPGTADDV
DDEEVGEDLA PEEADPSASE DTDVDSDEEL PGSSGGSGES PGGSDSTPDE QQAADPDVTA
PDSEQVDSSP DETVVSTPED DGAEEGPALD DAHPQGDVAV VDVVNTTTEP VERASATPSE
SVETVDIVTA LVSTVVSPFA SPDAPAQVPW FDALLAWVRR QITHTFFNKT PVYGPITVEQ
IFTGQLFIDL NATDPNGDPL TYEIVQPNHG VVVRDVITGN FIYTPTTIVT GEPLQDTFQV
VIRDDSEHLT GALGSFQKLL HGVARLFGLA QRDNITVTIP VTIDPLVQLP PTVVTAGLPI
FKLGDSPVKV LSSAIIADAD SDQIFQATIR ISTAGQDGDV LNYVAPDGSP ITATWDVTTK
TLTLSGLATA EQYEQALLAV TFSTTKGGLP RGVSISVTDE TGVQSLVPGA AIVTVIGLPP
TVTVFGLPIF KLGGSPVKVV SSVTLGDLDS ENLSEASIVI ATAYQDGDVL SYVAPSGNPI
TASWDAATRT LTLSGVATLD QYEEAIKAVT FSATEGGLSR GISFSVTDEA GVQSLVPGAA
IVNVIGLPPT VTVFGIPIFK LGGAPVNVVS SVTLGDLDSD NLSEATIVIA TAYQDGDVLS
YTAPSGSPIT ASWDAATRTL TLSGVATLDQ YEEAIKAVTF SATEGGLSRG ISFSVTDDAG
VQSLVPGAAV VSVIGLPPTV TVFGVPIFKL GGAPVKVVSS VTLGDLDSEN LSEASIVIAT
AYQDGDVLSY VAPSGNPITA SWDAATRTLT LSGVATLDQY EEAIKAVTFS ATEGGLSRGI
SFSVTDEAGV QSLVPGAAIV NVIGLPPTVT VFGIPIFKLG GAPVNVVSSV TLGDLDSDNL
SEATIVIATA YQDGDVLSYV APSGNPITAS WDAATRTLTL SGVASLDQYE EAIKAVTFSA
TEGGLSRGIS FSVTDEAGVQ SLVPGAAVVS VIGLPPTVTV FGVPIFKLGG APVKVVSSVT
LGDLDSENLS EASIVIATAY QDGDVLSYVA PSGNPITASW DAATRTLTLS GVATLDQYEE
AIKAVTFSAT EGGLSRGISF SVTDEAGVQS LVPGAAIVNV IGLPPTVTVF GIPIFKLGGA
PVKVVSSVTL GDLDSENLSE ASIVIATAYQ DGDVLSYTAP SGSPITASWD AATRTLTLSG
VASLDQYEEA IKAVTFSATE GGLSRGISFS VTDDAGVQSL VPGAAVVSVI GLPPTVTVFG
IPIFKLGGAP VKVVSSVTLG DLDSENLSEA SIVIATAYQD GDVLSYVAPS GNPITASWDA
ATRTLTLSGV ATLDQYEEAI KAVTFSATEG GLSRGISFSV TDEAGVQSLV PGAAIVNVIG
LPPTVTVFGI PIFKLGGAPV KVVSSVTLGD LDSENLSEAS IVIATAYQDG DVLSYTAPSG
SPITASWDAA TRTLTLSGVA SLDQYEEAIK AVTFSATEGG LSRGISFSVT DDAGVQSLVP
GAAVVSVIGL PPTVTVFGIP IFKLGGAPVK VVSSVTLGDL DSENLSEATI VIATAYQDGD
VLSYVAPSGN PITASWDAAT RTLTLSGVAT LDQYEEAIKA VTFSTTEGGI ARGISVSVTD
DAQVQSLVPG AAIVSVIGLP PTVTVFGVPI FKLGGAPVKV VSSVTLGDLD SENLSEASIV
IATAYQDGDV LSYVAPSGNP ITASWDAATR TLTLSGVASL DQYEEAIKAV TFSATEGGLS
RGISFSVTDE AGVQSLVPGA AVVSVIGLPP TVTVFGIPIF KLGGAPVKVV SSVTLGDLDS
ENLSEATIVI ATAYQDGDVL SYVAPSGNPI TASWDAATRT LTLSGVATLD QYEEAIKAVT
FSATEGGLSR GISFSVTDDA GVQSLVPGAA VVSVIGLPPT VTVFGVPIFK LGGAPVKVVS
SVTLGDLDSE NLSEATIVIA TAYQDGDVLS YTAPSGNPIT ASWDAATRTL TLSGVATLDQ
YEEAIKAVTF SATEGGIARG ISVSVTDDAQ VQSLVPGAAI VSVIGLPPTV TVFGVPIFKL
GGAPVKVVSS VTLGDLDSEN LSEASIVIAT AYQDGDVLSY TAPSGNPITA SWDAATRTLT
LSGVATLDQY EEAIKAVTFS TTEGGIARGI SVSVTDDAQV QSLVPGAAIV SVIGLPPTVT
VFGVPIFKLG GAPVKVVSSV TLGDLDSENL SEASIVIATA YQDGDVLSYV APSGNPITAS
WDAATRTLTL SGVATLDQYE EAIKAVTFSA TEGGLSRGIS FSVTDEAGVQ SLVPGAAIVN
VIGLPPTVTV FGIPIFKLGG APVNVVSSVT LGDLDSDNLS EATIVIATAY QDGDVLSYTA
PSGSPITASW DAATRTLTLS GVATLDQYEE AIKAVTFSAT EGGIARGISV SVTDDAQVQS
LVPGAAIVSV IGLPPTVTVF GVPIFKLGGA PVKVVSSVTL GDLDSENLSE ASIVIATAYQ
DGDVLSYTAP SGNPITASWD AATRTLTLSG VATLDQYEEA IKAVTFSTTE GGIARGISVS
VTDDAQVQSL VPGAAIVTVI GLPPTVTVFG TPIFKLGGAP VKVVSSVTLG DLDSENLSEA
TIVIATAYQD GDVLSYTAPS GNPITASWDA ATRTLTLSGV ATLDQYEEAI KAVTFSTTEG
GLARGVSVSV IDDAGVQSLV PGAAIVNVIG LPPTVTVFGT PIFKLGGAPV KVVSSVTLGD
LDSENLTEAT LVISSAYQTG DVLSYTAPSG NPITASWDAA TRTLTLSGVA TLDQYEEAIK
AVTFSTTEGG IARGISVSVT DDAQVQSLVP GAAIVNVIGL PPTVTVFGTP IFKLGGSPVK
VVSSVSLGDL DSDNLSEATI VVATAYQDGD VLSYTAPSGN PITASWNAAT RTLTLSGVAT
LDQYEEAIKA VTFTTNQGGL SRGIQIHVTD DSAVKSLVPG SAIVTVVGLP PSVATIGAPT
YTIGTAPVKL IASASIADAD SDSMSKATVT IATLGQDGDV LGYIAPSGNP ITASWNAATR
TLTLAGVATK AQYEEALEAV TFSATGGALL VRGISITVTD DTNVDSLLPG AATANVRYSL
QPSVVTVGTP THTIGTAPVT LLSSATITDA DSDMFSSARV TIETLGQSGD VLGYVQPSGN
PITATWDAGS KTLTLTGIGT KAQYEEALEA VTFSATGGIL FVRGISVSVS DDTGVSSSGL
LNGLATATVR ENSAPGLWIT GGKSYDRNDP PMNPVVTLDI SDDLGYLSGA TLKVTSFVQS
NDTLGYVQPS GNPVTASWDS GSKTLTLSGT ATVEQYEQAL RAVTFWANQG GWTTRTIAVT
VTDNGGKSAS GSMTVSVW