Gene Dbac_2188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDbac_2188 
Symbol 
ID8377863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfomicrobium baculatum DSM 4028 
KingdomBacteria 
Replicon accessionNC_013173 
Strand
Start bp2507516 
End bp2516266 
Gene Length8751 bp 
Protein Length2916 aa 
Translation table11 
GC content64% 
IMG OID645001411 
Productglycosyltransferase 36 
Protein accessionYP_003158688 
Protein GI256829960 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGACGA CGAGCGACAT TGGCAAGAGC TCCTTTTCAA TTCTCTGGAC ACGTCTGACC 
GGACGCGGTC CATACGACAC GAGCATCGAC GAGGAGCTCC CTCTCCGGGC GGAACTGTTC
AGCGCCTCCC AGATGGAGCA GCACGGCAAA ATTCTGGCGA GCATGCACCA GCTCACGGAA
AAGCGCACGC ACGACCGCCT CCTGTCGCGT CTGGCCGAAA ACGAGCGGGT GTTGCGGAAC
GTGCACCGCC TGCTGACGGA AGACGTCAAG AACGACCGGC GCATGACCCC GGCGGGGGAA
TGGCTGCTCG ACAATTTCTA TGTGATCGAA GAACAGATCC GCACCGCCGC CAAACACTTG
CCCAAGGGTT ACAGCCGCGA ACTGCCGCAA CTGCGCGGCG GGCATTCGGC CGGTCTCCCG
CGCGTGTACG ATCTCGCCCT GGAGGCCATC TCGCACGGCG ACGGCCGGGT GGACCCGGAA
TATCTGAGCC GCTTCGTGGC CGCCTACCAG ACCGTGACAT CCCTCAAGCT CGGCGAACTG
TGGGCCATCC CCATCATGCT GCGCCTGGCC CTGATCGAGA ACCTGCGCCG GGTCGCCGCC
CGCCTCGCCG TGCGCAGGAA AGAACGCAAC AATGCCGATT ACTGGGCCAA TCTGATGACC
GAGGTGGCCG CCAGCGACCC CAAAAGCCTG ATCCTGACCA TCGCCGACAT GGCCAGGTCC
GACCCGCCCA TGACCAGCTC GTTTGTCGCG GAACTCACCC GCCGTCTGCA GGGACAGGGT
CCGGCGCAAT CCCTCCCGCT GTCCTGGATC GAACGCCGCC TGTCCGAATC CGGCCAAACC
ATCGATCAAC TGGTGCGCCT GGAAAACCAG CAGCAAGCCT CCGACCAGAT CTCCATCAGC
AACAGCATCG GCAGCCTGCG TTCCCTGAAC GCCACCAACT GGAACGATTT TGTCGAGTCC
TTGAGCACGG TCGAAAGTAT TCTCGGGCAA GACCCCGTTT CCATCTACGC CAAGATGAAT
TTTTCCACCC GCGACAGATA TCGCCATGTC GTGGAAAAAA TAGCAAAAAA ATGCCCGCAT
TCCGAATCCG AGGTGTCGCG CAAGGCGCTG GAACTGGCAC AGGAAGGCGC GGCGAGCCAG
AGCGAACAAC GCAAGGGCCA TGTCGGGTTC TATCTGGTCG GTGACGGACG TGCCCAGCTT
GAGCGGCTGG TTCAGATGCG CCGCTCGCCC GCTGCGGCGC TGTGCGTTCT TGGCCGCCAA
TTTCTCTTCT ATGGCGGCGG CATCGTGCTC CTGACGGCCA TGCTGACCAC GTTGCTTGCG
TCCAAGGCCC ATGGCGACGG ACTGAGCGGC TGGCAGCTCG GCCTTTTCTG CCTGCTCCTC
GTACTCTGCT GCAGCCACCT GGCGGTGTCG CTCGCGAATT GGCTGGCCAC GCTCCTGGCC
ACCCCGCGCC TGCTCCCGCG CATGGACTTT TCCAAGGGAA TCCCGCCCGA GGCGCGGACC
CTGGTCGTGG TCCCGACAAT GCTCACCAGC GCACGCTGCG TCGAGGAACT GGCCAACGAT
CTCGAAGTCC GATTTCTGGG GAATCGCGAC CGGCACCTCC ATTTTGCCCT GCTCACCGAC
CTTGCCGACG CAGCCCAGGA AACCATGCCG GAGGACGAAG AACTGGTCAG CCTGGCCCGC
GGCATGATCG AAGCGCTGAA CGCCAAATAC CCGAGCGATG CCGGCAGCAC CTTTTTCCTG
CTGCATCGTC CGCGCAGATT CAACGCCGCC GAAAAGACGT GGATGGGCTT TGAACGCAAG
CGCGGAAAAC TCGGTGATTT GAACGCCCTG CTGAGCAAGG GCTCGGCTGA TGCGATGGAT
CGCTTCGCGC TCATCGTCGG CGACACGTCC ATCCTGCCGG ACATCAGGTT CGTCATCACC
CTGGACACCG ACACGCAGCT GGCCCGCGAC TCGGCGCGAA AGTTTGTGGG CGCCATGTGC
CATCCGCTGA ACCGCGCCGT CTACGACGCG GCACGCGGTC GCGTCACGGC CGGCTACGGC
ATCCTGCAGC CACGGGTGGC GATCAGCCTG GCCGGCTCGA ATCGTTCGAA ATATGCGCGG
ATGTGCGCCA GCGAGGTCGG CATCGACCCC TACACCAACG CCGTCTCCGA TGTGTATCAG
GACCTGTTCG GCGAAGGTTC GTTCATCGGC AAGGGCATCT ACGACGTGGA CGTCTTCAGG
CAGGCGCTCG AAGGCCGCTT CCCGGACAAC CGGATTCTGA GCCACGACCT CCTGGAAGGC
TGTTACGCCA GGTCAGGACT TTTGAGCGAT GCGGAACTGT ATGAGCAGTA TCCGTCCCGC
TACAGCGAGG ACGTGAGCCG CAGGCATCGC TGGATCCGGG GCGACTGGCA GATCGCGCGT
TGGCTGATGC CCCGGGTTCC CGGCGCGGAT GGGCGGCGTT ACAAAAACCC CATTTCCATG
CTGTCCCAAT GGAAGATTCT CGACAACCTG CGGCGCAGCC TGACCGCCAC GGCGCTGACC
CTGCTTCTGA TCCTGGGCTG GACGGTTCTG CCCGCCGCCC TGTTCTGGAC CACGGTGGTG
CTCGGCATCA TTCTGATCCC TTCCCTGGTG GACTCGCTTT CAAACCTCCT CAAAAAACCC
GATGACGTTT CCCCAGCGCA GCACTTCGCC GCGTCGGCAG GCCTGACCCT CCAACATTTG
GGGCGCACGG CCTTCACCCT TGTCTGTCTG CCCTACGAGG CCTATTTCAG CCTGGACGCG
ATCCTGCGCA CCATCTGGAG GATGCTGGTC ACGCATCGGG GACTGCTGGA GTGGAATCCG
TCATGCAACG CACAAAGCGA CCAAGCCGGT TTCGCCGGCT CACTTACGGC CATGTGGATC
GGGCCGCTTC TGGCCATCAG CGTTGCGAGC GTGCTTGCCG TATCGGCGCC GCAGACGCTG
CCGGTGGCAG GTGGACTGCT CGCGCTCTGG CTTGTATCCC CGGCCATAGT CTGGTGGATC
AGCCGCGCCA TCCCTCGCCG CAAGGACGCC CTGACCAGGG AACAGATTCT CTTTCTGCGC
AGGGCGGCCC GCAAGACCTG GGCCTTCTTC GAAACATTCG TCGGCCCCGA GGATCAGTGG
CTCCCACCCG ACAATTTTCA GGAGCACCCC GCTCCCGTGG TTGCGCACCG CACCTCTCCG
ACCAACATGG GCCTTGCGCT GCTGGCGAAC CTCTCCGCCT GCGATTTTGG GATCATCTCC
GCCGGAAAAT TCCTCGACCG CACCATGAAG ACGCTGCGCA CCATGGGCAC GCTGGAACGC
TATCGAGGGC ATTTCTACAA CTGGTACGAC ACCCTGACCC TGGCCCCGCT GCCTCCGAGC
TACATCTCCT CCGTGGACAG CGGCAACCTG GCCGGTCATC TGTTGACCTT GAAGCCTGGC
CTGCTCGAAC TTGCGGGCCG CAAGATTCTG GAACCCCGCA CTTTCGAGAG CTTGCGCGAC
ACGCTTGAAA TCCTGGCCGA GGCCTTGAAA TCCACCCCAA AAGCGGCAGT GGACCCGCTT
GCCGGCCTCA GGCTCGACCT GAACGCCGTG AACGCCGCGC CCCGCTCCGA GCCGCTCACC
CTGGTGGCTT CACGGCAGCG GCTCGAACGG CTGGCGGCAT CGGCCACGAC CCTGATCGAC
AGCTTAAGCG CCTCGGGCTC GAACTCGAAC GCACTCTGGT GGGCAGGCGC CTTTGCCGAG
CAATGCCGCG ACGCGCTCGC GGAGCTGACC AGCCTTGCCC CGTGGGCGGA GCTTGCCATC
GTGCACAAAA ATCTGCCGGA TCTTCCCGCT CTCGACCGGA TCCCGACCTT AAGCGAACTG
ACGACCCTGG CCGAGGCCAT CCTGCCGGAC ATGACGCGGC GGCAGGAAAC GGCGTCCTCC
CTGGATTACA ACGCGTGGGG AGAGCTGCAA AGCCTGGTCA CCAAGGCCGG CCAGCACGCC
GAGGCGCGGA TCAAGACCTG CGAGCGCCTC GCCCTGCAAT GCGGCACAAT GGCCCAGATG
GAATTCGAGT TCCTGTGCGA CGAAACGAGC CATCTGCTGG CCATCGGCTA CAATGTCTCT
GAATGTCGAC GAGACAACAG CTTCTACGAC CTCTTGGCCT CGGAAGCGCG GCTGGCCACC
TTCGTGGCCA TTGCCCAGGG GCAGATTCCG CAGGAGAGCT GGTTCTCGCT GGGGCGCCTG
TTGACCACCG CCGGAGGCGA GCCGGTGCTG TTGTCCTGGA GCGGCTCCAT GTTCGAATAC
CTGATGCCGA TGCTGGTCAT GCCGACCTAC GAGTCCACCC TGCTTGAGCA AACCTGCAAC
GCCTCCGTGG CCAGACAGAT CGAATACGGC AAGAAGCGCT CCGTGCCCTG GGGCATTTCG
GAGTGCGGAT ACAACGCCAT CGACGTGCAC CAGAACTATC AATACCGGGC CTTTGGCGTG
CCTGGCCTTG GCCTCAAACG CGGACTGGCC GAGGATCTGG TCATCGCGCC CTACGCCTCG
GCCCTGGCGC TCATGGTCAC GCCGGGTGAG GCATGCCGGA ATCTCATCCG CCTGGCGGAC
GAAGGATTCG ACACCAGATT CGGCTTTTAC GAGGCCATCG ACTACACGCC CACGCGCCTG
CCGCGCGGCC AATCAAGCGT CGTGGTCCGG TCCTTCATGG CCCATCATCA GGGCATGAGC
CTGCTCTCCA TGGCCCATCT GCTGCTTGAT CGCCCCATGC AAAAGCGATT TGCCTCCGAC
CCGCTGTTCC AGGCGACCAT GCTGCTCCTG CAGGAGCGCA TCCCCAAGAC CACCGCGTTC
TATGCGCACA CCTCCGAACT GTCCGAACTC CAGATGGACT CGGACAGCGG GGGCTCGCAC
ATCCGCGTGT TCTCACGCCC GGACACGCCC GCCCCGGAAG TGCAGCTGCT TTCCAACGGC
CGCTACCACG TCATGGTCAC CAATGCCGGT GGCGGCTACA GCCGCTGGAA GGGTCTGAGC
GTAACCAGGT GGCGCGAAGA CCGCACCTGC GACAACTGGG GCACGTTCTG TTTCCTGCGT
GATCTGGACA CCCGGGAATA TTGGTCCACG GCGTACCAGC CGACGCTCAA ACAGGCGCAG
CGCTATGAAG CCATCTTCTC CGAAGGCCGG GCCGAATTCC GCGTGCAGGA CCACAATTAC
GACACCCATA CGGAGATCAC GGTGTCACCC GAAGACGATA TCGAGCTGCG CCGGGTGCGC
ATCACCAACC GCGCCCGGAC GCGCCGGACC CTTGATCTGA CCAGTTACGC CGAGGTGACC
CTGGCCGACT CCGCCTCCGA CGACCTGCAC CCCGCTTTCA GCAACCTGTT TGTCCAGACC
GAGATCATCC CCAAGCAACA GGCCATCCTC TGCTCCAGAC GGCCGCGCTC CCGGGGGGAA
AAGCCGCCCT GGATGCTGCA CATGATGATC GTGCACGACG TACGCCCCGT CGACATCTCC
TTTGAAACGG ACCGCATGCA ATTCCTCGGG CGCGAAAACA CCGTCGCCGA GCCGCTGGCC
ATGAGCTGCA AGGGCGAATA TTTCTCCGGC GCGCTCTCGG GCAGCGAAGG CTCCGTTCTG
GACCCCATTG TCTCCATCCG CGGCCGCATC GTCCTTGAGC CTGAAGAATC GGTAACCCTC
AACATCATCT CCGGCATCGC CGAAACCCGG GAAGAAACCC TGCGCCTGGT TGAAAAATAC
CAGGACCGCC GCATCGCGGA CAGGGTCTTC GACCTGGCCT GGACACACGG CCAGGTGCTT
TTGCGGCAGC TCAACGCATC CGAGGCCGAC GCGCAGCTCT ATGGCCGCCT TGCGGGATCG
GTCATTTATG CCAATGCGGC GCTGCGGGCC GAGCCGGGCA TTCTCATCAA GAATCGGCGA
GGACAATCCG GACTCTGGAG TCATGCGATT TCCGGGGATT TGCCCATCGT GCTGCTCCAG
ATCCAGGACA TCGCCAACAT CGAACTGGTC CGCCAGCTCG TGCAGGCCCA CGCCTACTGG
CGCCTGAAAG GGCTCAAAGT CGATCTGGTC ATCTGGAACG AGGATCAGAC CGGATACAGG
CAGGTCCTCT ATGATCAGAT CATGGGCCTC GTCGCGGCCG GGGTCGAATC CGGCGGAACC
GAACAGCCCG GCGGGATCTT CGTGCGCTCC GGCAATCGCT TCGCGGAAGA GGACCGCATC
CTGATCCAGA CGGTCGCGCG CGTCGTCATC TCCGACACCC TCGGGACCCT GGCGCAGCAG
ATCGACGGCC GCACCTTAAC GCGCATCAGG ATTCCCCAGC TGGTACCGAG CAGAACGCAT
CGCTCCGATC CCCTACCGCT AGCTCCGCAG CCACGCCACG ACCTGACCTT TTTCAACGGC
CAGGGCGGAT TCACTCCGGA CGGACGCGAA TATGTCATCA CCACCGCCCA GGACCTTGCC
ACGCCCGCGC CCTGGGTCAA TGTGCTGGCA AACCCGCTCT TCGGGAGCGT CATCTCCGAA
TGCGGCGGCT CCTACACCTG GAGCGAGAAC GCCCATGAAT TCCGCCTCAC CCCCTGGGAC
AACGACCCTG TCAGCGGCGA GGGCGGGGAA GCCTTCTACC TGCGCGACGA AGAACGGGGG
CATTACTGGT CGCCCACGCC GCGCCCTTGC CGCGGAGTGA CACCGTATGT GACCAGGCAC
GGGTTCGGAT ACAGCGTCTT CGAGCATACC GAACGCGGGA TCCGCTCGGA ACTCATGATC
TATGTCGCCC TCGACGCGCA GATTAAATTC TCGGTGCTGA AAGTGTCCAA TGTTTCCGAG
CGCTCCCGGC GGATTTCCGC GACCGGATAC GTGGAGTGGG TGCTTGGCGA CCTGCGTCCG
AAATCCGCGT TTCACGTCAT CACGGAAATC GACCAGCACA GCGGGGCTCT GCTGGCGTGC
AACCCGTACA ATCCCGAATT CGGCACGCGA ACGGCGTTTT TCGATGTGGA CGACGTCTCC
CGCACGATCA CCGCCGACCG CACGGAATTT CTGGGCCGCA GCGGCACGCT CCGAGCGCCC
GCCGCCATGA CCCGCACCCG CCTCTCCGGC AAGACGGGGA CGGCCATGGA CCCATGCGCG
GCGATCCAGG TCGGTTTCGA TCTCGAAGTC GGAGAAGAGC GGGAGATCAT CTTCAAACTG
GGCGTGGGAA CCGACGCCAC GGACGCGCAG CAGCTGATCC AGCGTTTCAG GGGCGCGCCC
GCGGCGCGGC ACGCCCTCGA CAAGGTATGG CAGCACTGGG CTCATACCCT CGGGGCCATT
CATGTCCAGA CTCCCGACCA ATCGCTCAAC GTGCTCGCCA ACGGCTGGCT CCTGTACCAG
ACCCTGGCCT GCCGCCTGTG GGCGCGCAGC GCGACCTACC AGTCGGGCGG GGCGTTCGGT
TTTCGCGACC AGCTTCAAGA CGTCATGGCC CTTGTCCATG CCAGGCCGGG CCTGGTCCGT
GAACATCTCC TGCTCTGCGC CTCCCGCCAA TTCGAGGAAG GCGACGTGCA GCACTGGTGG
CATCCGCCCA TGGGCCGGGG CGTGCGCACC AAATGCTCGG ACGACTACCT GTGGCTGCCG
CTCGCGACGT GCCGCTATGT CGCCGCCATC GGGGACACCG GCATTTTGGA TGAAAGCGTC
CCGTTCCTGC GGATGCGCGC GCTCGGTGAC GACGAGGAAT CCTGCTACGA TCTGCCCGAA
CGCTCGGACC AGAGCGCCAG CCTCTACGAA CACTGCGTGC GGGCCATCCG GCACGGCTTG
CGCTTCGGCG TGCACGGGTT GCCGCTCATC GGCTCCGGCG ATTGGAACGA CGGCATGAAT
CTGGTGGGCG AACACGGCAA AGGCGAGAGC GTCTGGCTCG GCTTCTTCCT GCATCATGTG
CTCGACGCCT TCGCCACGCT GGCCAATGGA CGGGGAGACG CCCCGTTTGC CGAAGAGTGC
CGGCAGGAAG CCGCCAAACT GAGCCGCATC ATCGAAGAGA GCGGTTGGGA TGGAAACTGG
TACCGCCGTG CTTACTTTGA TGACGGCACG CCGCTGGGTT CAGCGAAAAA CGACGAATGC
CGCATCGACT CCATCGCTCA GAGCTGGTCC GTGCTCTCGG GGGCGGGAGA CCCCAAACGG
GCGCGCACGG CCATGCAGTC CCTGGACAGG CATCTGGTTC GACGCGAGCA CGGGCTGGTT
CAACTTCTGG ACCCGCCTTT CGACACGTCC GACCTCAACC CCGGCTACAT CAAGGGATAT
GTGCCCGGAG TACGTGAAAA CGGGGGCCAG TACACCCACG CAGCCATCTG GGCTGCCATG
GCCTTTGCCC GTCAGGGGGA TACGAAACGC GCCTGGGAGA TTTTCGACAT GATCAACCCG
GTCAACCACG CCCGCACCCC CGAAGAGACG GCCATCTACA AGGGCGAGCC CTACGTCGTC
GCCGCCGATG TCTACGGCGC GCCGCCGCAT ACGGGCCGTG GCGGCTGGAC ATGGTACACG
GGCTCGGCCG CCTGGATGTA TCGGCTCATC ATGGAGTCGC TTCTGGGCCT GAGCCTTGAA
GTGGACAGGC TGCGCATCAC GCCCTGCCCC CATCCGCAAT GGCAGGAATT CACGGTGCAC
TACCGCTTCC GGGAAACGGT CTATCACATC ACCGTGTCGC AGTTTAAAGG CAAGAAGACC
GGCACGCGCC TGACCCTCGA TGATGTCGAG CAGACAGGCC AGACCATTGC GTTGGTGGAC
GACCACCAGG AACACTGGGT CAAGGTGCGC ATGAACGTGA GCGGTGTTTG A
 
Protein sequence
MKTTSDIGKS SFSILWTRLT GRGPYDTSID EELPLRAELF SASQMEQHGK ILASMHQLTE 
KRTHDRLLSR LAENERVLRN VHRLLTEDVK NDRRMTPAGE WLLDNFYVIE EQIRTAAKHL
PKGYSRELPQ LRGGHSAGLP RVYDLALEAI SHGDGRVDPE YLSRFVAAYQ TVTSLKLGEL
WAIPIMLRLA LIENLRRVAA RLAVRRKERN NADYWANLMT EVAASDPKSL ILTIADMARS
DPPMTSSFVA ELTRRLQGQG PAQSLPLSWI ERRLSESGQT IDQLVRLENQ QQASDQISIS
NSIGSLRSLN ATNWNDFVES LSTVESILGQ DPVSIYAKMN FSTRDRYRHV VEKIAKKCPH
SESEVSRKAL ELAQEGAASQ SEQRKGHVGF YLVGDGRAQL ERLVQMRRSP AAALCVLGRQ
FLFYGGGIVL LTAMLTTLLA SKAHGDGLSG WQLGLFCLLL VLCCSHLAVS LANWLATLLA
TPRLLPRMDF SKGIPPEART LVVVPTMLTS ARCVEELAND LEVRFLGNRD RHLHFALLTD
LADAAQETMP EDEELVSLAR GMIEALNAKY PSDAGSTFFL LHRPRRFNAA EKTWMGFERK
RGKLGDLNAL LSKGSADAMD RFALIVGDTS ILPDIRFVIT LDTDTQLARD SARKFVGAMC
HPLNRAVYDA ARGRVTAGYG ILQPRVAISL AGSNRSKYAR MCASEVGIDP YTNAVSDVYQ
DLFGEGSFIG KGIYDVDVFR QALEGRFPDN RILSHDLLEG CYARSGLLSD AELYEQYPSR
YSEDVSRRHR WIRGDWQIAR WLMPRVPGAD GRRYKNPISM LSQWKILDNL RRSLTATALT
LLLILGWTVL PAALFWTTVV LGIILIPSLV DSLSNLLKKP DDVSPAQHFA ASAGLTLQHL
GRTAFTLVCL PYEAYFSLDA ILRTIWRMLV THRGLLEWNP SCNAQSDQAG FAGSLTAMWI
GPLLAISVAS VLAVSAPQTL PVAGGLLALW LVSPAIVWWI SRAIPRRKDA LTREQILFLR
RAARKTWAFF ETFVGPEDQW LPPDNFQEHP APVVAHRTSP TNMGLALLAN LSACDFGIIS
AGKFLDRTMK TLRTMGTLER YRGHFYNWYD TLTLAPLPPS YISSVDSGNL AGHLLTLKPG
LLELAGRKIL EPRTFESLRD TLEILAEALK STPKAAVDPL AGLRLDLNAV NAAPRSEPLT
LVASRQRLER LAASATTLID SLSASGSNSN ALWWAGAFAE QCRDALAELT SLAPWAELAI
VHKNLPDLPA LDRIPTLSEL TTLAEAILPD MTRRQETASS LDYNAWGELQ SLVTKAGQHA
EARIKTCERL ALQCGTMAQM EFEFLCDETS HLLAIGYNVS ECRRDNSFYD LLASEARLAT
FVAIAQGQIP QESWFSLGRL LTTAGGEPVL LSWSGSMFEY LMPMLVMPTY ESTLLEQTCN
ASVARQIEYG KKRSVPWGIS ECGYNAIDVH QNYQYRAFGV PGLGLKRGLA EDLVIAPYAS
ALALMVTPGE ACRNLIRLAD EGFDTRFGFY EAIDYTPTRL PRGQSSVVVR SFMAHHQGMS
LLSMAHLLLD RPMQKRFASD PLFQATMLLL QERIPKTTAF YAHTSELSEL QMDSDSGGSH
IRVFSRPDTP APEVQLLSNG RYHVMVTNAG GGYSRWKGLS VTRWREDRTC DNWGTFCFLR
DLDTREYWST AYQPTLKQAQ RYEAIFSEGR AEFRVQDHNY DTHTEITVSP EDDIELRRVR
ITNRARTRRT LDLTSYAEVT LADSASDDLH PAFSNLFVQT EIIPKQQAIL CSRRPRSRGE
KPPWMLHMMI VHDVRPVDIS FETDRMQFLG RENTVAEPLA MSCKGEYFSG ALSGSEGSVL
DPIVSIRGRI VLEPEESVTL NIISGIAETR EETLRLVEKY QDRRIADRVF DLAWTHGQVL
LRQLNASEAD AQLYGRLAGS VIYANAALRA EPGILIKNRR GQSGLWSHAI SGDLPIVLLQ
IQDIANIELV RQLVQAHAYW RLKGLKVDLV IWNEDQTGYR QVLYDQIMGL VAAGVESGGT
EQPGGIFVRS GNRFAEEDRI LIQTVARVVI SDTLGTLAQQ IDGRTLTRIR IPQLVPSRTH
RSDPLPLAPQ PRHDLTFFNG QGGFTPDGRE YVITTAQDLA TPAPWVNVLA NPLFGSVISE
CGGSYTWSEN AHEFRLTPWD NDPVSGEGGE AFYLRDEERG HYWSPTPRPC RGVTPYVTRH
GFGYSVFEHT ERGIRSELMI YVALDAQIKF SVLKVSNVSE RSRRISATGY VEWVLGDLRP
KSAFHVITEI DQHSGALLAC NPYNPEFGTR TAFFDVDDVS RTITADRTEF LGRSGTLRAP
AAMTRTRLSG KTGTAMDPCA AIQVGFDLEV GEEREIIFKL GVGTDATDAQ QLIQRFRGAP
AARHALDKVW QHWAHTLGAI HVQTPDQSLN VLANGWLLYQ TLACRLWARS ATYQSGGAFG
FRDQLQDVMA LVHARPGLVR EHLLLCASRQ FEEGDVQHWW HPPMGRGVRT KCSDDYLWLP
LATCRYVAAI GDTGILDESV PFLRMRALGD DEESCYDLPE RSDQSASLYE HCVRAIRHGL
RFGVHGLPLI GSGDWNDGMN LVGEHGKGES VWLGFFLHHV LDAFATLANG RGDAPFAEEC
RQEAAKLSRI IEESGWDGNW YRRAYFDDGT PLGSAKNDEC RIDSIAQSWS VLSGAGDPKR
ARTAMQSLDR HLVRREHGLV QLLDPPFDTS DLNPGYIKGY VPGVRENGGQ YTHAAIWAAM
AFARQGDTKR AWEIFDMINP VNHARTPEET AIYKGEPYVV AADVYGAPPH TGRGGWTWYT
GSAAWMYRLI MESLLGLSLE VDRLRITPCP HPQWQEFTVH YRFRETVYHI TVSQFKGKKT
GTRLTLDDVE QTGQTIALVD DHQEHWVKVR MNVSGV