Gene Cthe_1221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1221 
Symbol 
ID4809913 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1455455 
End bp1464223 
Gene Length8769 bp 
Protein Length2922 aa 
Translation table11 
GC content42% 
IMG OID640106644 
Productglycosyltransferase 36 
Protein accessionYP_001037646 
Protein GI125973736 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3459] Cellobiose phosphorylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGATGC AACTATACAT TTTATATCTG TTGGGTTTAT TTGGTATATT ATTGTGCTTA 
TTTTTGTTAG CAATTTTTTC AAATTGCAAT GAAAGACAAC GTCAACTAAA AGTGCAGGAT
GCATCTTTGA CTTTTGATGA ACTGGAAGCC TATGCGAAAG AAATAGCAAT TGAACATTCT
GTATCCGGGA AAAAGAGTAT GTTTTCCTGG CCAATACCCA GGATGAATGA TAATTACAGG
TATATAATGT CCGTATATAA AGAAATGAAT GAAGATGTAC AAAAGGGTAT CAGCACTACG
CCTGCTGCAG AATGGCTTTT GGATAATTTT TATATTATAG AGGAACAAGT AAAGAGTTTA
AGAAGGGACC TTACAAAGGA GGTCTATGCA AAACTTCCCG TGCTTGACAG CGGACACCTT
AAAGGCTATG CACGGATATA TTCCATTGCC CTGGAATTGC TTTCCCATAC CGACGGCAGG
ATTGATGAAA AAGTGCTGGT TAACTACATA AAGGCATATC AGTCCAACAA TGTACTTACC
GGAAGGGAAT TGTGGGCTTT CCCTATTATG CTCAAACTTG TGCTTATAGA AAAAACAAGA
TATATTTGCG AAAAAATTGC AAAAGCCCAG GAACAAAGAA GAAAAGTGGA AGAAATACTT
AAAGCTTTTG ATGAAAATAT TGAAAACACC ACTCAACTTA TAACTGCCAT AGATAATGAG
CTTAAAGGGA AATACGAGGT AAATTCGGCA TTTATTGAAT ATCTTGCATA CAAATTCAGA
AAGATGGGAA GAGCCTATAC CCATGTTCTG CGTTATATAG ATGAAAGGCT TGGTGAAAGC
GGCACAACGG TTGATGACAT TACCCAGAAA GAGCATAATG AACAGACAGC AAGCAAGGCA
TCCATTGGCA ACTGTATAAT GAGCCTGAAA TTTATTTCAA CCGTTAATTG GGTGGATATT
TTTGAACAGC TTAGCAAAGT GGAGCAGATT TTAAGAGAAG ATCCCTCAGG TTTTTATTCC
TTGATGGATT TTGATTCGAG AAATTATTAC AGAAACAGAG TGGAAAAACT GGCTCTAAAA
TATAAAGTTT CAGAATCCCA TGTTGCCAAA AAGGCTGTTG AACTTGCCAG AAATGCGGTG
GAAAACGGCA ATTTGACCGA TAAGCGTTTG ACCCATGTAG GGTATTATCT TGTCGGAAAA
GGTATTTGTG AGCTGGAAAA AGAAATAGGC TATGAAAAAA GCTTTAATCA AAGGATGTTT
GAAAGAATAA AAGAGCATCC TGCATGTTTG TATTTTGGTT TTATCGGGTT TATAACTGTC
TTATTATTGC TATGCGTGAC GAAGTATTCT CTTTTCAGAG CGGAGAAATA CGGTATTGCC
CTGTCAATTA TTGCAGTTTT GGCAACGATT ATTCCTGCAA CCGACATTGC AGTTAATTTT
GTAAACTGGG TATTGTGCAA GATGATTAAG CCTTCGCTGC TGCCAAAACT TGACTTTGAA
AACGGAATAC CTGAAGAGTA TGCCACAATG GTTGTCATAC CTGCTCTGCT TCCTGACGAG
AATCGGGCGA GGGAGCTGAT TGACAACCTT GAAGTATATT ATCTTGCCAA CCGGGAGAAG
AATTTGTATT TCTCAATTGC CGGGGATTTC AAGGATGCTC CCAACAAAGA AATGGCCGGT
GATAAAAAGA TAATTGAAAC TGCGCTGGGC AGAATTGCAG AGCTTAATGA AAAATATGGC
AGAAAAAACG AAGGCGGAGA AAAAGACTCC CGGGACATAT TTTATTATTT TCATAGACAC
AGACAGTTTA ACGAAAAGCA GAACAAATGG ATGGGATGGG AGAGAAAAAG AGGAGCTCTT
CTTGAGTTTA ACGAAGTCCT TCTGGGCTCA AGAACTACGA GCTATTCAAT AATGTCCCAT
GACGTGTCAC AGCTTCCGAA AATAAAATAT GTCATTACTT TGGATGCAGA TACCATACTG
CCTTTGGGTG CGGCCAGGAA GCTGATAGGA ACCATGGCCC ATCCTTTGCA CAGGCCTGTG
ATTGACGAAC AGAAGGGAAT AGTAACCGAG GGCTACGGGC TTTTACAGCC AAGAATAGGT
TTTGATATTG AAAGCGTTAA CAAGTCGCTG TTTTCAAGGA TATTTGCCGG TGAGGAAGGA
ATAGACCCTT ATGCCAGCGC TATTTCAGAC GTCTATCAGG ATCTTTTCGG CGAGGGGATA
TTTACCGGTA AAGGTATATA TGACCTTGAG GTTTTCCAAA AACTTTTGAA GGATGCCATA
CCCGACAATA CCGTACTTAG CCATGACCTT CTTGAAGGTT CGTATGTCAG AGCAGGGCTT
GTGACGGATA TTGAGTTTAT CGATGGTTAT CCTTCAAAGC TGAACTCCTA TGCCATGAGG
CTTCACCGTT GGGTGAGGGG AGACTGGCAG CTTCTGCCGT GGCTTCGTGG CAAAACAAAA
GACAGAAAGG GAAATGTGAT AAAGAATCCT CTTTCATTGA TTTCCAGATG GAAGATATTA
GATAACCTTC GAAGAAGTAT AGTAGCACCT TCAATAACGC TGCTTATTGC TTTGGGATTC
AGTATTTTGC CGGGCAGTTC CCTTTTCTGG TTGGGGGCCT CCCTTTTAAC TATATATTTT
CCGCTGATTA CGGGAACTAT AGACTATATT GCATCAAAGC CTTTAGGGGC AATTACTTCA
AAAAGATACA AACCTGCCAT ATGCGGGCTG AAGGCGTCTT TTCTGCAAAT GACATTGCAG
TTTGTCTTCC TGCCGTATAA TGCATGGCTT ATGGTGCATG CGGCGGTATT GAGCCTTGTC
AGGGTTCTGT TTACAAAAAG GAACATGCTT GAGTGGGTAA CCGCACTGGA TGCTGAAAGA
GGCCTTAAGA ATTCACTTAA AGGTTATGTA ATAAAGATGA AGGCGGCAGC ATTTCAGGCA
CTGGTTGTTG TGGTTCTGGC TTTTGCATTT AAAACCGGTT TTTCGGCGGC AGTATCCGTT
CTTCCGTTTG CCGTGTGGGT TTCATCGCCT TTTATAGCTT ATTGGATAAG CAAAGAGACG
GTTTACAAAA CAGAGACTTT AAGCGACGAG GAGAATCTGG AACTTAGACG TATTGCCAGA
AAGACATGGA GATATTATGA GGAGTTTGTA AACAGGAGAA ACAACTACCT TGCGCCGGAC
AACTTTCAGG AAGACCCTCC GAATGGCATA GCGTACAGGA CGTCCCCAAC CAATATTGGA
TTGGGTATGC TTGCAGCTCT CACAGCCAGA GACTTGGGTT ATATAGGAAC TTTGGAGCTT
TGTGACATTA TTTCAAGAAC AATGAGTACC GTTGAAAAGA TGGAAAAGTG GAACGGCCAT
CTTTACAACT GGTATGATAC ACGGACACTG GAGACTTTAA GACCAAGGTA TATTTCCACC
GTTGACAGCG GAAACTTTGT CTGTTACCTC ATAACTCTCA AAGAAGGTTT GGCGGAGTAT
CTCAACAGAC CTCTTGAGGA CAGGGCGTTT ATTGACGGAA TAAGGGATAC GGCAAGTCTG
ATTGCCGACG AAAACGAGAA CCCTTACAAG GATATTTCAT GCCTCAAAGA GTGCATTGTA
ATTTCCGAAG GCAGAAGTTA TGTGGATATA CCGCAGATGA TGAAGGCATT GACAAAGCTT
TCGGAAGACG GAAATAAGAT GAAGGACAGC AAGGATGTAT GGAAGGCAAA GGTTGACAGT
ATGATAGAGA TGCTGAAAAT TGAGCTGTAC ACCTACATGC CTTGGTGCGA CATGATTGAC
GAACTGACCG AAGCTTTTGA AAAGAGCGAG GCTGATATAA AAGAAGCTTT TCATGGCATA
ATAAGGAAGC TGAATTCCGA CTACTCCCTC AAAGCCATGC CTGTGGTATA CAGGGAAACA
ATAAAACAAA TCGAAAAGCT CAGGAAAAAG TTAAAAGACG GACAGCAAAA GAATATAGAA
GGACTTGACA GGCTTAAGGA GGCTTTGGAA GGGGCAACGG AAAGTGCGGA CAAACTGGTG
AAAAGATATG TGGATTTAAT AAACAGGATA TGCAGAATTG CTGATGAGAC GGAATTTGTG
CATTTGTATG ACAAAAAGAA GCAGCTGTTT TCCATTGGAT ACAATATTGA AGAAAACAGT
CTTACAAATT CCTATTATGA CTTACTGGCT TCTGAAGCCC GGCAGACCAG TTACATTGCC
ATTGCAAGAG GAGAGGTGGA CCAGCAGCAC TGGTTCAAGC TTGGCAGAAC ACTTACCCAG
ATAGATCGGT ACAAAGGAAT GGTTTCATGG AGCGGAACCA TGTTTGAATA TTTTATGCCT
TTACTCATTA TGAAGAGTCA CAAAAATACT TTGCTTGACG AAACCTATTC CTTTGTGGTA
AGGAGCCAGA AAAAGTACGG AAAACAGAGA AATCTGCCCT GGGGTATATC CGAGTCGGGA
TTTTATTCCT TTGACATAAA CCTGGATTAC CAATACAAGG CTTTTGGTGT GCCATGGCTG
GGGCTAAAAA GAGGGCTTGT TGAGGATATG GTTGTTTCTC CTTATGCGAC CATGCTGGTT
CTTCCTTTGG TTCCGAGAGA TGCAATGGAC AATTTGAAAA GACTGATTGC CGAGGGTGCC
TACGGACATT ATGGTATGTA CGAGGCAATT GATTACACTC CCGAAAGAAT TCCTTTAGGC
GAGAAGAAAG GAATTGTAAA GAGCTATATG GCTCACCACC AGGGTATGAG CATATTGGCT
CTGAACAACT ATTTTAACGA CAATATAATG CAAAAACGTT TCCATGCCGA CCCTGTGGTT
GATGCCGCAA AACTTCTTCT GATGGAAAAA GTACCGTCAA ATATAGTGTT TACAAAGGAA
AACAAGGAGA AAATACTTCC TTTCAAGGAT GTGGTATATG ATGAGAAAGA TTTCCTGAGA
GAGTGCGGTA TGCCTGACCC TGTACTTCCG AAGGCCCATA TACTGTCAAA CGGCAACTAC
TCAGTCATGG TTACCGACAG GGGTACCGGC TACAGCAGAT GGAAAAACCT AGATGTAACC
CGCTGGAGAG AAGATGTGAC TTTGGATAAT TATGGAATGT TCTTCTATAT AAGAGATGTA
CAGAACGATG AAGTATGGAC TTCCACCTTT GCACCCGGAA GGAAAAAACC GGACGAATAC
AAAGTTGAGT TTACTTCGGG AAAAGCAAAA TACTACAGAA AAGACGGGGA TATTGACACA
TTGACAGAGA TAGTTGTTTG CGCCGGGGAA AATGCTGAGA TTAGAAGTAT TACTTTGGCA
AATCACGGAC AGGAAAGCTG TGTGATGGAG ATAACCAGCT ATTTTGAACC GGTTTTGTCC
CACCACGGTG CGGATATTGC CCATCCGGCC TTTGGAAACC TGTTTATCAG GACGGAGTTT
TTGGCGGAAC ATAACTGTCT GATTGCCGGA AGAAGGCCAA GATCGGAAAA GGAAAAACCT
GTATGGATAA TGAATACCGT GGTTTTGGAA GGAGAAGGGG TAGGAAGCCT GCAGTATGAA
ACCGACAGAA TGCAGTTTAT AGGAAGAGGC AGAAATGTGT CGGAACCGGT GGCGCTGGAA
CCTCACAGGC CGCTTACCAA TTCCGTTGGT GCAGTGTTGG ATCCGGTTAT GAGTTTCAGG
CAGATAGTCA GAGTTGAACC CGGTAAATCA GTAAAAATAT CCTTTGTAAC TGCGGTGGCA
AACAGTCGGG AAGATGTAGT GGAGATGGCC ACGAAGTTTA AAAGCCCGCA GGTGATAAAG
GATGAGCTTG GCATGGCTGT TACAAAGAGC CGAGTGGAAG CAAGATATTT AAATCTTGAT
ACGGAAGAAA TAGAACTGTA TCAGGATATG ATTTCCCATA TACTGTTTAT CAGTCCTCTG
CAAAGACAGA AACAAAAATG GGTAATGAAC AACAAAAAAG GACAACCGGG TCTTTGGCCT
TATGGTATTT CGGGAGATAT ACCGATTGTA CTTGTCATGC TTGACAAAAC CGATGACATA
GATATTGTGA GGGAAGTACT GAAGGCTCAT GAATACTGGC GTTTAAAGAA ACTGGCCGTG
GATCTGGTGA TCCTCAATGA GGAAGAAAAC AGCTACACCA ACCCGGTAAA CAGTTTGCTA
ATGGATATAA TTGCCGAAAG TCATGCCCAT GACCTGATAA ACAAACCGGG AGGAGTGTTT
ATTCTTAAGA AAAGTAACAT GCCTCCGGAG GATATTGATT TGATTTGTTC GGTTTCAAGG
ATAATATTAA AAGGTGATGC GGGTGACTTG AAAGACCAGG TAAAATATGC AAGAAGCATT
GCTTTGGCAG AGTTTAAGCA ATTTGAAAAG AAACCGGCAA GTTACGACTC CAAGCTTGCA
AAGGATTTGG AGCTCAACTT CTACAACGGT CTTGGCGGGT TTGGCAAGGA TGGCAAAGAG
TATGTCATTT TCCTTGAAAA CGGCCAGAAT ACTCCTCTGC CATGGATAAA TGTAATTTCC
AATCAGAGAT TTGGATTTAT AGTAACGGAG TCCGGTTCAG GCTATACATG GTTTGAAAAC
AGCCGGGAGA ACAAGCTTAC ACCGTGGTCA AACGACCCGG TAAGCGATAC ACCGGGAGAA
ATACTGTATG TCATGGATGA ACATGCAGGA GATGTATGGT CCGTAACACC GTTGCCTGTG
AGAGAAAAAG AGCCGTATAT GATAAGGCAT GGTTTTGGCT ATACGGTCTT CAGTCATGCA
AGCCATGGCA TTGAACAGGA AATGGTGCAG TTTGTTCCGG TTGATGATTC GGTAAAAATT
AGTATTCTGA AACTAAAAAA TCAATCACAG GAAAATAGGG GCTTGAGTCT GACATATTAT
ATAAGGCCGG TCCTTGGAGT CAGCGATCAG TTTACTGCCA TGCATATAAA TACTAAGGCT
GACAATGGCA TGATTGTGAT AAAGAATAAC TATAATGACG AGTTTCCCGG CAGAGTTGCC
TTCATTGATT CTTCATTGAA AGTCAACTCA CTGACCTGTG ACAGAAAAGA GTTCTTTGGA
GCAGGAGATA TTGCAAATCC TGAAGGAATA AAACGTACTT CTCTTTCAGG AACAACCGGA
GCGGGTTTTG ACCCTTGTGC TGCCATAAGC GTGAGTGTAA ACCTCAAACC TGATGAGGAA
AAAGAGATCA TTTTCCTTCT TGGAGCAGGC AGAGACGAAG AAGAAGCAAG GCAGCTTTCT
GCGAAATACA AAAAATTGGA AGAGGCTAAA AAAGCATTGG GCGAAGTGAA GAAATTCTGG
GAATTAAAGC TTGGGGCACT GCAGTTTGAA ACTCCGAATA CGGCAATGGA TATACTGCTT
AACGGATGGC TTCTGTATCA GGTTGTTTCC TGCAGGCTCT GGACGAGATC AGGCTTTTAC
CAGTCGGGTG GCGCATATGG CTTTAGAGAT CAGCTTCAGG ACAGTATCTC ATTGACCCAT
ATATGGCCGG AGGCCACCAG AAACCAGATA CTTCTTCACT CAAGGCACCA ATTTATAGAA
GGGGATGTAC AGCACTGGTG GCATGAAGAA AAATACAAAG GTACGAGAAC AAAATTTTCC
GATGACCTTC TATGGATGCC CTATGCTACG ATTGAATATA TAAGAATTAC CGGGGATTAT
GACATACTTT ACGAAGAGAC ACCGTTTTTA GAGGATGAAC CGTTAAAGGA ATTTGAAGAT
GAAGCTTATC GTGTTCCGAG GATATCTCAT ACGGTATCGA CCCTTTATGA CCACTGTATC
AGAGCCATCA ACCGGTCTTT AAAGTTTGGA GAACATGGAA TACCGTTAAT TGGCTCCGGA
GACTGGAATG ACGGAATGAA TACAGTGGGC AACAAAGGAA AGGGAGAAAG TGTATGGCTT
GGCTGGTTCC TTTATTCCAT ACTTAAAAAT TTTGCTCCGC TGTGCGAAAG AATGGGTGAT
AATGAACTTG CAAAAAGGTA TCTGGACACG GCAGACCGGA TTGTTGAGAA TATTGAGAAA
AATGCCTGGG ATGGAAAGTG GTACAGAAGA GCATATTTTG ACAACGGGGT GCCTTTGGGT
TCCATACAGA ACAGTGAATG CCAGATTGAC TCTCTGGCCC AGTCTTGGGC TGTAATATCC
GAAGGAGGAG ACAAAGAGAG GATTGCTGAA GCCATGAGTG CCCTTGAAAA CTATCTGGTA
AAACGGGATG AGGGACTTAT AAAGCTTCTT ACTCCTCCTT TTGACGAAGG AGATTTGGAA
CCGGGCTATA TAAAGAGTTA TGTGCCGGGA GTCCGTGAAA ACGGTGGACA ATATACCCAT
GCTGCTGCCT GGGTTGTCAT GGCTTTTGCA AAGATGGGAG ACGGGGAAAA AGCGATGGAG
CTTTTTGACC TGTTAAATCC TATAAATCAC TCAAGAACCC ATATTGAATA TTCCAGGTAC
AAGGTTGAGC CTTATGTAAT GGCTGCGGAT GTTTATTCAG TTCCGCCTCA TACAGGCAGG
GGAGGATGGA CCTGGTATAC AGGCTCGGCG GGATGGATCT ATCGTGTTGG TTTTGAATAT
ATTTTAGGAT TTAAAAAGCG GGGAGAAACT CTTGAGATAG ACCCTTGCAT ACCGGGAAAA
TGGACGGATT TTACGATTAA ATATCGTTAT TATGATACTG ATTATATTAT AGAAGTGAAA
AACCCTGAAG GAGTGAATAC CGGAGTCAAA AAGGTCATTG TTGACGGAAA AGTTTGCGAT
GACGGAAAAG TTCAGCTTGT CAATGACAAA GACACACACA AGGTAGAGGT CTATATGGGA
AAAAAGTAA
 
Protein sequence
MQMQLYILYL LGLFGILLCL FLLAIFSNCN ERQRQLKVQD ASLTFDELEA YAKEIAIEHS 
VSGKKSMFSW PIPRMNDNYR YIMSVYKEMN EDVQKGISTT PAAEWLLDNF YIIEEQVKSL
RRDLTKEVYA KLPVLDSGHL KGYARIYSIA LELLSHTDGR IDEKVLVNYI KAYQSNNVLT
GRELWAFPIM LKLVLIEKTR YICEKIAKAQ EQRRKVEEIL KAFDENIENT TQLITAIDNE
LKGKYEVNSA FIEYLAYKFR KMGRAYTHVL RYIDERLGES GTTVDDITQK EHNEQTASKA
SIGNCIMSLK FISTVNWVDI FEQLSKVEQI LREDPSGFYS LMDFDSRNYY RNRVEKLALK
YKVSESHVAK KAVELARNAV ENGNLTDKRL THVGYYLVGK GICELEKEIG YEKSFNQRMF
ERIKEHPACL YFGFIGFITV LLLLCVTKYS LFRAEKYGIA LSIIAVLATI IPATDIAVNF
VNWVLCKMIK PSLLPKLDFE NGIPEEYATM VVIPALLPDE NRARELIDNL EVYYLANREK
NLYFSIAGDF KDAPNKEMAG DKKIIETALG RIAELNEKYG RKNEGGEKDS RDIFYYFHRH
RQFNEKQNKW MGWERKRGAL LEFNEVLLGS RTTSYSIMSH DVSQLPKIKY VITLDADTIL
PLGAARKLIG TMAHPLHRPV IDEQKGIVTE GYGLLQPRIG FDIESVNKSL FSRIFAGEEG
IDPYASAISD VYQDLFGEGI FTGKGIYDLE VFQKLLKDAI PDNTVLSHDL LEGSYVRAGL
VTDIEFIDGY PSKLNSYAMR LHRWVRGDWQ LLPWLRGKTK DRKGNVIKNP LSLISRWKIL
DNLRRSIVAP SITLLIALGF SILPGSSLFW LGASLLTIYF PLITGTIDYI ASKPLGAITS
KRYKPAICGL KASFLQMTLQ FVFLPYNAWL MVHAAVLSLV RVLFTKRNML EWVTALDAER
GLKNSLKGYV IKMKAAAFQA LVVVVLAFAF KTGFSAAVSV LPFAVWVSSP FIAYWISKET
VYKTETLSDE ENLELRRIAR KTWRYYEEFV NRRNNYLAPD NFQEDPPNGI AYRTSPTNIG
LGMLAALTAR DLGYIGTLEL CDIISRTMST VEKMEKWNGH LYNWYDTRTL ETLRPRYIST
VDSGNFVCYL ITLKEGLAEY LNRPLEDRAF IDGIRDTASL IADENENPYK DISCLKECIV
ISEGRSYVDI PQMMKALTKL SEDGNKMKDS KDVWKAKVDS MIEMLKIELY TYMPWCDMID
ELTEAFEKSE ADIKEAFHGI IRKLNSDYSL KAMPVVYRET IKQIEKLRKK LKDGQQKNIE
GLDRLKEALE GATESADKLV KRYVDLINRI CRIADETEFV HLYDKKKQLF SIGYNIEENS
LTNSYYDLLA SEARQTSYIA IARGEVDQQH WFKLGRTLTQ IDRYKGMVSW SGTMFEYFMP
LLIMKSHKNT LLDETYSFVV RSQKKYGKQR NLPWGISESG FYSFDINLDY QYKAFGVPWL
GLKRGLVEDM VVSPYATMLV LPLVPRDAMD NLKRLIAEGA YGHYGMYEAI DYTPERIPLG
EKKGIVKSYM AHHQGMSILA LNNYFNDNIM QKRFHADPVV DAAKLLLMEK VPSNIVFTKE
NKEKILPFKD VVYDEKDFLR ECGMPDPVLP KAHILSNGNY SVMVTDRGTG YSRWKNLDVT
RWREDVTLDN YGMFFYIRDV QNDEVWTSTF APGRKKPDEY KVEFTSGKAK YYRKDGDIDT
LTEIVVCAGE NAEIRSITLA NHGQESCVME ITSYFEPVLS HHGADIAHPA FGNLFIRTEF
LAEHNCLIAG RRPRSEKEKP VWIMNTVVLE GEGVGSLQYE TDRMQFIGRG RNVSEPVALE
PHRPLTNSVG AVLDPVMSFR QIVRVEPGKS VKISFVTAVA NSREDVVEMA TKFKSPQVIK
DELGMAVTKS RVEARYLNLD TEEIELYQDM ISHILFISPL QRQKQKWVMN NKKGQPGLWP
YGISGDIPIV LVMLDKTDDI DIVREVLKAH EYWRLKKLAV DLVILNEEEN SYTNPVNSLL
MDIIAESHAH DLINKPGGVF ILKKSNMPPE DIDLICSVSR IILKGDAGDL KDQVKYARSI
ALAEFKQFEK KPASYDSKLA KDLELNFYNG LGGFGKDGKE YVIFLENGQN TPLPWINVIS
NQRFGFIVTE SGSGYTWFEN SRENKLTPWS NDPVSDTPGE ILYVMDEHAG DVWSVTPLPV
REKEPYMIRH GFGYTVFSHA SHGIEQEMVQ FVPVDDSVKI SILKLKNQSQ ENRGLSLTYY
IRPVLGVSDQ FTAMHINTKA DNGMIVIKNN YNDEFPGRVA FIDSSLKVNS LTCDRKEFFG
AGDIANPEGI KRTSLSGTTG AGFDPCAAIS VSVNLKPDEE KEIIFLLGAG RDEEEARQLS
AKYKKLEEAK KALGEVKKFW ELKLGALQFE TPNTAMDILL NGWLLYQVVS CRLWTRSGFY
QSGGAYGFRD QLQDSISLTH IWPEATRNQI LLHSRHQFIE GDVQHWWHEE KYKGTRTKFS
DDLLWMPYAT IEYIRITGDY DILYEETPFL EDEPLKEFED EAYRVPRISH TVSTLYDHCI
RAINRSLKFG EHGIPLIGSG DWNDGMNTVG NKGKGESVWL GWFLYSILKN FAPLCERMGD
NELAKRYLDT ADRIVENIEK NAWDGKWYRR AYFDNGVPLG SIQNSECQID SLAQSWAVIS
EGGDKERIAE AMSALENYLV KRDEGLIKLL TPPFDEGDLE PGYIKSYVPG VRENGGQYTH
AAAWVVMAFA KMGDGEKAME LFDLLNPINH SRTHIEYSRY KVEPYVMAAD VYSVPPHTGR
GGWTWYTGSA GWIYRVGFEY ILGFKKRGET LEIDPCIPGK WTDFTIKYRY YDTDYIIEVK
NPEGVNTGVK KVIVDGKVCD DGKVQLVNDK DTHKVEVYMG KK