Gene Cthe_3052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3052 
Symbol 
ID4811124 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3581242 
End bp3590070 
Gene Length8829 bp 
Protein Length2942 aa 
Translation table11 
GC content38% 
IMG OID640108473 
ProductYD repeat-containing protein 
Protein accessionYP_001039441 
Protein GI125975531 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATGCAA AACTATACGC CTTTGGTGGA TGCTATCCTG AACCGCAAAA TCGGGATAAT 
ACCATATATC TTGACACGGT ATCGGAGTAT GACCCTGTAA AAAACCTCTG GACAGAGTAT
GCACCGGGTT CATCACCCAA CCCGAATAAA AAGATGCGTG TACCGAGATC AAATATGGCT
GTGGCTACAA CAGACAACAG AATCTATATA ATAGGTGGGT TTGACGGCTT TAACTACCTT
AATACCGTCG AGGTATACAA TCCATCGATA GGTGAATTTG ACAATTCCGT AGCTTTCCCT
GCAATTAGCG AGGCAAAAAG CGGTGCAGGA GCTGTGGTAA TTGGAAACAA ACTCTATGTC
ATCGGCGGTT ATAACGGCGC AAGATACAGT GATACGGTTG AAGTGTGTGA CTTATCTGCG
GACAAGCCTC AGTGGACTGT TAAACCAAAA ACTTCAAACT GGATGACTCC AAGGGCGGAA
TTCGGTATAG CAACTTATGG AGGAAAAATA TATGTATTTG GCGGACAAGG TGAAAGCGGA
TATCTATCAT CTATTCAAGA ATACGACCCG GCAACAAATA CATGGAGAAC TTTAAACACA
AAACTGACTG AAGCAAGGGC GGAACTTAAA GCATTGACAA TGAGTGGAAA AATATACATC
TTGGGAGGTA CTAACGGTAG AGCTTCAGAT ACCGTTGAAG AATTTGATCC CTATGAAAAA
ACCATAAAGA AATTACCCCG CCTTAGCAGA GCAAAGAGTT CCTTTGGGGC AGTTGTGGCG
TACAATAAAA TCTATATAGT TGGGGGAACA GATGGGTATA AAGTCCTGAG CGAAGTTCAC
GAATATTTCA CTCAGGTAAT ACCTGGTTTG ACATATCTTG ATGGCTTAAA TGGACTTGAG
GGCAATACCG GCATCTTTGA CCTCAATGGT GTTAACAACG TTTCCGGAAG TTATTCTACT
CAGGTAGAAG ACTTCGTTAT CGACAGCCCT GCTATAGATG TTACGGTTAC GAGGACATAC
AACTCAAACA ACATTAAAAT AGTTAATGGA ACCGTTCAGG AGGATGGTTG GAGCTTTAAT
TTCGAATCTT CAATCAGCGA AAAGAAAGAC GGTATTTATA AGAGAGTAAC GGCGTCAGCC
CTTAACTTAA GACAAGAGCC GCCTAATATA GAACAGCTTT CTTATTTGGA TCCAATGCAA
TGGCGTATAA TAAAGAGCCT TGCCTATGGA AGTATTGTGG AGTTTCAAGG TTATATTATA
GGAAACAAAT GGATAAAAGT AAGAACTATT GACGGTCTGC ATACCGGTTG GGTATGTGCC
GACTATGTTG AAGATATTGA CGGTATGGAG GTAACTTATC CGTCGGGAGC GAAAGTAGTG
TTTAGATCAA CCGGAAACGG CAAATATCTG CCGCCTCCGG GAATATATGA CGAACTGCGG
AGTTTGGGCA ACAATGTATT CAGGCTTACA ACCAAAGATG ACCAAATAAC TTATGAATAT
CATAATGGTA AACTTGTGAA AGTCATTGAC AGGTATGGAA ATACAATCAA ATATCATTAT
GAAGACGGAA AATTGAGAAA AATATACGAT TGCGACCCAA GTAATGAAAA TAACTCAATA
GGGAGAATAC TTACCATTAA CTACGAAGGC GATAAAGTCA AAAGCATACA GGATAGTACA
GGAAGGATAG TAACTTATAC ATACAATAAT AGCGGAATGC TGGAAACAGT AAAGGACCTT
AACGGCAATA TTACAAAATA TTTGTACTAC CCTGCTGATG ACGAGATTGA AGGACAGAGA
TACAGACTTA AAACGGTTTC GAAAATAAAT GATTCGAATC AAGAAGTTAA AATACTTACA
AATGTATATG ACGGCTATGG ACGTATGTAC AAACAATACG ATACTGAAGG CAGGCCCACA
TACTACCTTT ATACAGATTT GATTTGCGAT GAAAAGGGAG AACAGCCTAC GGATAAAAAT
GAGGTTGCAC GCACTGTTAT TGACAAAAGA GGCAAGATTT CAAAAGAGAT ATACAACATC
AACTTTGCAG GAAAGCCTCT AAAGACAATC GACGCAAAAG GTAGAGAGAC GACCTATAAA
TATGAAATTA AACATAGTAC CGGTATAATT GACATTACCA ACTATACATA TAATGATTTG
AAAAAAAGCC CGGCATATTA TGATATCCGA AATAAAAACC TGCCGGAGAT AGTTACGAAA
ACATTTAACG GAAGCACGAC TAAAGTGGAA AAAGACGAAA AAGGTAATAT TCTGATGATA
ACTTACCCGG ACAATTCCAC AGTAAAGTAC AGCTATTATG AAAACGGGGA CTTAATGTAT
GAAATTGACC AGATGAACCG CCAGACATTC TATATGTATG AAAACTACGA GTCATATCAG
GAAAACGGAG TTACGAAATA CAGAAGCAGG CTGACCAAAA TTGTAAAGCC TGCAGAGCCC
GGAGTGGTAT ATACAAGTAC AAATCCGCCG ACAGATAGTG TGCTCAAACC AAGTGATGCC
GTAACTCGTT ATAAATATGT CAGCGGACCA AATAAAATAA AGGGTTGTCT CGTAGAGAAG
ATAACCTATC CAAACGGTAC GTGGATTACA TATAAGTACT ATGACAATGG AAATTTGATG
GCAAAAAGCG ATAAAGCAGA CCTTAGTGCG TCTGGCAATA TAGATGCTGC TTATAAATAT
GAGTACGATA ATTGCTTTAG AGTAAGTTCG GAGACAACAC CGGTTGGATA TGTAACAGAA
TATCGTTACA ACAACACAAA CCAGGTAACC CATGTTATAA AGAAAGATTT GGACGGAAAG
ACAGTAAGTG TTCACAGGAC TTACTACGAT TATGACGGAA GAAAGAAAAA AGAAGTTAAT
CCTGCAATGT ACAACAGCAT GTATGATACA GGAAAAGACT ATATTGGAAC GGCATCTGCG
ATTTATGAGT ATGATGTATA CGGCAGAGTG ACCAAGGTTA CGAATAATAT CGACGGCATA
AGCTTTTCAA GCACTATTAC CGAATATGAA TATGATGCGG AAGGAAACCT CATTGGCAAG
AAGAACTACA GCAAGGCTTC AGCCGGAGCA AATGAGGTAT TGGAATCCTA TTACATTTAT
GAGTATGATT CTTTGAACAG GCTGGAAACT ACCTATTTCA AAGAAGATGA CAGTCCGGAT
ACAATTGCAG TTAAGCTCGA GGAGTATATC TATGAGGATA TATATGGCGA CAGTGTTCGC
AAATCAAAGA AGATATACAA ACAGTATTTG AATGACGGCG AATTTGCAAA AACCGAATAC
ATTTACGACT ATGCCGGAAG AGAAGTAGAG GTAATACATC CTGATGAGAC TAAGGTTATA
AAAACAGTTA TTACATATTA TCCCAACGGA AATGTAAAGA CTTTAAAAGA TCCAAGAGGA
AGCATCAGTT ATTATACTTA TAGGAATTAC GACGCAAGCA AAAATCTATA TTTTGATGAA
AAATATGTTC CTGTGGAAAG GCTTAACAAC AATACATATG AAATATCTTA TTCAAGAATT
AATTACGATA AAGCAGGAAG AAAAGTCGAA GAGATAAACT ATGTCGATTT GATTGTGGCT
ACACTGAACA GTGACGGTAC ATATAGTTTG GCATCGAATG CTCTTGCCGG CAAGAAATAC
AACAGTATAT CATATTCATA TTACAATAAT AACAAGGTAA AACAGGAAAG TAGCTCAAAC
GGCAAGAAAA TAGAGTATTT CTATGACAAT GACGGCAATC TTATTGAAGA AACAACGACC
TTTGACAAGG ATGCTTATGG ACGCGACAGG AAAAAATCGG TTCTGTACCT TGATTATAAT
GCCTATGGCA AACCGGGAAG AACTGCAGTC CTGATAAATA ATGAAGATAT TTCCAAGGAT
TTAGTGCTAA ACGGTGATAA AGTTGTACCG TTTGAAAAAT TGTTCCCTGT AAATGAACAA
GGTGTAACAG TTTATGGACC TTATGAGTAC CATAGCAATT GCAGTGCAAT TGTTACAACT
TACAGCGATT TTGATGCTCT TGGAAACGCA AGGAGAGTAG TATATCCCAA CAACTTTGTT
GAGAATTTCT CCTATGACAG CCTTGGAAGA GTAGTAAAGA AGAGTGTAAC AATTAAAGAT
ATGACAAATC CGGATACTAC AGGGCGTTAT AAGACGGTCG AGACTATTAC AGCTTACAAC
TGGGAAGGTA AGGTTGCAAA AGTTGAAGTT TTGGCCAAAT ATGAGGGCAT AGTTGACAAA
TTGAGCAGTA CAACTTATGA ATATGACGGC AGAGGTTTCA TGGTAAAATC AACTACGGAC
GTTACGTTCA ATAAATATGA TGCGGCTAAT AAAACCGTAA AACAGGAAAA GGAAAGCGTA
ACTGTCGCAT ATGAGTATGA TACTGCGGGA AGATTGATAG CCGAGGTGTC AGCGGAAAAC
TATGTTGAAG GTAAAAAGCC GTCGCAGACA GGAAACTACG TAAAATATAC TTACTATGAT
TCAGGAAGGT TAAAGACAAA AAGCTTTGAA GGTATTACCA AGAGATACAA TCCAGCGACC
AAATCTTTTG AAGATGTTGC ATCGAGTATT GTAATTGAAG CTTATGCTTA TGATGAAAAC
GGTAATGTCA TCAAAAAAGT GGACGGTGAA GCATATAACA AAGCAGGAGG TTCCATTGAT
AACGCTTATG GTATAGAGTA TACGTATAAT CTTGCGAATC AGTTGAAGAC CGAAAAGGAT
CCTGAATTTA CAAGTACCAG GGCCGACTGG AATTACAACA AGAAATACGT ATATGACGGA
ATTGGAAGAG TGGTGTACGA GTACACAGCT TATGGTAAGA ACTTGATGGT TACTAAATAT
GACTCATCCA CACCTCAAAT AGCTCCTTCC AAAGTATCAG CATCACTTGG GCTTGGCTAT
GCATTGACCA GGTATGCTTA CGATGATGCC AACAGAAAGC TTGATGTGTA TGTTTTGGAG
AACATTGACG ATCCTTTGGG AGATGAAGTC AGAATTAAAA CTGAAACTTA TGATTATACA
GGAAATGTAA AAAGTGTTAC CGATGCCAAA GGAAATCGTA CGGTTTACGA ATACAACGTT
TTGGGCAATG TAAGGAGCGT AACTTATCCT TCGGATGAAT CGATTGGATC AGACAAAGTT
GAATATAAGT ATGATTCAAT GGGTAATGTC AAGAGTGAAA AGAACAGCAT TGATGTAATT
AAGGAGTATG AGTATGATGA ACAGGGAAGG ATGCTCAGCA GCAAAATCTA TGGAACCGGT
GGAATAAACA AAGAGACAAT AACACGCTAC GGCTATGATT TACAAGGTAA TTTGGCATAT
GAAATTGCTC CAAACGGTAA TGTTACCAGA TATCAGTATG ACGAACTGGG AAGACTTATA
AAGACTGTAG TGAACAATAA GAAATTGATC GATCCTGACA CGAAGGAGAT AAGTACTTTA
AGAACATCAC AGCATGAAGA ATATACATGC TATGACAAAA ACGGAAATGT TGTAAAAGAA
GTGGAAATAG TAAAGTCTAT AGGCAAAGAT ACAAAATCGT CTGTGAGGGT GTATGCCTAT
AAATACGACA ATATGGGAAG ACTTATTGAA AAAGTTGACC CATCCGGTGC GGCAATAGAG
AAAATCAATT ATAATTTAAA CAGTGCCCAG ATACAATCCT TTGATGGCGA AGACCATCTG
AAGGAATTCG AGTATGATAA AAATGGTAGG CTCGTAACCA CAAGGGATTA CCATGACGGT
AATGTTGTGC ATGAGTGGAA GCAGACATAT GATGCTGCAG GCAACATTTT TACAAAAGAT
GACGGGCGTG GAAATGAAAC AGTATATATT TACAATTGGC GCGGTCAGTT AACTGAAGTC
AGACTGTTGG AATTTGACTC TGTAGACAGC AGCGGTCAGA AGAGATATAA AGATGTAAAG
ACTTTGACAC GCTATACCTA TGATGCTAAC GGCAATATGG AATCCCAAAG CTTCGAGGGA
AAACCGGCTG TTGAATATGA GTACAATGCC AGAAATCTGG TAAAGAGAAA GATTTATCCG
GGTACTGTCA ATAATGTTGA AACTTACGGC TATTATGAAA ATGGAGCGCT AAAACAGAAA
GTTGACAGAA ACAACGTAAT CACAACATAT AAGTATTATC CGCAGGGCTG GCTGGCCGAA
GAGAACGCTG TTGGAGCAGA GGATTATACA AAGAGAACCT ATACATACGA CAATAACGGA
AACTTGCTCG AGGCAACTGT TGAGACGTCA AGAGGAACAG GAAACAGTGT AATCAGAACC
TATGATGATT TAAACAGGGT AATAACGAAA GCGGTAACCG GTGTTACAGG CAAAGCTGTA
TACGTATACG ACATTGTAAC CAGCACTGGA ATGATTGCCG AGACAACAAT TGACCAGAAG
TCAAATAAGA CTACGAAGGT TTACGATGCA GTTGGAAGAC TGGAGTATGT AAGGGACGGA
AATGAAACTG CACCATATAT TGCGTATTAT ACTTATTATA AAAATGGTGC AAGAAAGAGT
GTTGTATATA AAAATGGCGC AAAAGAAGAA TATGAATACT ATAATGACGG GCTTTTGAAA
AAGCTTGTAA ATACAACGTC CGGATACACA GAAACCTATG TATATACTTA TGACGCGAGC
CATAACATCA CATCAAAGGT TGATGGTAAG GGCAAAACAG TATATACTTA TGATGCGCAG
AACAGGTTGA AGTCGGTAGA CGAGAAATAT ATCAACAGGG TAACTGTGTA TTCATATGAT
GCTGCGGGTA ACAGAGAAAA AGAAGAAATA CAGTTAAATG GTTCAATAGT TCAGACAAAT
ACCTATTATT ACAACGACTT GAACTGGCTT GAGCGTATTG TCATGGTTGG CCAGACCAAT
AAAACTGTTT ATTATGGTTA CGACAACAAC GGCAACCAGA CAAGCTGCTC GGATGCCGGA
ACAGTAAATG AATATGATGA GTTTAATCAA TTGATAAAGA CAACCATCAC CGGTGGTTCT
ACTGTTGAGA ATATATATAA TGCAGAAGGC TACAGGATAG GAAAGAAAGT TAACGGAACC
TTGACAACTT ATGTATATGA GTACGATAAG GTAGTACTTG AACTGGACGG TGGTGGAAAT
GCCAACCGAA ACGTATATGG ATTGAACCTG CTGATGCGTA CTGTAGGCAA AGATAGCTAC
TACTATATGT ACAATGGACA TGCTGACGTA ACAGCGCTGA TTAATGCTGC TACAGGAAAA
GTGGATGCTA CCTACTATTA CGATGCATTC GGTAATATAC TTGAGTCGAC GGGAAATGTG
AATAATAACA TAACTTATGC CGGGTACCAG TACGATAAAG AGACAGGATT GTACTATCTT
AATGCGAGAA TGTATGATCC TAAAATAGCA AGATTTTTGC AGGAGGATAC ATATACGGGA
ACGCCGGATG ACCCGCTTAG CTTGAATCTG TATGTGTACT GTGCAAATAA TCCTTTGATA
TATTATGACC CGACAGGAAA TTTGGAATCA AGAATATCTA TGCATTTGTA TAAACCTAAA
GGAGCAGAAG AGATTTTAGA TGACTTTATA AACAGAGCTA CGTCCGAAGG AAAAGTTGCG
CTTTATAAGA TTCAACAGGC TAGTTTTTTG ATAACACATG CGATTTATTA TTCACATTCT
CTTGAACAGA AAGCAAGATA CATACAGGAT GCTTTAATTA TTGCTGAACA GGCAAGATAT
GAGTATTCAA AGCATGACTT AATCTATGGT CGACTGAAGT ATGGTAAGGA ATATAGGAAA
GATGCGAAAC CTGTAGATGT TAAAGATGAT CATTATAGAT TTAGGGCTAT CTTTTATCTG
CTGGCAGTAG ATTTAGGCTA TGATTTATTT GACGTAGACG CGTATGATAA GGTAAATAAT
GAATTACCGC AGTTACTTAG AGATATTGGA TATAATTTTG ACTGGATGGA ATTTGAGATA
GCACATTTGA ACTTTAGGGA AGATTTCTGG CTGAATACAG CTATAAAGAA TGGCAATGGT
GAAATGGCCC AGTATGCACT TCTTTCGTTG GAGACAATAA ATGGTTTTCA CCATACATAT
AAAGCAAGTT TTGCCAGTTA CTTAAGTACG AATTTGTACT TATCGAATAT AAATAAATAT
TATTCTGTGC CATCGAGACC TATATATATG AAAGATGGTA CGGGTACTAA GTGGGAACAA
ATAGAAGCTT ATGGTTATAG CAAGCCAAGT AATACTGTTT CTTTAATGAG TAATACAGGG
AATGCTATTC CTAGTAAGGG AAAGAGTAAT TCTCTGCAAC AATTATTTAA GAAAAATGAC
ATTCCAAGTA CCTTGACTGA TGATGAGATA GCTGTGGACT TAAGGATACA GAAGGGATTT
ATTGGTGGAG GTGATAATGA GGGAACGCCT AAGACTACAG GTGCATATAA TCCTAACCTT
AGCATGGATA TTGGTAATGG TTTAGGTAAA CTAAATGGAA AATCAATAAA TGTAAGTGAA
AAAGGGTTAA ATTTAGTAAA AAAACATATA TCCCAGTTTG GAGATATTCC CGAAAACCAA
GCGATGATAA ACAGAATTGA GAGTGCATTA AAAAATGGGC AGCCTATTAC TGGCGCTGAT
GCGAGTTTTT ATATGCATGA AGTGGCAGAA GCTACAATGA TGCAGAAGGG TATACCTTAT
GAAGTAGCAC ATGAGGCAGT ATTACAGAAA TATAATGTTT CACCATATAG TGTACATCAC
CCTGAAGTAA TCCAACAGTT TTCAGAGTGG TTTAACCAAG GATTTAAAGA TTTCTGGGGG
TTAAAATAA
 
Protein sequence
MNAKLYAFGG CYPEPQNRDN TIYLDTVSEY DPVKNLWTEY APGSSPNPNK KMRVPRSNMA 
VATTDNRIYI IGGFDGFNYL NTVEVYNPSI GEFDNSVAFP AISEAKSGAG AVVIGNKLYV
IGGYNGARYS DTVEVCDLSA DKPQWTVKPK TSNWMTPRAE FGIATYGGKI YVFGGQGESG
YLSSIQEYDP ATNTWRTLNT KLTEARAELK ALTMSGKIYI LGGTNGRASD TVEEFDPYEK
TIKKLPRLSR AKSSFGAVVA YNKIYIVGGT DGYKVLSEVH EYFTQVIPGL TYLDGLNGLE
GNTGIFDLNG VNNVSGSYST QVEDFVIDSP AIDVTVTRTY NSNNIKIVNG TVQEDGWSFN
FESSISEKKD GIYKRVTASA LNLRQEPPNI EQLSYLDPMQ WRIIKSLAYG SIVEFQGYII
GNKWIKVRTI DGLHTGWVCA DYVEDIDGME VTYPSGAKVV FRSTGNGKYL PPPGIYDELR
SLGNNVFRLT TKDDQITYEY HNGKLVKVID RYGNTIKYHY EDGKLRKIYD CDPSNENNSI
GRILTINYEG DKVKSIQDST GRIVTYTYNN SGMLETVKDL NGNITKYLYY PADDEIEGQR
YRLKTVSKIN DSNQEVKILT NVYDGYGRMY KQYDTEGRPT YYLYTDLICD EKGEQPTDKN
EVARTVIDKR GKISKEIYNI NFAGKPLKTI DAKGRETTYK YEIKHSTGII DITNYTYNDL
KKSPAYYDIR NKNLPEIVTK TFNGSTTKVE KDEKGNILMI TYPDNSTVKY SYYENGDLMY
EIDQMNRQTF YMYENYESYQ ENGVTKYRSR LTKIVKPAEP GVVYTSTNPP TDSVLKPSDA
VTRYKYVSGP NKIKGCLVEK ITYPNGTWIT YKYYDNGNLM AKSDKADLSA SGNIDAAYKY
EYDNCFRVSS ETTPVGYVTE YRYNNTNQVT HVIKKDLDGK TVSVHRTYYD YDGRKKKEVN
PAMYNSMYDT GKDYIGTASA IYEYDVYGRV TKVTNNIDGI SFSSTITEYE YDAEGNLIGK
KNYSKASAGA NEVLESYYIY EYDSLNRLET TYFKEDDSPD TIAVKLEEYI YEDIYGDSVR
KSKKIYKQYL NDGEFAKTEY IYDYAGREVE VIHPDETKVI KTVITYYPNG NVKTLKDPRG
SISYYTYRNY DASKNLYFDE KYVPVERLNN NTYEISYSRI NYDKAGRKVE EINYVDLIVA
TLNSDGTYSL ASNALAGKKY NSISYSYYNN NKVKQESSSN GKKIEYFYDN DGNLIEETTT
FDKDAYGRDR KKSVLYLDYN AYGKPGRTAV LINNEDISKD LVLNGDKVVP FEKLFPVNEQ
GVTVYGPYEY HSNCSAIVTT YSDFDALGNA RRVVYPNNFV ENFSYDSLGR VVKKSVTIKD
MTNPDTTGRY KTVETITAYN WEGKVAKVEV LAKYEGIVDK LSSTTYEYDG RGFMVKSTTD
VTFNKYDAAN KTVKQEKESV TVAYEYDTAG RLIAEVSAEN YVEGKKPSQT GNYVKYTYYD
SGRLKTKSFE GITKRYNPAT KSFEDVASSI VIEAYAYDEN GNVIKKVDGE AYNKAGGSID
NAYGIEYTYN LANQLKTEKD PEFTSTRADW NYNKKYVYDG IGRVVYEYTA YGKNLMVTKY
DSSTPQIAPS KVSASLGLGY ALTRYAYDDA NRKLDVYVLE NIDDPLGDEV RIKTETYDYT
GNVKSVTDAK GNRTVYEYNV LGNVRSVTYP SDESIGSDKV EYKYDSMGNV KSEKNSIDVI
KEYEYDEQGR MLSSKIYGTG GINKETITRY GYDLQGNLAY EIAPNGNVTR YQYDELGRLI
KTVVNNKKLI DPDTKEISTL RTSQHEEYTC YDKNGNVVKE VEIVKSIGKD TKSSVRVYAY
KYDNMGRLIE KVDPSGAAIE KINYNLNSAQ IQSFDGEDHL KEFEYDKNGR LVTTRDYHDG
NVVHEWKQTY DAAGNIFTKD DGRGNETVYI YNWRGQLTEV RLLEFDSVDS SGQKRYKDVK
TLTRYTYDAN GNMESQSFEG KPAVEYEYNA RNLVKRKIYP GTVNNVETYG YYENGALKQK
VDRNNVITTY KYYPQGWLAE ENAVGAEDYT KRTYTYDNNG NLLEATVETS RGTGNSVIRT
YDDLNRVITK AVTGVTGKAV YVYDIVTSTG MIAETTIDQK SNKTTKVYDA VGRLEYVRDG
NETAPYIAYY TYYKNGARKS VVYKNGAKEE YEYYNDGLLK KLVNTTSGYT ETYVYTYDAS
HNITSKVDGK GKTVYTYDAQ NRLKSVDEKY INRVTVYSYD AAGNREKEEI QLNGSIVQTN
TYYYNDLNWL ERIVMVGQTN KTVYYGYDNN GNQTSCSDAG TVNEYDEFNQ LIKTTITGGS
TVENIYNAEG YRIGKKVNGT LTTYVYEYDK VVLELDGGGN ANRNVYGLNL LMRTVGKDSY
YYMYNGHADV TALINAATGK VDATYYYDAF GNILESTGNV NNNITYAGYQ YDKETGLYYL
NARMYDPKIA RFLQEDTYTG TPDDPLSLNL YVYCANNPLI YYDPTGNLES RISMHLYKPK
GAEEILDDFI NRATSEGKVA LYKIQQASFL ITHAIYYSHS LEQKARYIQD ALIIAEQARY
EYSKHDLIYG RLKYGKEYRK DAKPVDVKDD HYRFRAIFYL LAVDLGYDLF DVDAYDKVNN
ELPQLLRDIG YNFDWMEFEI AHLNFREDFW LNTAIKNGNG EMAQYALLSL ETINGFHHTY
KASFASYLST NLYLSNINKY YSVPSRPIYM KDGTGTKWEQ IEAYGYSKPS NTVSLMSNTG
NAIPSKGKSN SLQQLFKKND IPSTLTDDEI AVDLRIQKGF IGGGDNEGTP KTTGAYNPNL
SMDIGNGLGK LNGKSINVSE KGLNLVKKHI SQFGDIPENQ AMINRIESAL KNGQPITGAD
ASFYMHEVAE ATMMQKGIPY EVAHEAVLQK YNVSPYSVHH PEVIQQFSEW FNQGFKDFWG
LK