Gene Information |
Locus tag | Cthe_1806 |
Symbol | |
ID | 4809790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 2132940 |
End bp | 2139473 |
Gene Length | 6534 bp |
Protein Length | 2177 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 640107220 |
Product | cellulosome enzyme, dockerin type I |
Protein accession | YP_001038220 |
Protein GI | 125974310 |
COG category | |
COG ID | |
TIGRFAM ID | |
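The coordinate and length fields above are consistent with one another, and a short sketch can check that. It assumes the Start/End values are 1-based and inclusive (which the reported Gene Length implies) and that the CDS length counts the terminating stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a CDS whose last codon is a stop (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for Cthe_1806.
length = gene_length(2132940, 2139473)
print(length)                  # 6534
print(protein_length(length))  # 2177
```

Both results match the Gene Length (6534 bp) and Protein Length (2177 aa) fields in the record.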
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAGA AAAGTTTTAT TTCTGCTGCC CTAATTGTGT TTTTGTTCTT GTCTTTTTGT CTTGAGACAC CTAATATAGG TGCGGTAAGT GCACAAAATG CTTTTGAAGA TCCATTTGTA CATCTCATAA ACTCTCACCT CGAAAACTGT GAAACTAGCC ATGAAGAAAT ATGTGATTCC ACTGGTAACG AAGGCTGTGA AGTCAGTCAT GAAGAAATAT GCGATTGCAC CGGTAACGAA AGCTGCGAAG CCGGTCATGA AGAAATAAGT GATTCTGACG AACACAAAGC TTGCAAAGCC GGACATGAAG CATGCAGTTG TAAATGCACT TCCGACAGTG ATAAAGATAA TACCATATCC AACTTGGATA TGAATGATAT TGAACCTGTA GGAAATGCAT TATTTGTTGA AAAAAATGAA GAAAATTTGA GCAATATTGT AACATACGCT TCTTTAAGCA GTGTAGTTTT GGCAGCATCA TGCTCCCATC AATTTAACGG TTCTTATACA GTTACTAAAG AACCAACATG TACAACGACC GGTACAAAAG TAGGAAAATG TACAAAATGC GGAGCAATTG TTTCTACAGT TACGATACCT GCATTGGGAC ATAGTTATGG ATCATGGACG GTAACTAAAG CAGCAACCTG TACTACAGAC GGAAGCCAAA AAAGAACCTG CTCAAGATGT AAAAATGTTG AAACCCAAAC GATAAAGGCA ACGGGACATA CATTCAATGG TTCTTATACG GTTACCAAAG AAGCAACATG TACAACGACC GGTACAAAGG TAGGAAAATG TACAAAATGC GGGGCGACAG TATCTACAGT TACGATACCT GCATTGGGAC ATAGTTATGG ATCGTGGACG GTAACTAAAG CAGCAACCTG TACTACAGAC GGAAGCCAAA AAAGAACCTG CTCAAGATGT AAAAATGTTG AAACCCAAAC GATAAAGGCA ACAGGACATA CATTCAATGG TTCTTATACG GTTACCAAAG AAGCAACATG TACAACAGCC GGTACAAAGG TAGGAAAATG TACAAAATGC GGGACGACAG TATCCACAGT AGAGATACCT GCACTGGGAC ATAGTTATGG GTCGTGGACG GTAACTAAAG CGGCGACCTG TACTACTGCC GGAACAGAAA AGAGGACATG TACCAGAAGC GGATGTACTG CAAGTGAGAC GCGTTCGATA TCTGCAACAG GACATACATT CAATGGTTCT TATACGGTTA CCAAAGAAGC AACATGTACA ACAGCCGGTA CAAAGGTAGG AAAATGTACA AAATGCGGGA CGACAGTATC CACAGTAGAG ATACCTGCAT TGGGACATAG TTATGGGTCG TGGACGGTAA CTAAAGCGGC GACTTGTACT ACTGCCGGAA CAGAAAAGAG GACATGTACC AGAAGCGGAT GTACTGCAAG TGAGACGCGT TCGATATCTG CAACAGGGCA TACATTCAAC GGGTCTTATA CGACAATAAA GGAGCCGACA TGTACAACGA CTGGTACAAA GGTAGGAAAA TGTACAAAAT GTGGGGAAGT TGTTTCGTCA GTAGAGATAA AAGAGTTAGG GCATGACTTC GGTTCATGGA AAACAATAAA GCAAGCAACC TGTACGGAAA AAGGGCTCAG AGAAGGAACA TGTTCAAGAT GTTCTGTGCG TAAAACGGAA GAAATATCAC CAACAGGGCA CCAATTTAAC GGTTCATTTA CAACGGTAAA GGAACCGACA TGTACAGAAG AAGGATTAAA GGTAGGAAAA TGTACAAAAT GCGGAGAGGT GGTAGCAACA 
GCACCTATTC CGGCCCTTGG CGGTGATCAT CAGTTTAACG GTTCATTTAC CACAGTAAAA GAACCGACAT GTACAGAGAA GGGACTGAAG GAAGGTCGTT GCACCAGGTG CGGAGCGACA GTTACGACGG CACCGATACC TGAATTAGGG CATGACTTCG GTTCGTGGAA GACAATAAAG GAAGCAACCT GTACGGAGAA AGGGCTCAGG GAAGGAACAT GTTCAAGATG TTCTGTGCGT AAAACGGAAG AAATATCACC AACAGGACAC CAATTTAACG GTTCATTTAC AACGGTAAAG AAACCGACAT GTACAGAAGA AGGAGTAAGA GAAGGAAGAT GTACAAAATG CGGAGAGGTA GTAGCAACAG CACCTATTCC GGCCCTTGGC GGTGATCATC AGTTTAACGG TTCATTTACC ACAGTAAAAG AACCGACATG TACAGAGAAA GGACTGAAGG AAGGTCGTTG CACCAGGTGC GGAGCAACAG TTACGACGAC ACCGATACCT GAATTGGGGC ATGACTTCGG TTCATGGAAG ACAATAAAGG AAGCAACTTG TACGGAGAAA GGGCTCAGGG AAGGAACATG TTCAAGATGT TCTGTGCGTA AAACGGAAGA AATATCACCA ACAGGGCACC AATTTAACGG TTCATTTACA ACGGTAAAGG AACCGACATG TACAGAAGAA GGATTAAAGG TAGGAAAATG TACAAAATGC GGAGAAGAGG TAGCAACAGC ACCTATTCCA GCCCTTGGCG GCGCACACCA ATTTAACGGT TCATTTACCA CAGTAAAAAA ACCGACATGT ACAGATCCGG GACTGGCGGA AGGTAGATGC AGCAGATGTA AAACGGTGGT AGCAACTAAA GAAATTCCTC CCCTTGGAGG CTCGCATCAG TTTAACGGTT CATATAAAAT AATAAAAGAA GCAACATGCA CGGAGGAGGG ATTGAAAGAA GGACGTTGTA CCAAGTGTGG CACTGTTATT TCAACATCTG TTATACCGCC GCAACATAAA TTTTCCAAAA TAACTATAAC TCCCGATAGA ATAACCTTGG GCGGAAGCAA TGCTACTGAA AGTCCAATAT ATGTAAAGGT AGTATGTTCA AAATGTAACA CAACTGTTGA TGTAACAAGT AAGGCAAAAT TTTCAAGCAG CAACAGCAAT GTGGCATCGG TGGTGAATGG ATATGTTAAA AGCGGTACAC AATTTGGAAC TGCAACAATT ACCGCCGATT ATGATGGAAT GAAAGCGGTA TGCAGTGTGC AAGTTAAACC GGCCGGCGGA GAAAAACTCC GGGCGTTGTG TATAACTCCA AAAGAAGATA CAATAGCTGA GTTCAACAAA TGGGGTTCAC AAGTGAAAGT CATGGCAGTA TACGATGATT ACGAGGTTGA TATTACAGAT TATGTCCTGT TTACGTCCGG AGACAGAAAT ATTGCTTATG TAGACGAGTA TTACGGTAAT AAATATATAA AAAGCGGAAC AAAGAAAGGA ACAACTTTGA TAACGGCATC CTATGAAGGG AAAAAAGATA CATGTACTGT AAAGGTGGAC ATGGCATATG AAGTGGAAGA AATGCCTTTC AAGCTCGGAA AAGAAACAAA TATCCTGGTT CCCGAAGACT TACCGGTTAT AGGCGGTACT GAGGTGGAGT TCAGCTTTGA TCATATTCCG GGAATGGTGA AATACGGAGA AAAAGATTTT AGGATTGCAA TTGGGATAGA AGATAAGGAA AGCCTTGATA AAAAATGGGA TAATTTTGTA AAGTATTTTG AAGATGCCAA GAATAGTAAG GCTTCTGCGA 
AAGAATTAAG AAACAGAATG AAAAAGCTGG GCTCAAAGAA AGGTAGTTTT AGCATTAAAG ATGATTGGGA ACCGGAAGTG GAAGCCTATG GCTACATTGA AGGTGTGTTT ATTAACGGTA TACCTGTAGC AACCAGGGGT TCGTTCGCTG TAATAGTAGA GGCAGAATAC AGAGGACAGA AACAATACTT TATAGGACCT GTTCCTGTTT ACTTTGAAAT AGCCGGCGGT CTCGAGATGG AGCTTATTAG TGACATCCTC AGGGTTGATT TTGAAACCGG CAGAATTATG CTTAATTCAG AGTTAAAAGT GACACCGCGT TTTGAACTCG GCGCCGGAGT CGGTTTAGTA AAAGTATTGA CGGTTGGCGG TTCGGGAGAA GCAGAACTTG AATTTCTTAT TATTACCGGT TCGGAAGATT ATTTAAAGGT TACATTGACG GGAAGCTTGA AACTAAAGGT TTCGTCATAT TTCTTCAGTG CTGAAAAAGA AATTGCAAAA GGTACCTGGG TTCTTTATGA ATCAAAACCT CGCTTAAGGA TGGCAAGTCC CAACATAAAT GCTCAATTCG ATTTGTACAA TGCAGATGAA TATAAAATGA TGCCGAGGGA TTATATTGAA AGGCCGTCAG AATGGTTGGG AAATCGGCGG TTAATGCGGT CAATGGCAAC AGGATTTACC AATAAGGAAC TTAAAGTTTT AGGAACCAAT ATATATCCTG ATGCTCAGCC GCAACTGGTG AATTTGGAGG ATAAACAGGT TTTGGTGTGG ATTGCCGATA ATCCCGACAG AACCTCTGCC AACAGAACTA TGCTGGTTTA TTCTGTGTAT GATAAGAACA GCGGCATATG GAGTGAACCT GTGGCAGTAG ATGATGACGG TACGGCAGAT TTCTATCCTC AACTAGCAGT GGACGGAAAC GACCTTTATG TTGTATGGCA AAACAGCAAC AAAACCTTTG CAGAAGATGT AACATTGGAA GAAGTTGTGG CTTCGAGTGA GATAGCCGTC AGCAAATTTG ACGAAGTAAC CGGCACATTT GGAGCAGCCG TTCGGTTGAC AGAAAATGAT GTGGTTGATA TGCTGCCGCA AATAATCGTA TCAGATGGTA ATGCTTACAT AGTATGGTTT ACAAACAATA AAAACGATGT ATTCGGTGTG GACGGTGAAA ACTCAATCTA CTACTGTGAA CTTAAGGATA ATGAATGGTC AAGTCCCGAA CTTCTTAGTG AAGGCTTGAA TGCAATTGTA TCCATAAGTG CCGGGTTTAT TGAAGGCTCA TTTGCAGTTG CATATGCTTT GGACGGGGAC GATATGCTTG AAACAATAGA TGACATGGAA ATATACATTG TAAAGCCCGG TGATAAAGAT ATAAGGATTA CCGATAATGA TACCATGGAT TCTGCGCCTG TATTTTCAAG TTTCAATGGA GAGGGAGCTT TATATTGGTA CAATGAAGGT AACATATTAT ATATAACGCA AATAGGCGCT GAGCCGAACC GAGTCTTTAG CGAATCGAAA CCCGGGCTGA AGGACAACTT TAAGGTTGTT GAAGGAAGCA ATGGTGAAAC GGCCATAATC TGGACTAATA CAGCAAAGGG CTCAAGTACA ATATTTACAG CAATTTATGA TGAAGACCGG GCAGCATGGA GTGATGTTGT AAAGCTATCG GATGTTACAG GCCAAGTTCA ATCTCCGGAT GGTGTTTTTG ACGATGAAGG CAACTTTAGT ATTGCATTTA GCAGATTATA CCTGCTGGAG GATGGAAATG AACAGGCTGA TTTATGCATT ATCAAGGTTG TTCCGTCCTA 
CAATCTGTCC ATTGACAGCG TGAATTTCGA CCATAGTAAA GTTATACCTG GCACACAGCT CGCAATTGAT GTTGAAGTAA CCAATAATGG CGAAATTGGT GTTGAAGAAT TGGTTGTTGA TATTTTAGAC GGAGACGAAA TAATTAATTC TGAAGCAGTT CAGATAAGCT TGAAACCAGG AGAGAGCAAA ACAGCGACTG TATTGATGAA TTTACCGGAC ACCATAGCAA AAAAAGCATA TAGTATCAGG GTTTCTACCG TAGAGGGCGA AGAATACAAC ACAGATGACA ATGTTAAACA GTTTACAATT GGCTATACCG ATATTTCTTT GCAGCTTGAG ATATACAGTG AGGGAGACAT TGAATATGTC ACTGCCAATA TTATAAACTT AAGCCATGTG CCTACAGGAG CAACTTTAAA GGTAACAAAG GGCAGTGAAG ACGGTGAAGT GATAGACACA AAGGTTATTG ACAGCACAGA CGATATAGTG AAGTATGAAT ATCAGTTTGA CAAAAAGATA CTTTGTGCGG ACAAAGAAAC GGAAATACTG TATTTTACAG TTGTAGCAGA TGAGGAAGAG ATTTATACCA GTGACAACAC CAGAACCTTA GTGTTAAGCG TTAACAACGA TAGTACTGAT AAAACCACTG TATCAGGTTA TATTTCGGTT GATTTTGATT ATCCGCCGGA ATCGGAATCA AAAATAAAAT CAGGATTCAA TGTAAAAGTT GCAGGAACGG AATTGTCAAC GAAGACAGAC GAGAAAGGTT ATTTTGAAAT ATCCGGCATA CCCGGCGATA TGAGGGAATT TACATTGGAA ATAAGCAAGC GAAATTATCT TAAAAGGAAT GTCACGGTGA ACGGAACCGG AAAATTAGTG GTTTCAACTG AAGACAATCC GCTCATATTA TGGGCCGGGG ATGTAGAGCG TAAAGGAGTG CAAGACAATG CTATTAATAT GGTGGATGTG ATGGAAATAT CCAAAGTTTT TGGCACAAGA GCCGGAGATG AAGAATATGT AGCTGAGTTG GACTTAAATA TGGACGGAGC AATCAATTTA TTTGATATAG CTATAGTTAT CAGGCATTTT AACGCATTAC CTTCCCGCTA TTAA
|
Protein sequence | MKKKSFISAA LIVFLFLSFC LETPNIGAVS AQNAFEDPFV HLINSHLENC ETSHEEICDS TGNEGCEVSH EEICDCTGNE SCEAGHEEIS DSDEHKACKA GHEACSCKCT SDSDKDNTIS NLDMNDIEPV GNALFVEKNE ENLSNIVTYA SLSSVVLAAS CSHQFNGSYT VTKEPTCTTT GTKVGKCTKC GAIVSTVTIP ALGHSYGSWT VTKAATCTTD GSQKRTCSRC KNVETQTIKA TGHTFNGSYT VTKEATCTTT GTKVGKCTKC GATVSTVTIP ALGHSYGSWT VTKAATCTTD GSQKRTCSRC KNVETQTIKA TGHTFNGSYT VTKEATCTTA GTKVGKCTKC GTTVSTVEIP ALGHSYGSWT VTKAATCTTA GTEKRTCTRS GCTASETRSI SATGHTFNGS YTVTKEATCT TAGTKVGKCT KCGTTVSTVE IPALGHSYGS WTVTKAATCT TAGTEKRTCT RSGCTASETR SISATGHTFN GSYTTIKEPT CTTTGTKVGK CTKCGEVVSS VEIKELGHDF GSWKTIKQAT CTEKGLREGT CSRCSVRKTE EISPTGHQFN GSFTTVKEPT CTEEGLKVGK CTKCGEVVAT APIPALGGDH QFNGSFTTVK EPTCTEKGLK EGRCTRCGAT VTTAPIPELG HDFGSWKTIK EATCTEKGLR EGTCSRCSVR KTEEISPTGH QFNGSFTTVK KPTCTEEGVR EGRCTKCGEV VATAPIPALG GDHQFNGSFT TVKEPTCTEK GLKEGRCTRC GATVTTTPIP ELGHDFGSWK TIKEATCTEK GLREGTCSRC SVRKTEEISP TGHQFNGSFT TVKEPTCTEE GLKVGKCTKC GEEVATAPIP ALGGAHQFNG SFTTVKKPTC TDPGLAEGRC SRCKTVVATK EIPPLGGSHQ FNGSYKIIKE ATCTEEGLKE GRCTKCGTVI STSVIPPQHK FSKITITPDR ITLGGSNATE SPIYVKVVCS KCNTTVDVTS KAKFSSSNSN VASVVNGYVK SGTQFGTATI TADYDGMKAV CSVQVKPAGG EKLRALCITP KEDTIAEFNK WGSQVKVMAV YDDYEVDITD YVLFTSGDRN IAYVDEYYGN KYIKSGTKKG TTLITASYEG KKDTCTVKVD MAYEVEEMPF KLGKETNILV PEDLPVIGGT EVEFSFDHIP GMVKYGEKDF RIAIGIEDKE SLDKKWDNFV KYFEDAKNSK ASAKELRNRM KKLGSKKGSF SIKDDWEPEV EAYGYIEGVF INGIPVATRG SFAVIVEAEY RGQKQYFIGP VPVYFEIAGG LEMELISDIL RVDFETGRIM LNSELKVTPR FELGAGVGLV KVLTVGGSGE AELEFLIITG SEDYLKVTLT GSLKLKVSSY FFSAEKEIAK GTWVLYESKP RLRMASPNIN AQFDLYNADE YKMMPRDYIE RPSEWLGNRR LMRSMATGFT NKELKVLGTN IYPDAQPQLV NLEDKQVLVW IADNPDRTSA NRTMLVYSVY DKNSGIWSEP VAVDDDGTAD FYPQLAVDGN DLYVVWQNSN KTFAEDVTLE EVVASSEIAV SKFDEVTGTF GAAVRLTEND VVDMLPQIIV SDGNAYIVWF TNNKNDVFGV DGENSIYYCE LKDNEWSSPE LLSEGLNAIV SISAGFIEGS FAVAYALDGD DMLETIDDME IYIVKPGDKD IRITDNDTMD SAPVFSSFNG EGALYWYNEG NILYITQIGA EPNRVFSESK PGLKDNFKVV EGSNGETAII WTNTAKGSST IFTAIYDEDR AAWSDVVKLS DVTGQVQSPD GVFDDEGNFS IAFSRLYLLE DGNEQADLCI 
IKVVPSYNLS IDSVNFDHSK VIPGTQLAID VEVTNNGEIG VEELVVDILD GDEIINSEAV QISLKPGESK TATVLMNLPD TIAKKAYSIR VSTVEGEEYN TDDNVKQFTI GYTDISLQLE IYSEGDIEYV TANIINLSHV PTGATLKVTK GSEDGEVIDT KVIDSTDDIV KYEYQFDKKI LCADKETEIL YFTVVADEEE IYTSDNTRTL VLSVNNDSTD KTTVSGYISV DFDYPPESES KIKSGFNVKV AGTELSTKTD EKGYFEISGI PGDMREFTLE ISKRNYLKRN VTVNGTGKLV VSTEDNPLIL WAGDVERKGV QDNAINMVDV MEISKVFGTR AGDEEYVAEL DLNMDGAINL FDIAIVIRHF NALPSRY
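The relationship between the gene sequence and the protein sequence can be sketched as below. This is a minimal illustration, not the database's own pipeline. It assumes the gene sequence is given on the coding strand (it begins with ATG even though the gene lies on the minus strand) and that the 10-base blocks are separated only by whitespace. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code for elongation and differs mainly in its permitted start codons, so a standard codon table is used here. The helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding-strand DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGAAAAAGA AAAGTTTTAT TTCTGCTGCC"
print(translate(first_blocks))  # MKKKSFISAA
```

The ten residues produced from the first 30 bases match the start of the protein sequence above. Applying `gc_content` to the full gene sequence would be the way to compare against the reported GC content field.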