Gene Cthe_1806 details


Gene Information

Locus tag: Cthe_1806
Symbol:
ID: 4809790
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2132940
End bp: 2139473
Gene Length: 6534 bp
Protein Length: 2177 aa
Translation table: 11
GC content: 41%
IMG OID: 640107220
Product: cellulosome enzyme, dockerin type I
Protein accession: YP_001038220
Protein GI: 125974310
COG category:
COG ID:
TIGRFAM ID:
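
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions rather than statements from this record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 2132940
END_BP = 2139473
GENE_LENGTH_BP = 6534
PROTEIN_LENGTH_AA = 2177


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the reported gene length (6534 bp) and protein length (2177 aa) are consistent with the start and end positions.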


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGA AAAGTTTTAT TTCTGCTGCC CTAATTGTGT TTTTGTTCTT GTCTTTTTGT 
CTTGAGACAC CTAATATAGG TGCGGTAAGT GCACAAAATG CTTTTGAAGA TCCATTTGTA
CATCTCATAA ACTCTCACCT CGAAAACTGT GAAACTAGCC ATGAAGAAAT ATGTGATTCC
ACTGGTAACG AAGGCTGTGA AGTCAGTCAT GAAGAAATAT GCGATTGCAC CGGTAACGAA
AGCTGCGAAG CCGGTCATGA AGAAATAAGT GATTCTGACG AACACAAAGC TTGCAAAGCC
GGACATGAAG CATGCAGTTG TAAATGCACT TCCGACAGTG ATAAAGATAA TACCATATCC
AACTTGGATA TGAATGATAT TGAACCTGTA GGAAATGCAT TATTTGTTGA AAAAAATGAA
GAAAATTTGA GCAATATTGT AACATACGCT TCTTTAAGCA GTGTAGTTTT GGCAGCATCA
TGCTCCCATC AATTTAACGG TTCTTATACA GTTACTAAAG AACCAACATG TACAACGACC
GGTACAAAAG TAGGAAAATG TACAAAATGC GGAGCAATTG TTTCTACAGT TACGATACCT
GCATTGGGAC ATAGTTATGG ATCATGGACG GTAACTAAAG CAGCAACCTG TACTACAGAC
GGAAGCCAAA AAAGAACCTG CTCAAGATGT AAAAATGTTG AAACCCAAAC GATAAAGGCA
ACGGGACATA CATTCAATGG TTCTTATACG GTTACCAAAG AAGCAACATG TACAACGACC
GGTACAAAGG TAGGAAAATG TACAAAATGC GGGGCGACAG TATCTACAGT TACGATACCT
GCATTGGGAC ATAGTTATGG ATCGTGGACG GTAACTAAAG CAGCAACCTG TACTACAGAC
GGAAGCCAAA AAAGAACCTG CTCAAGATGT AAAAATGTTG AAACCCAAAC GATAAAGGCA
ACAGGACATA CATTCAATGG TTCTTATACG GTTACCAAAG AAGCAACATG TACAACAGCC
GGTACAAAGG TAGGAAAATG TACAAAATGC GGGACGACAG TATCCACAGT AGAGATACCT
GCACTGGGAC ATAGTTATGG GTCGTGGACG GTAACTAAAG CGGCGACCTG TACTACTGCC
GGAACAGAAA AGAGGACATG TACCAGAAGC GGATGTACTG CAAGTGAGAC GCGTTCGATA
TCTGCAACAG GACATACATT CAATGGTTCT TATACGGTTA CCAAAGAAGC AACATGTACA
ACAGCCGGTA CAAAGGTAGG AAAATGTACA AAATGCGGGA CGACAGTATC CACAGTAGAG
ATACCTGCAT TGGGACATAG TTATGGGTCG TGGACGGTAA CTAAAGCGGC GACTTGTACT
ACTGCCGGAA CAGAAAAGAG GACATGTACC AGAAGCGGAT GTACTGCAAG TGAGACGCGT
TCGATATCTG CAACAGGGCA TACATTCAAC GGGTCTTATA CGACAATAAA GGAGCCGACA
TGTACAACGA CTGGTACAAA GGTAGGAAAA TGTACAAAAT GTGGGGAAGT TGTTTCGTCA
GTAGAGATAA AAGAGTTAGG GCATGACTTC GGTTCATGGA AAACAATAAA GCAAGCAACC
TGTACGGAAA AAGGGCTCAG AGAAGGAACA TGTTCAAGAT GTTCTGTGCG TAAAACGGAA
GAAATATCAC CAACAGGGCA CCAATTTAAC GGTTCATTTA CAACGGTAAA GGAACCGACA
TGTACAGAAG AAGGATTAAA GGTAGGAAAA TGTACAAAAT GCGGAGAGGT GGTAGCAACA
GCACCTATTC CGGCCCTTGG CGGTGATCAT CAGTTTAACG GTTCATTTAC CACAGTAAAA
GAACCGACAT GTACAGAGAA GGGACTGAAG GAAGGTCGTT GCACCAGGTG CGGAGCGACA
GTTACGACGG CACCGATACC TGAATTAGGG CATGACTTCG GTTCGTGGAA GACAATAAAG
GAAGCAACCT GTACGGAGAA AGGGCTCAGG GAAGGAACAT GTTCAAGATG TTCTGTGCGT
AAAACGGAAG AAATATCACC AACAGGACAC CAATTTAACG GTTCATTTAC AACGGTAAAG
AAACCGACAT GTACAGAAGA AGGAGTAAGA GAAGGAAGAT GTACAAAATG CGGAGAGGTA
GTAGCAACAG CACCTATTCC GGCCCTTGGC GGTGATCATC AGTTTAACGG TTCATTTACC
ACAGTAAAAG AACCGACATG TACAGAGAAA GGACTGAAGG AAGGTCGTTG CACCAGGTGC
GGAGCAACAG TTACGACGAC ACCGATACCT GAATTGGGGC ATGACTTCGG TTCATGGAAG
ACAATAAAGG AAGCAACTTG TACGGAGAAA GGGCTCAGGG AAGGAACATG TTCAAGATGT
TCTGTGCGTA AAACGGAAGA AATATCACCA ACAGGGCACC AATTTAACGG TTCATTTACA
ACGGTAAAGG AACCGACATG TACAGAAGAA GGATTAAAGG TAGGAAAATG TACAAAATGC
GGAGAAGAGG TAGCAACAGC ACCTATTCCA GCCCTTGGCG GCGCACACCA ATTTAACGGT
TCATTTACCA CAGTAAAAAA ACCGACATGT ACAGATCCGG GACTGGCGGA AGGTAGATGC
AGCAGATGTA AAACGGTGGT AGCAACTAAA GAAATTCCTC CCCTTGGAGG CTCGCATCAG
TTTAACGGTT CATATAAAAT AATAAAAGAA GCAACATGCA CGGAGGAGGG ATTGAAAGAA
GGACGTTGTA CCAAGTGTGG CACTGTTATT TCAACATCTG TTATACCGCC GCAACATAAA
TTTTCCAAAA TAACTATAAC TCCCGATAGA ATAACCTTGG GCGGAAGCAA TGCTACTGAA
AGTCCAATAT ATGTAAAGGT AGTATGTTCA AAATGTAACA CAACTGTTGA TGTAACAAGT
AAGGCAAAAT TTTCAAGCAG CAACAGCAAT GTGGCATCGG TGGTGAATGG ATATGTTAAA
AGCGGTACAC AATTTGGAAC TGCAACAATT ACCGCCGATT ATGATGGAAT GAAAGCGGTA
TGCAGTGTGC AAGTTAAACC GGCCGGCGGA GAAAAACTCC GGGCGTTGTG TATAACTCCA
AAAGAAGATA CAATAGCTGA GTTCAACAAA TGGGGTTCAC AAGTGAAAGT CATGGCAGTA
TACGATGATT ACGAGGTTGA TATTACAGAT TATGTCCTGT TTACGTCCGG AGACAGAAAT
ATTGCTTATG TAGACGAGTA TTACGGTAAT AAATATATAA AAAGCGGAAC AAAGAAAGGA
ACAACTTTGA TAACGGCATC CTATGAAGGG AAAAAAGATA CATGTACTGT AAAGGTGGAC
ATGGCATATG AAGTGGAAGA AATGCCTTTC AAGCTCGGAA AAGAAACAAA TATCCTGGTT
CCCGAAGACT TACCGGTTAT AGGCGGTACT GAGGTGGAGT TCAGCTTTGA TCATATTCCG
GGAATGGTGA AATACGGAGA AAAAGATTTT AGGATTGCAA TTGGGATAGA AGATAAGGAA
AGCCTTGATA AAAAATGGGA TAATTTTGTA AAGTATTTTG AAGATGCCAA GAATAGTAAG
GCTTCTGCGA AAGAATTAAG AAACAGAATG AAAAAGCTGG GCTCAAAGAA AGGTAGTTTT
AGCATTAAAG ATGATTGGGA ACCGGAAGTG GAAGCCTATG GCTACATTGA AGGTGTGTTT
ATTAACGGTA TACCTGTAGC AACCAGGGGT TCGTTCGCTG TAATAGTAGA GGCAGAATAC
AGAGGACAGA AACAATACTT TATAGGACCT GTTCCTGTTT ACTTTGAAAT AGCCGGCGGT
CTCGAGATGG AGCTTATTAG TGACATCCTC AGGGTTGATT TTGAAACCGG CAGAATTATG
CTTAATTCAG AGTTAAAAGT GACACCGCGT TTTGAACTCG GCGCCGGAGT CGGTTTAGTA
AAAGTATTGA CGGTTGGCGG TTCGGGAGAA GCAGAACTTG AATTTCTTAT TATTACCGGT
TCGGAAGATT ATTTAAAGGT TACATTGACG GGAAGCTTGA AACTAAAGGT TTCGTCATAT
TTCTTCAGTG CTGAAAAAGA AATTGCAAAA GGTACCTGGG TTCTTTATGA ATCAAAACCT
CGCTTAAGGA TGGCAAGTCC CAACATAAAT GCTCAATTCG ATTTGTACAA TGCAGATGAA
TATAAAATGA TGCCGAGGGA TTATATTGAA AGGCCGTCAG AATGGTTGGG AAATCGGCGG
TTAATGCGGT CAATGGCAAC AGGATTTACC AATAAGGAAC TTAAAGTTTT AGGAACCAAT
ATATATCCTG ATGCTCAGCC GCAACTGGTG AATTTGGAGG ATAAACAGGT TTTGGTGTGG
ATTGCCGATA ATCCCGACAG AACCTCTGCC AACAGAACTA TGCTGGTTTA TTCTGTGTAT
GATAAGAACA GCGGCATATG GAGTGAACCT GTGGCAGTAG ATGATGACGG TACGGCAGAT
TTCTATCCTC AACTAGCAGT GGACGGAAAC GACCTTTATG TTGTATGGCA AAACAGCAAC
AAAACCTTTG CAGAAGATGT AACATTGGAA GAAGTTGTGG CTTCGAGTGA GATAGCCGTC
AGCAAATTTG ACGAAGTAAC CGGCACATTT GGAGCAGCCG TTCGGTTGAC AGAAAATGAT
GTGGTTGATA TGCTGCCGCA AATAATCGTA TCAGATGGTA ATGCTTACAT AGTATGGTTT
ACAAACAATA AAAACGATGT ATTCGGTGTG GACGGTGAAA ACTCAATCTA CTACTGTGAA
CTTAAGGATA ATGAATGGTC AAGTCCCGAA CTTCTTAGTG AAGGCTTGAA TGCAATTGTA
TCCATAAGTG CCGGGTTTAT TGAAGGCTCA TTTGCAGTTG CATATGCTTT GGACGGGGAC
GATATGCTTG AAACAATAGA TGACATGGAA ATATACATTG TAAAGCCCGG TGATAAAGAT
ATAAGGATTA CCGATAATGA TACCATGGAT TCTGCGCCTG TATTTTCAAG TTTCAATGGA
GAGGGAGCTT TATATTGGTA CAATGAAGGT AACATATTAT ATATAACGCA AATAGGCGCT
GAGCCGAACC GAGTCTTTAG CGAATCGAAA CCCGGGCTGA AGGACAACTT TAAGGTTGTT
GAAGGAAGCA ATGGTGAAAC GGCCATAATC TGGACTAATA CAGCAAAGGG CTCAAGTACA
ATATTTACAG CAATTTATGA TGAAGACCGG GCAGCATGGA GTGATGTTGT AAAGCTATCG
GATGTTACAG GCCAAGTTCA ATCTCCGGAT GGTGTTTTTG ACGATGAAGG CAACTTTAGT
ATTGCATTTA GCAGATTATA CCTGCTGGAG GATGGAAATG AACAGGCTGA TTTATGCATT
ATCAAGGTTG TTCCGTCCTA CAATCTGTCC ATTGACAGCG TGAATTTCGA CCATAGTAAA
GTTATACCTG GCACACAGCT CGCAATTGAT GTTGAAGTAA CCAATAATGG CGAAATTGGT
GTTGAAGAAT TGGTTGTTGA TATTTTAGAC GGAGACGAAA TAATTAATTC TGAAGCAGTT
CAGATAAGCT TGAAACCAGG AGAGAGCAAA ACAGCGACTG TATTGATGAA TTTACCGGAC
ACCATAGCAA AAAAAGCATA TAGTATCAGG GTTTCTACCG TAGAGGGCGA AGAATACAAC
ACAGATGACA ATGTTAAACA GTTTACAATT GGCTATACCG ATATTTCTTT GCAGCTTGAG
ATATACAGTG AGGGAGACAT TGAATATGTC ACTGCCAATA TTATAAACTT AAGCCATGTG
CCTACAGGAG CAACTTTAAA GGTAACAAAG GGCAGTGAAG ACGGTGAAGT GATAGACACA
AAGGTTATTG ACAGCACAGA CGATATAGTG AAGTATGAAT ATCAGTTTGA CAAAAAGATA
CTTTGTGCGG ACAAAGAAAC GGAAATACTG TATTTTACAG TTGTAGCAGA TGAGGAAGAG
ATTTATACCA GTGACAACAC CAGAACCTTA GTGTTAAGCG TTAACAACGA TAGTACTGAT
AAAACCACTG TATCAGGTTA TATTTCGGTT GATTTTGATT ATCCGCCGGA ATCGGAATCA
AAAATAAAAT CAGGATTCAA TGTAAAAGTT GCAGGAACGG AATTGTCAAC GAAGACAGAC
GAGAAAGGTT ATTTTGAAAT ATCCGGCATA CCCGGCGATA TGAGGGAATT TACATTGGAA
ATAAGCAAGC GAAATTATCT TAAAAGGAAT GTCACGGTGA ACGGAACCGG AAAATTAGTG
GTTTCAACTG AAGACAATCC GCTCATATTA TGGGCCGGGG ATGTAGAGCG TAAAGGAGTG
CAAGACAATG CTATTAATAT GGTGGATGTG ATGGAAATAT CCAAAGTTTT TGGCACAAGA
GCCGGAGATG AAGAATATGT AGCTGAGTTG GACTTAAATA TGGACGGAGC AATCAATTTA
TTTGATATAG CTATAGTTAT CAGGCATTTT AACGCATTAC CTTCCCGCTA TTAA
 
Protein sequence
MKKKSFISAA LIVFLFLSFC LETPNIGAVS AQNAFEDPFV HLINSHLENC ETSHEEICDS 
TGNEGCEVSH EEICDCTGNE SCEAGHEEIS DSDEHKACKA GHEACSCKCT SDSDKDNTIS
NLDMNDIEPV GNALFVEKNE ENLSNIVTYA SLSSVVLAAS CSHQFNGSYT VTKEPTCTTT
GTKVGKCTKC GAIVSTVTIP ALGHSYGSWT VTKAATCTTD GSQKRTCSRC KNVETQTIKA
TGHTFNGSYT VTKEATCTTT GTKVGKCTKC GATVSTVTIP ALGHSYGSWT VTKAATCTTD
GSQKRTCSRC KNVETQTIKA TGHTFNGSYT VTKEATCTTA GTKVGKCTKC GTTVSTVEIP
ALGHSYGSWT VTKAATCTTA GTEKRTCTRS GCTASETRSI SATGHTFNGS YTVTKEATCT
TAGTKVGKCT KCGTTVSTVE IPALGHSYGS WTVTKAATCT TAGTEKRTCT RSGCTASETR
SISATGHTFN GSYTTIKEPT CTTTGTKVGK CTKCGEVVSS VEIKELGHDF GSWKTIKQAT
CTEKGLREGT CSRCSVRKTE EISPTGHQFN GSFTTVKEPT CTEEGLKVGK CTKCGEVVAT
APIPALGGDH QFNGSFTTVK EPTCTEKGLK EGRCTRCGAT VTTAPIPELG HDFGSWKTIK
EATCTEKGLR EGTCSRCSVR KTEEISPTGH QFNGSFTTVK KPTCTEEGVR EGRCTKCGEV
VATAPIPALG GDHQFNGSFT TVKEPTCTEK GLKEGRCTRC GATVTTTPIP ELGHDFGSWK
TIKEATCTEK GLREGTCSRC SVRKTEEISP TGHQFNGSFT TVKEPTCTEE GLKVGKCTKC
GEEVATAPIP ALGGAHQFNG SFTTVKKPTC TDPGLAEGRC SRCKTVVATK EIPPLGGSHQ
FNGSYKIIKE ATCTEEGLKE GRCTKCGTVI STSVIPPQHK FSKITITPDR ITLGGSNATE
SPIYVKVVCS KCNTTVDVTS KAKFSSSNSN VASVVNGYVK SGTQFGTATI TADYDGMKAV
CSVQVKPAGG EKLRALCITP KEDTIAEFNK WGSQVKVMAV YDDYEVDITD YVLFTSGDRN
IAYVDEYYGN KYIKSGTKKG TTLITASYEG KKDTCTVKVD MAYEVEEMPF KLGKETNILV
PEDLPVIGGT EVEFSFDHIP GMVKYGEKDF RIAIGIEDKE SLDKKWDNFV KYFEDAKNSK
ASAKELRNRM KKLGSKKGSF SIKDDWEPEV EAYGYIEGVF INGIPVATRG SFAVIVEAEY
RGQKQYFIGP VPVYFEIAGG LEMELISDIL RVDFETGRIM LNSELKVTPR FELGAGVGLV
KVLTVGGSGE AELEFLIITG SEDYLKVTLT GSLKLKVSSY FFSAEKEIAK GTWVLYESKP
RLRMASPNIN AQFDLYNADE YKMMPRDYIE RPSEWLGNRR LMRSMATGFT NKELKVLGTN
IYPDAQPQLV NLEDKQVLVW IADNPDRTSA NRTMLVYSVY DKNSGIWSEP VAVDDDGTAD
FYPQLAVDGN DLYVVWQNSN KTFAEDVTLE EVVASSEIAV SKFDEVTGTF GAAVRLTEND
VVDMLPQIIV SDGNAYIVWF TNNKNDVFGV DGENSIYYCE LKDNEWSSPE LLSEGLNAIV
SISAGFIEGS FAVAYALDGD DMLETIDDME IYIVKPGDKD IRITDNDTMD SAPVFSSFNG
EGALYWYNEG NILYITQIGA EPNRVFSESK PGLKDNFKVV EGSNGETAII WTNTAKGSST
IFTAIYDEDR AAWSDVVKLS DVTGQVQSPD GVFDDEGNFS IAFSRLYLLE DGNEQADLCI
IKVVPSYNLS IDSVNFDHSK VIPGTQLAID VEVTNNGEIG VEELVVDILD GDEIINSEAV
QISLKPGESK TATVLMNLPD TIAKKAYSIR VSTVEGEEYN TDDNVKQFTI GYTDISLQLE
IYSEGDIEYV TANIINLSHV PTGATLKVTK GSEDGEVIDT KVIDSTDDIV KYEYQFDKKI
LCADKETEIL YFTVVADEEE IYTSDNTRTL VLSVNNDSTD KTTVSGYISV DFDYPPESES
KIKSGFNVKV AGTELSTKTD EKGYFEISGI PGDMREFTLE ISKRNYLKRN VTVNGTGKLV
VSTEDNPLIL WAGDVERKGV QDNAINMVDV MEISKVFGTR AGDEEYVAEL DLNMDGAINL
FDIAIVIRHF NALPSRY
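
The gene and protein sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how the two listings relate, the code below strips that formatting and translates the first 60 nucleotides of the gene sequence. It uses the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code for elongation; the two differ only in which codons may act as start codons, and this gene starts with ATG. The helper names `clean` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and first 20 residues of the protein
# sequence, copied from the listings above.
GENE_START = "ATGAAAAAGA AAAGTTTTAT TTCTGCTGCC CTAATTGTGT TTTTGTTCTT GTCTTTTTGT"
PROTEIN_START = "MKKKSFISAA LIVFLFLSFC"

if __name__ == "__main__":
    print(translate(GENE_START))  # MKKKSFISAALIVFLFLSFC
    print(translate(GENE_START) == clean(PROTEIN_START))  # True
```

The same functions can be applied to the full gene sequence. Under the assumptions stated above, the result should have the 2177 residues reported in the Protein Length field.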