Gene Cthe_0918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0918 
Symbol 
ID4811211 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1101624 
End bp1105253 
Gene Length3630 bp 
Protein Length1209 aa 
Translation table11 
GC content36% 
IMG OID640106337 
Productcellulosome enzyme, dockerin type I 
Protein accessionYP_001037345 
Protein GI125973435 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.447129 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGAGCA TTATACTTAA AGCATTTGAC GAGTCAACAG AAATTAATGG TGATTCCCTT 
GACTTGGGTG AAATTCAATT GACACCTTAC AAGGGCAGGT TGACCTTGGA ATTATTAAAG
AAACCGGCAC ATACATCTCA GGAAGAGCCT GGATTGGAGC AAGAAAACTT ATACAATTAT
ACTATTAGTA TATACAATAA AACAAAGCAG GCAAAAATCA AAAACTTTAA TATCAAACCA
CCTTATATTT TCTTAAATGA TACTAACATA GAAACAGGTG ACGTTATTGA AGTAACGTTT
ACACACAAGA AGGACTATAC TTTGCCGGCT ACGATCGAGC TTACTTTAGA CAGATTAATG
AATGCGAGCG GGCAAGCTGT TGCAAAAGAG AACGGAAAAG GAATGGCATC AATACAGTCG
GGGAACCAAA GATGTGAATT AATGGTTTAT AACTCTGACA AAAAGTTCAT AGAAAGCTGT
TCGATGAACA GAGAGGGTAA AATAATCTCC AATCCTTTGA AAGAAGGAGT GTATACTTTT
GTATTCATCT CCAGCAACAA CTACTTGTGG AGACTTAATA CACTGGAGCA ATTTGAGGAA
ATAGGTCTGA AAGAAGGTAT ACATTTCGTA AAGAAGGAAG TAGAAATAAA AAATGGGGAA
ATTAAAGATT TGGGACTGAT TGTTGTTCCT GAACTAAACG AAAAAGAGTT GTTCTTCACA
GATCCCGATG TTGCAAGCTA TGCTGTGGTT TCGGAAAATA ATCGTATAGG AAAAGATATA
GTGCTTAGAG CCAAATATTC ATTCAGAAAA CCTATTGAAT CACAGACAGT TAAATTACTG
TTCAGGGTTC CGGATCAAAC ATTTTATGTA AAAGATACAT TAACTATTGA CGGAGAACTT
GTTGATTCAG TAAATGTATC CGGTAGTTTT ATTGAAGTAA CAACTACAAA AACAGAAGGT
GTTGTACGCT TTAATATCAG ACCTACTGAA GCGGTTGTTT GTTACAGTAA TGCTTATGTG
GAATTTAAAG CGGAAGGAAG GACTGTAAGA GAACCTCTGG GTGTCGTTAA CGAAGTTGTT
CCTTACCTCA CAATAGGAGC TCCTAATGTT ACAAGCCTGG AAAAAGTTTA TGTAACCGGT
CAGACACTTC CCAACCTGGA AGTGAAGATA TATGATGAGG GAAGCTTGGT TGGAGTTACC
AAAGCTAATT CGACCGGTGC TTGGAATTTC CAAGTTTCGC TGGGCAAAGA CAAATCTTAT
TCAGTTCATA ACTTGTCAGC AAGGATTGAA TACAATGACC AGGTTATTGA ATCTGAGACA
CTTGAAGTTA TTTATAAAAA GTCCGTTCCG GAGATTGAGA AAGTAACCAT GATCCACTCA
AATCAGTCGC TGATATTGTA TAAAAATGGT GAAATAGTTC CTAAAGGAGT ATATACTTAT
GTTCCTTCTC AATCCTATAC TTTTATTGTG GACTTTGTTG GTGATTCAGC AGAAGAAATT
GAAAATTTGT GTATTGTAAG CAACCGCAAC GGCGAAGAAA GAAAACTGGA TGCAAAATAT
GATCCACAAT CGAAGAGCTG GATTGCAACA GGAAGTTTTG GTTACAATCA TGTTCCGGGT
TCGTTTACTG TAGAGTACAA TTTAAAAAAG AAAAGATTAA TTAAAGAGGA TGGTGAATTG
AATTTAGAGG TTTTAGAGCA GATTTCAACT TTTGCAGATC AATTTCTTTA TGAATTAAAA
AAAGAAAATG TCATGGTAGG ATTACTTGAT GAAAATGAAA TAGGATTTAT GCTGAAAGAA
GATACAGGTG GCATTGCAAT TGTAACAATT GTAATAGGTG ATCAGCCAAT TGAACGAGAA
AAATACGAAA AAATGGGGTA TTTATTTAAA GAAATAGATG GAAAACTAGG AGCAGTATTA
TTAATAAAAG ACGAGACTGA AAATAAAATT TTTACATCAA AAGTAAGATT AGCCAGCAAT
AAGACAGAAG ATTCCGTAAA GTCTGAAGAC GATGATGAAT CTTCACAAAA AGAGACGCCT
TTAGTTACAA AAGATGAGCG TATTAGTATG GCTGCTACAA TAATTGGGGA ATTAGGTAAT
ATTGCTGGTA GTGGAAAGGT TGTGGAAGCA ATAGATTTTA TTTCCGGAAT TGGTACAGCG
GTGAATACTA TTGCATTTAC ACCGGCAAAC GCAGTTACAG ATATGGGTTA TGTTCAGAAA
AAATTGCAAG AATCTGAGAA CATTTCAAGC GAAGATAAAG AAAGGCTCCG CGATGATTTG
TTTTTAGCAT CTGGTGTGTA TAATTTTACT GTGAATGCAA ACTTTATTGC ATCACTTATG
GATGGACATA CCAGAAATGG CACGATGCAT AAAATAATAG ACATAGTTAT TGATAAATAT
GATCAGTACA GTCTTGGTGA TAAGTTATGG GAAGATATAT TTGGAAAAAA TGAAGATGAT
AGAGAATCAG GCAAAAGCAC AAAATCTGGA AAAGGATCAT CTATTAAGCT ACCATTACAA
GAACAAAAAA CTCTGGTAGA TCCAAGTGGA TATGTATACG AAACTGTACC TGACAACAGA
ATAACCGATG CTACAGTCAC AGTATACTAT AAAGATGAAA ACGGAGAAAT GATATTATGG
AATGCGGAAG AATTCGGCCA GAAGAATCCA TTGCTGACAG ACGAGAATGG TTTTTATGCC
TGGGATGTTC CTGAAGGAAT GTGGCAGGTA AAGGTCGAAA AGGAAGGATA TGAAACAGCA
TATAGTGAGA TTTTACCTGT GCCTCCTGTG CAGACCAATG TTAATATTCC TTTGGTTTCG
TATGAACCAC CGAAGGTTGG ACACATTTAT GCATATCCGG AATATATAGA AATTGCATTT
AGCAAATATG TAAAGCACGC AACGCTGGAT TCAGAGTCTA TTCAGCTGAA ACAGGGAGAA
AATAAGGTAA ACATTAAAAT AGTATATGAA GATGAAGAGG GAAATTACTC TAAGAAAATT
AAACTAATTC CTGCAGAAAA GACTTCTTTT GAAGGAAAAT ACACTCTTAA TATTTCTAAG
AGCATAACAA GCTATGCTGG AGTTGCGATG CAAAAAGCTG AAATCCGCGA TATAGAGATT
GTAGCTGAAC CAAAATCAAT CGAAATATTA GATAAAGTAG AAATTGAATT AAGAAAAACT
GTTGCAATAG AGATACGTGT ACTGCCGGAA GAAGCAGCAA AAGGCAAAAA GATAATCGTA
ACATCCGGAA TGGAAGAAAT AGTATCTGCA GAGGATGTTC TACTGGATGA AAGGGGACGC
GGAAAACTCA AACTGAAAGG TAATCTTCCC GGAACGGTAG ATATCGATTT GCGTTTGGAA
GGTAGCGCTG TTGAAAAGAA AATACAAGCG ACAGTTAAAT TTCCGGAAGA TAATGGAGAG
ATTATTGAGC CGCCTGTTGT GCTTAATGGT GACTTAAACA GAAATGGAAT TGTTAACGAC
GAAGATTATA TACTGCTGAA GAATTACTTG TTAAGGGGGA ATAAATTAGT AATAGATTTG
AATGTGGCTG ATGTCAATAA AGACGGAAAA GTTAATTCTA CTGACTGTTT ATTCCTTAAG
AAGTATATTT TGGGACTTAT AACTATATAG
 
Protein sequence
MGSIILKAFD ESTEINGDSL DLGEIQLTPY KGRLTLELLK KPAHTSQEEP GLEQENLYNY 
TISIYNKTKQ AKIKNFNIKP PYIFLNDTNI ETGDVIEVTF THKKDYTLPA TIELTLDRLM
NASGQAVAKE NGKGMASIQS GNQRCELMVY NSDKKFIESC SMNREGKIIS NPLKEGVYTF
VFISSNNYLW RLNTLEQFEE IGLKEGIHFV KKEVEIKNGE IKDLGLIVVP ELNEKELFFT
DPDVASYAVV SENNRIGKDI VLRAKYSFRK PIESQTVKLL FRVPDQTFYV KDTLTIDGEL
VDSVNVSGSF IEVTTTKTEG VVRFNIRPTE AVVCYSNAYV EFKAEGRTVR EPLGVVNEVV
PYLTIGAPNV TSLEKVYVTG QTLPNLEVKI YDEGSLVGVT KANSTGAWNF QVSLGKDKSY
SVHNLSARIE YNDQVIESET LEVIYKKSVP EIEKVTMIHS NQSLILYKNG EIVPKGVYTY
VPSQSYTFIV DFVGDSAEEI ENLCIVSNRN GEERKLDAKY DPQSKSWIAT GSFGYNHVPG
SFTVEYNLKK KRLIKEDGEL NLEVLEQIST FADQFLYELK KENVMVGLLD ENEIGFMLKE
DTGGIAIVTI VIGDQPIERE KYEKMGYLFK EIDGKLGAVL LIKDETENKI FTSKVRLASN
KTEDSVKSED DDESSQKETP LVTKDERISM AATIIGELGN IAGSGKVVEA IDFISGIGTA
VNTIAFTPAN AVTDMGYVQK KLQESENISS EDKERLRDDL FLASGVYNFT VNANFIASLM
DGHTRNGTMH KIIDIVIDKY DQYSLGDKLW EDIFGKNEDD RESGKSTKSG KGSSIKLPLQ
EQKTLVDPSG YVYETVPDNR ITDATVTVYY KDENGEMILW NAEEFGQKNP LLTDENGFYA
WDVPEGMWQV KVEKEGYETA YSEILPVPPV QTNVNIPLVS YEPPKVGHIY AYPEYIEIAF
SKYVKHATLD SESIQLKQGE NKVNIKIVYE DEEGNYSKKI KLIPAEKTSF EGKYTLNISK
SITSYAGVAM QKAEIRDIEI VAEPKSIEIL DKVEIELRKT VAIEIRVLPE EAAKGKKIIV
TSGMEEIVSA EDVLLDERGR GKLKLKGNLP GTVDIDLRLE GSAVEKKIQA TVKFPEDNGE
IIEPPVVLNG DLNRNGIVND EDYILLKNYL LRGNKLVIDL NVADVNKDGK VNSTDCLFLK
KYILGLITI