Gene Cthe_0258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0258 
Symbol 
ID4808541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp317319 
End bp318728 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content43% 
IMG OID640105670 
Productcellulosome enzyme, dockerin type I 
Protein accessionYP_001036690 
Protein GI125972780 
COG category[D] Cell cycle control, cell division, chromosome partitioning
[Z] Cytoskeleton 
COG ID[COG5184] Alpha-tubulin suppressor and related RCC1 domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000567445 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAAGT ATGGGTTAAA AAAGATTGTT TGGATGCTTG GCATTTTATG TTTTCTGGTT 
GTGTCTTTAA ATACCTCAAT TTTTGCAGCG GACGGTAAAA ATGTGGTTTT AGGCGATGTC
AACGGAGATT CCAAAATAAA TGCAATTGAC GTTTTGCTTA TGAAAAAATA TATACTCAAA
GTTATAAATG ATTTACCCTC CGACGGTGTG AAAGCAGCGG ATGTAAATGC TGACGGTCAA
ATAAATTCGA TAGATTTTAC ATGGCTGAAA AAATATATGT TAAAAGCTGT TGAGAAATTT
CCCGGAGAAG CAAGCAATAA TCCTGACGCT GTTATTCAGT TTGAATCCGG TTTTGCCCAT
TCGGTGCTTT TGAAAAAAGA CGGGACCGTA TGGGTTTTGG GAAACAACGG CAAAGGACAG
TTGGGACTTC CCGAAGTATC GGCCGTAAAT GAGCCTGTCA TGATAAACGG TCTTTCAGGA
ATAAAATCGG TGGCTGCGGG AAGGGAGCAT ACACTGGCAT TGCAGGAAGA CGGTACTTTG
TGGGCGTGGG GAAACAATTA CAGCCTTCAA CTCATAGAGT ATATGGAAAG GGATCCTGAT
ACAAAAGAGA GATTTACAAG TATTCCGATT AAAGTTGAGA CTCATTCCGA TATCAAATAT
GTGGCGGCTA AATTTTCACG TACCCTCATA GTAAAAAATG ACGGTACTGT TTGGCTGTAT
TCGCTTCCTC CTATAAATAC CTCCTCGGAT GCCGAGTACA TGCCGTGGGA AATAAAAGGC
TTTGGGGATA TAAAGATGGC GGATATTGGG ACAGGACATA TAGTTGCACT AAGAGAAGAC
GGAACGGTGT GGACCTGGGG TGAAAATGTC TGGGGACAAT TGGGTAACGG TTGGCAGCAG
CACCACAACA TTCATACTTA TATTTATTTT GAGCCCAATC AGGCAAAGAA TCTCTCGGAT
ATTGTTTCGA TAGCCGCGGG AGATGCTCAT TCGGTGGCAT TGAAGAGTGA CGGAACTGTA
TGGACTTGGG GCAGCAACTT CAACGGCGAG CTTGGAAACG GTACGACTAC TTATATTTTG
GAGCCAAAAA AGGTTGAAGG TTTGGAGGAT ATAGTAGCCA TTGATGCCGG AATCGGCCAT
ACGGTGGCGT TGAAGGCTGA CGGAACGGTA TGGGTGTGGG GTAAAAACAG CTATGGTCAG
CTGGGAAACG GCACAACCAT GAGAAGCACT GTTCCGATAC AGGTAGAAGG ACTTGAAGGA
ATTGTGGCAA TACAAGCAGG TATGGAGTGC ACGATAGCAT ATAAAAATGA CGGAACGGTA
TGGGCATGGG GTAAAAATGA TTTTGGACAA TTAGGTGACG GAACTTTTGA AAACATATTA
AGGCCCGTAA AAGTATTTGA AAGAAAATGA
 
Protein sequence
MRKYGLKKIV WMLGILCFLV VSLNTSIFAA DGKNVVLGDV NGDSKINAID VLLMKKYILK 
VINDLPSDGV KAADVNADGQ INSIDFTWLK KYMLKAVEKF PGEASNNPDA VIQFESGFAH
SVLLKKDGTV WVLGNNGKGQ LGLPEVSAVN EPVMINGLSG IKSVAAGREH TLALQEDGTL
WAWGNNYSLQ LIEYMERDPD TKERFTSIPI KVETHSDIKY VAAKFSRTLI VKNDGTVWLY
SLPPINTSSD AEYMPWEIKG FGDIKMADIG TGHIVALRED GTVWTWGENV WGQLGNGWQQ
HHNIHTYIYF EPNQAKNLSD IVSIAAGDAH SVALKSDGTV WTWGSNFNGE LGNGTTTYIL
EPKKVEGLED IVAIDAGIGH TVALKADGTV WVWGKNSYGQ LGNGTTMRST VPIQVEGLEG
IVAIQAGMEC TIAYKNDGTV WAWGKNDFGQ LGDGTFENIL RPVKVFERK