Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0258 |
Symbol | |
ID | 4808541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 317319 |
End bp | 318728 |
Gene Length | 1410 bp |
Protein Length | 469 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640105670 |
Product | cellulosome enzyme, dockerin type I |
Protein accession | YP_001036690 |
Protein GI | 125972780 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning [Z] Cytoskeleton |
COG ID | [COG5184] Alpha-tubulin suppressor and related RCC1 domain-containing proteins |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000567445 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAAAGT ATGGGTTAAA AAAGATTGTT TGGATGCTTG GCATTTTATG TTTTCTGGTT GTGTCTTTAA ATACCTCAAT TTTTGCAGCG GACGGTAAAA ATGTGGTTTT AGGCGATGTC AACGGAGATT CCAAAATAAA TGCAATTGAC GTTTTGCTTA TGAAAAAATA TATACTCAAA GTTATAAATG ATTTACCCTC CGACGGTGTG AAAGCAGCGG ATGTAAATGC TGACGGTCAA ATAAATTCGA TAGATTTTAC ATGGCTGAAA AAATATATGT TAAAAGCTGT TGAGAAATTT CCCGGAGAAG CAAGCAATAA TCCTGACGCT GTTATTCAGT TTGAATCCGG TTTTGCCCAT TCGGTGCTTT TGAAAAAAGA CGGGACCGTA TGGGTTTTGG GAAACAACGG CAAAGGACAG TTGGGACTTC CCGAAGTATC GGCCGTAAAT GAGCCTGTCA TGATAAACGG TCTTTCAGGA ATAAAATCGG TGGCTGCGGG AAGGGAGCAT ACACTGGCAT TGCAGGAAGA CGGTACTTTG TGGGCGTGGG GAAACAATTA CAGCCTTCAA CTCATAGAGT ATATGGAAAG GGATCCTGAT ACAAAAGAGA GATTTACAAG TATTCCGATT AAAGTTGAGA CTCATTCCGA TATCAAATAT GTGGCGGCTA AATTTTCACG TACCCTCATA GTAAAAAATG ACGGTACTGT TTGGCTGTAT TCGCTTCCTC CTATAAATAC CTCCTCGGAT GCCGAGTACA TGCCGTGGGA AATAAAAGGC TTTGGGGATA TAAAGATGGC GGATATTGGG ACAGGACATA TAGTTGCACT AAGAGAAGAC GGAACGGTGT GGACCTGGGG TGAAAATGTC TGGGGACAAT TGGGTAACGG TTGGCAGCAG CACCACAACA TTCATACTTA TATTTATTTT GAGCCCAATC AGGCAAAGAA TCTCTCGGAT ATTGTTTCGA TAGCCGCGGG AGATGCTCAT TCGGTGGCAT TGAAGAGTGA CGGAACTGTA TGGACTTGGG GCAGCAACTT CAACGGCGAG CTTGGAAACG GTACGACTAC TTATATTTTG GAGCCAAAAA AGGTTGAAGG TTTGGAGGAT ATAGTAGCCA TTGATGCCGG AATCGGCCAT ACGGTGGCGT TGAAGGCTGA CGGAACGGTA TGGGTGTGGG GTAAAAACAG CTATGGTCAG CTGGGAAACG GCACAACCAT GAGAAGCACT GTTCCGATAC AGGTAGAAGG ACTTGAAGGA ATTGTGGCAA TACAAGCAGG TATGGAGTGC ACGATAGCAT ATAAAAATGA CGGAACGGTA TGGGCATGGG GTAAAAATGA TTTTGGACAA TTAGGTGACG GAACTTTTGA AAACATATTA AGGCCCGTAA AAGTATTTGA AAGAAAATGA
|
Protein sequence | MRKYGLKKIV WMLGILCFLV VSLNTSIFAA DGKNVVLGDV NGDSKINAID VLLMKKYILK VINDLPSDGV KAADVNADGQ INSIDFTWLK KYMLKAVEKF PGEASNNPDA VIQFESGFAH SVLLKKDGTV WVLGNNGKGQ LGLPEVSAVN EPVMINGLSG IKSVAAGREH TLALQEDGTL WAWGNNYSLQ LIEYMERDPD TKERFTSIPI KVETHSDIKY VAAKFSRTLI VKNDGTVWLY SLPPINTSSD AEYMPWEIKG FGDIKMADIG TGHIVALRED GTVWTWGENV WGQLGNGWQQ HHNIHTYIYF EPNQAKNLSD IVSIAAGDAH SVALKSDGTV WTWGSNFNGE LGNGTTTYIL EPKKVEGLED IVAIDAGIGH TVALKADGTV WVWGKNSYGQ LGNGTTMRST VPIQVEGLEG IVAIQAGMEC TIAYKNDGTV WAWGKNDFGQ LGDGTFENIL RPVKVFERK
|
| |