Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2549 |
Symbol | |
ID | 4809305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3017939 |
End bp | 3018913 |
Gene Length | 975 bp |
Protein Length | 324 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640107964 |
Product | cellulosome enzyme, dockerin type I |
Protein accession | YP_001038943 |
Protein GI | 125975033 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 54 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTATC TGGATAATGA GCTGTTTGTT ACAGCGCTAA CAGATTCGCA AAGTTTATAC TGGCTGCCGG AAGGCCCAAA TGGCATGATA GCAGTAATTG ATACCAAAAC TGTGGAGTTA AAAGAAAAAA TAGATATCAA AATTAGGCCA TTTAATATTT TTGCAGGAAA GAATGGTTAC TTATATGTGA CTTCAAGAGA GCCTCAAAAG GCTTATTTTA ATAGCTATTC ACGTTCCACT AAAGAATTCA TGGATTCGGA ATTAGTAAAT AATGAATGCT TGTCTGAGTA CAATCCAACC CTAGACAGGA TTTATGCTAT TCCTATTGAT ATAATGCCAA TAGACTATAA AGTTTTAAAT GTTGATAACG GTAAGTTTGT GTCTTCTTAT AGTTCAACAT ACTATGACAG TTATCCTTTA GCAGAAAAAT TTAAGATATC TCCTGACGGC AAATACTTGT TTAATAGTTC TGGAGTTGTA TTTACATGCA ATGAGAATGT AAATGAAGAT ATGAAGTTTG CTTTTACTCT GGATAAAAAA TTTACAGATA TTGCATTTAA TATGGAAGAA AACAGGTTTT ATACTGCAGT TGGCGGCAAT CAAATTTACG TTTATAATTA TGAAGACTTT TCAGGAATTG ATACGTTGTC GTCAACTGGA GAGATATTGA AGCTGTTTTA TGTAGACGGT AAATTGTGTG CTTTATCTAG AAGCGCCAAT GGCAGACCAA TGTTTGAAGT TATTCAAAAA GTGAAAATCA AATATGGTGA TGTTAATAAA GATGGAAGAA TAAATTCAAC GGATATTATG TATTTGAAGG GATATCTGTT GCGAAACAGT GCTTTCAATT TAGACGAATA CGGCTTAATG GCGGCGGATG TGGACGGCAA TGGTTCAGTA AGCTCATTGG ATTTGACATA TCTGAAGAGG TATATATTAC GCAGGATTTC AGACTTCCCT GCAAACAAGA AATAA
|
Protein sequence | MAYLDNELFV TALTDSQSLY WLPEGPNGMI AVIDTKTVEL KEKIDIKIRP FNIFAGKNGY LYVTSREPQK AYFNSYSRST KEFMDSELVN NECLSEYNPT LDRIYAIPID IMPIDYKVLN VDNGKFVSSY SSTYYDSYPL AEKFKISPDG KYLFNSSGVV FTCNENVNED MKFAFTLDKK FTDIAFNMEE NRFYTAVGGN QIYVYNYEDF SGIDTLSSTG EILKLFYVDG KLCALSRSAN GRPMFEVIQK VKIKYGDVNK DGRINSTDIM YLKGYLLRNS AFNLDEYGLM AADVDGNGSV SSLDLTYLKR YILRRISDFP ANKK
|
| |