Gene Cthe_2549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2549 
Symbol 
ID4809305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3017939 
End bp3018913 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content34% 
IMG OID640107964 
Productcellulosome enzyme, dockerin type I 
Protein accessionYP_001038943 
Protein GI125975033 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones54 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCTATC TGGATAATGA GCTGTTTGTT ACAGCGCTAA CAGATTCGCA AAGTTTATAC 
TGGCTGCCGG AAGGCCCAAA TGGCATGATA GCAGTAATTG ATACCAAAAC TGTGGAGTTA
AAAGAAAAAA TAGATATCAA AATTAGGCCA TTTAATATTT TTGCAGGAAA GAATGGTTAC
TTATATGTGA CTTCAAGAGA GCCTCAAAAG GCTTATTTTA ATAGCTATTC ACGTTCCACT
AAAGAATTCA TGGATTCGGA ATTAGTAAAT AATGAATGCT TGTCTGAGTA CAATCCAACC
CTAGACAGGA TTTATGCTAT TCCTATTGAT ATAATGCCAA TAGACTATAA AGTTTTAAAT
GTTGATAACG GTAAGTTTGT GTCTTCTTAT AGTTCAACAT ACTATGACAG TTATCCTTTA
GCAGAAAAAT TTAAGATATC TCCTGACGGC AAATACTTGT TTAATAGTTC TGGAGTTGTA
TTTACATGCA ATGAGAATGT AAATGAAGAT ATGAAGTTTG CTTTTACTCT GGATAAAAAA
TTTACAGATA TTGCATTTAA TATGGAAGAA AACAGGTTTT ATACTGCAGT TGGCGGCAAT
CAAATTTACG TTTATAATTA TGAAGACTTT TCAGGAATTG ATACGTTGTC GTCAACTGGA
GAGATATTGA AGCTGTTTTA TGTAGACGGT AAATTGTGTG CTTTATCTAG AAGCGCCAAT
GGCAGACCAA TGTTTGAAGT TATTCAAAAA GTGAAAATCA AATATGGTGA TGTTAATAAA
GATGGAAGAA TAAATTCAAC GGATATTATG TATTTGAAGG GATATCTGTT GCGAAACAGT
GCTTTCAATT TAGACGAATA CGGCTTAATG GCGGCGGATG TGGACGGCAA TGGTTCAGTA
AGCTCATTGG ATTTGACATA TCTGAAGAGG TATATATTAC GCAGGATTTC AGACTTCCCT
GCAAACAAGA AATAA
 
Protein sequence
MAYLDNELFV TALTDSQSLY WLPEGPNGMI AVIDTKTVEL KEKIDIKIRP FNIFAGKNGY 
LYVTSREPQK AYFNSYSRST KEFMDSELVN NECLSEYNPT LDRIYAIPID IMPIDYKVLN
VDNGKFVSSY SSTYYDSYPL AEKFKISPDG KYLFNSSGVV FTCNENVNED MKFAFTLDKK
FTDIAFNMEE NRFYTAVGGN QIYVYNYEDF SGIDTLSSTG EILKLFYVDG KLCALSRSAN
GRPMFEVIQK VKIKYGDVNK DGRINSTDIM YLKGYLLRNS AFNLDEYGLM AADVDGNGSV
SSLDLTYLKR YILRRISDFP ANKK