Gene Cthe_0543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0543 
Symbol 
ID4808292 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp663788 
End bp666007 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content45% 
IMG OID640105957 
Productglycoside hydrolase family protein 
Protein accessionYP_001036972 
Protein GI125973062 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAGAAAA TTTTGGCGTT TTTGCTGACA GTTGCGCTGG TGGCAGTAGT GGCCATTCCA 
CAAGCCGTGG TAAGTTTTGC TGCGGATTTC AACTATGGTG AGGCACTTCA GAAAGCAATA
ATGTTTTATG AGTTCCAGCG CTCGGGAAAA CTGCCCGAAA ACAAAAGAAA CAACTGGCGT
GGAGATTCCG CTCTTAATGA CGGCGCAGAC AACGGTTTGG ACCTTACAGG CGGTTGGTAT
GATGCCGGTG ACCATGTAAA GTTCAACCTT CCGATGGCCT ATGCCGTTAC CATGCTCGCA
TGGAGTGTTT ATGAATCCCG GGATGCGTAT GTACAAAGCG GACAGCTTCC TTACATACTG
GACAATATTA AATGGGCTAC CGACTACTTT ATAAAATGCC ATCCAAGTCC AAATGTATAT
TATTATCAGG TGGGAGACGG AGCATTGGAC CATTCATGGT GGGGACCTGC TGAAGTAATG
CAGATGCCAA GACCGTCCTT CAAAGTGGAT TTGACCAATC CGGGTTCGAC TGTGGTTGCT
GAGACGGCAG CGGCTATGGC TGCATCCTCA ATTGTTTTCA AGCCTACAGA CCCGGAATAT
GCTGCCACAC TTTTAAGGCA TGCGAAAGAA CTCTTTACTT TTGCCGACAC CACAAGAAGT
GACGCAGGAT ATAGAGCGGC AGAGGGATAC TATTCATCCC ACAGCGGTTT TTATGATGAA
CTTACCTGGG CGAGTATATG GCTGTATCTT GCAACAGGAG ACCAGTCTTA TCTTGATAAA
GCAGAATCCT ATGAACCTCA TTGGGAAAGG GAAAGAGGTA CAACTTTAAT TAGTTATTCC
TGGGCTCATT GCTGGGATAA CAAATTGTAC GGTTCTTTGC TTTTGTTAGC AAAAATTACC
GGCAAGTCTT ATTACAAGCA ATGTATTGAA AACCATCTTG ACTATTGGAC CGTCGGATTT
AACGGAAGCA GAGTTCAATA TACTCCAAAA GGACTCGCAT ATCTTGACAG ATGGGGTTCA
TTGAGATATG CAACCACACA GGCGTTCCTT GCCAGCGTTT ATGCGGACTG GTCCGGCTGT
GACCCGGCTA AGGCGGCTGT CTACAAGGAA TTTGCAAAAA AACAGGTGGA TTATGCATTA
GGAAGCACAG GAAGAAGCTT TGTAGTAGGT TTTGGAAAAA ATCCGCCAAG AAATCCTCAC
CACAGGACGG CCCACAGCTC ATGGAGCGCT TTAATGACCG AACCTGCGGA GTGCAGACAT
ATTCTGGTGG GTGCATTGGT TGGCGGACCG GACGGTTCGG ATTCATATGT TGACAGGCTC
GATGATTATC AGTGCAATGA GGTGGCCAAC GACTATAATG CTGGATTTGT AGGTGCTCTT
GCCAAGATGT ATGAGAAGTA TGGCGGAGAA CCGATTCCGA ATTTCGTTGC TTTTGAAACA
CCGGGGGAAG AATTTTATGT TGAAGCTGCG GTAAATGCTG CAGGACCCGG TTTTGTAAAT
ATCAAAGCTT CAATAATCAA CAAGTCCGGT TGGCCGGCAA GAGGTTCAGA TAAATTGTCA
GCCAAGTATT TTGTCGATAT TTCCGAAGCT GTTGCAAAAG GCATTACTTT GGATCAAATT
ACCGTTCAGT CGACTACTAA TGGCGGAGCC AAGGTTTCAC AGCTTCTTCC GTGGGATCCG
GACAATCATA TTTATTATGT AAACATTGAC TTTACGGGAA TAAACATATT CCCCGGAGGA
ATAAATGAAT ACAAGAGGGA TGTATATTTC ACTATTACGG CGCCGTATGG AGAGGGTAAC
TGGGACAATA CCAACGACTT CTCCTTCCAG GGACTTGAGC AGGGCTTTAC AAGCAAAAAG
ACTGAATATA TACCGTTGTA TGACGGTAAT GTGAGAGTAT GGGGTAAAGT ACCGGACGGA
GGTTCGGAGC CCGATCCGAC GCCGACAATC ACCGTTGGCC CCACTCCTTC GGTTACACCG
ACATCAGTAC CTGGAATAAT GCTCGGAGAT GTGAATTTTG ACGGAAGAAT AAACTCGACG
GATTATTCAC GCTTAAAAAG ATATGTAATA AAGTCTTTGG AATTCACAGA TCCTGAAGAG
CACCAGAAGT TCATTGCAGC TGCGGATGTT GACGGGAACG GAAGAATAAA CTCCACAGAT
TTGTATGTGC TCAACAGGTA CATATTAAAA CTTATTGAAA AATTCCCGGC TGAACAGTAA
 
Protein sequence
MKKILAFLLT VALVAVVAIP QAVVSFAADF NYGEALQKAI MFYEFQRSGK LPENKRNNWR 
GDSALNDGAD NGLDLTGGWY DAGDHVKFNL PMAYAVTMLA WSVYESRDAY VQSGQLPYIL
DNIKWATDYF IKCHPSPNVY YYQVGDGALD HSWWGPAEVM QMPRPSFKVD LTNPGSTVVA
ETAAAMAASS IVFKPTDPEY AATLLRHAKE LFTFADTTRS DAGYRAAEGY YSSHSGFYDE
LTWASIWLYL ATGDQSYLDK AESYEPHWER ERGTTLISYS WAHCWDNKLY GSLLLLAKIT
GKSYYKQCIE NHLDYWTVGF NGSRVQYTPK GLAYLDRWGS LRYATTQAFL ASVYADWSGC
DPAKAAVYKE FAKKQVDYAL GSTGRSFVVG FGKNPPRNPH HRTAHSSWSA LMTEPAECRH
ILVGALVGGP DGSDSYVDRL DDYQCNEVAN DYNAGFVGAL AKMYEKYGGE PIPNFVAFET
PGEEFYVEAA VNAAGPGFVN IKASIINKSG WPARGSDKLS AKYFVDISEA VAKGITLDQI
TVQSTTNGGA KVSQLLPWDP DNHIYYVNID FTGINIFPGG INEYKRDVYF TITAPYGEGN
WDNTNDFSFQ GLEQGFTSKK TEYIPLYDGN VRVWGKVPDG GSEPDPTPTI TVGPTPSVTP
TSVPGIMLGD VNFDGRINST DYSRLKRYVI KSLEFTDPEE HQKFIAAADV DGNGRINSTD
LYVLNRYILK LIEKFPAEQ