Gene Cthe_0797 details

Gene Information

Locus tag: Cthe_0797
Symbol:
ID: 4810415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 961597
End bp: 964041
Gene Length: 2445 bp
Protein Length: 814 aa
Translation table: 11
GC content: 44%
IMG OID: 640106214
Product: glycoside hydrolase family protein
Protein accession: YP_001037225
Protein GI: 125973315
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2730] Endoglucanase
TIGRFAM ID:
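
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 961597
end_bp = 964041

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed gene length of 2445 bp and protein length of 814 aa.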


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAAAAAA TTGTTTCTTT GGTTTGTGTG CTTGTGATGC TGGTAAGCAT CTTAGGCTCG 
TTTTCAGTCG TAGCGGCATC ACCGGTAAAA GGCTTTCAGG TATCGGGAAC AAAGCTTTTG
GATGCAAGCG GAAACGAGCT TGTAATGAGG GGCATGCGTG ATATTTCAGC AATAGATTTG
GTTAAAGAAA TAAAAATCGG ATGGAATTTG GGAAATACTT TGGATGCTCC TACAGAGACT
GCCTGGGGAA ATCCAAGGAC AACCAAGGCA ATGATAGAAA AGGTAAGGGA AATGGGCTTT
AATGCCGTCA GAGTGCCTGT TACCTGGGAT ACGCACATCG GACCTGCTCC GGACTATAAA
ATTGACGAAG CATGGCTGAA CAGAGTTGAG GAAGTGGTAA ACTATGTTCT TGACTGCGGT
ATGTACGCGA TCATAAATGT TCACCATGAC AATACATGGA TTATACCTAC ATATGCCAAT
GAGCAAAGGA GTAAAGAAAA ACTTGTAAAA GTTTGGGAAC AAATAGCAAC CCGTTTTAAA
GATTATGACG ACCATTTGTT GTTTGAGACA ATGAACGAAC CGAGAGAAGT AGGTTCACCT
ATGGAATGGA TGGGCGGAAC GTATGAAAAC CGAGATGTGA TAAACAGATT TAATTTGGCG
GTTGTTAATA CCATCAGAGC AAGCGGCGGA AATAACGATA AAAGATTCAT ACTGGTTCCG
ACCAATGCGG CAACCGGCCT GGATGTTGCA TTAAACGACC TTGTCATTCC GAACAATGAC
AGCAGAGTCA TAGTATCCAT ACATGCTTAT TCACCGTATT TCTTTGCTAT GGATGTCAAC
GGAACTTCAT ATTGGGGAAG TGACTATGAC AAGGCTTCTC TTACAAGTGA ACTTGATGCT
ATTTACAACA GATTTGTGAA AAACGGAAGG GCTGTAATTA TCGGAGAATT CGGAACCATT
GACAAGAACA ACCTGTCTTC AAGGGTGGCT CATGCCGAGC ACTATGCAAG AGAAGCAGTT
TCAAGAGGAA TTGCTGTTTT CTGGTGGGAT AACGGCTATT ACAATCCGGG TGATGCAGAG
ACTTATGCAT TGCTGAACAG AAAAACTCTC TCATGGTATT ATCCTGAAAT TGTCCAGGCT
CTTATGAGAG GTGCCGGCGT TGAACCTTTA GTTTCACCGA CTCCTACACC TACATTAATG
CCGACCCCCT CGCCCACGGT GACAGCAAAT ATTTTGTACG GTGACGTAAA CGGGGACGGA
AAAATAAATT CTACAGACTG TACAATGCTA AAGAGATATA TTTTGCGTGG CATAGAAGAA
TTCCCAAGTC CTAGCGGAAT TATAGCCGCT GACGTAAATG CGGATCTGAA AATCAATTCC
ACCGACTTGG TATTGATGAA AAAATATCTA CTGCGCTCAA TAGACAAATT TCCTGCGGAG
GATTCTCAAA CACCTGATGA AGACAATCCG GGCATTTTGT ATAACGGAAG ATTCGATTTT
TCAGATCCGA ACGGTCCGAA ATGCGCCTGG TCCGGCAGCA ATGTTGAGCT GAATTTTTAC
GGCACGGAAG CAAGTGTGAC TATCAAATCC GGCGGTGAGA ACTGGTTCCA GGCTATTGTA
GACGGCAATC CTCTTCCTCC TTTTTCGGTT AACGCTACTA CCTCTACCGT AAAGCTTGTA
AGCGGTCTTG CAGAAGGAGC TCATCATCTT GTATTGTGGA AGAGGACAGA GGCATCCTTG
GGAGAAGTTC AGTTCCTTGG GTTTGATTTT GGTTCAGGAA AGCTTCTTGC CGCACCGAAG
CCTTTGGAAA GAAAGATTGA GTTTATCGGA GACTCCATCA CATGTGCATA CGGAAATGAA
GGAACAAGCA AGGAGCAGTC TTTTACACCG AAAAATGAAA ACAGCTATAT GTCTTATGCG
GCAATTACAG CCCGTAATTT GAATGCAAGT GCAAATATGA TTGCGTGGTC CGGAATCGGA
CTTACCATGA ACTACGGCGG AGCCCCCGGA CCTCTTATAA TGGACCGTTA TCCTTATACC
CTTCCTTACA GCGGAGTCAG ATGGGATTTT AGCAAATATG TGCCTCAGGT TGTTGTAATC
AATCTTGGTA CCAATGATTT TTCTACATCA TTTGCAGATA AAACAAAGTT TGTAACGGCA
TATAAAAACC TTATAAGTGA AGTTCGCAGG AACTATCCGG ATGCCCATAT ATTCTGCTGT
GTCGGTCCGA TGCTTTGGGG AACGGGCCTG GATTTGTGCC GCAGTTATGT TACGGAAGTT
GTAAATGATT GTAACAGAAG CGGGGATTTA AAGGTGTATT TTGTTGAGTT TCCGCAGCAG
GACGGAAGCA CCGGATACGG AGAAGACTGG CATCCAAGTA TTGCCACCCA CCAGCTGATG
GCTGAGCGGC TTACTGCGGA AATAAAAAAC AAGCTTGGAT GGTAA
 
Protein sequence
MKKIVSLVCV LVMLVSILGS FSVVAASPVK GFQVSGTKLL DASGNELVMR GMRDISAIDL 
VKEIKIGWNL GNTLDAPTET AWGNPRTTKA MIEKVREMGF NAVRVPVTWD THIGPAPDYK
IDEAWLNRVE EVVNYVLDCG MYAIINVHHD NTWIIPTYAN EQRSKEKLVK VWEQIATRFK
DYDDHLLFET MNEPREVGSP MEWMGGTYEN RDVINRFNLA VVNTIRASGG NNDKRFILVP
TNAATGLDVA LNDLVIPNND SRVIVSIHAY SPYFFAMDVN GTSYWGSDYD KASLTSELDA
IYNRFVKNGR AVIIGEFGTI DKNNLSSRVA HAEHYAREAV SRGIAVFWWD NGYYNPGDAE
TYALLNRKTL SWYYPEIVQA LMRGAGVEPL VSPTPTPTLM PTPSPTVTAN ILYGDVNGDG
KINSTDCTML KRYILRGIEE FPSPSGIIAA DVNADLKINS TDLVLMKKYL LRSIDKFPAE
DSQTPDEDNP GILYNGRFDF SDPNGPKCAW SGSNVELNFY GTEASVTIKS GGENWFQAIV
DGNPLPPFSV NATTSTVKLV SGLAEGAHHL VLWKRTEASL GEVQFLGFDF GSGKLLAAPK
PLERKIEFIG DSITCAYGNE GTSKEQSFTP KNENSYMSYA AITARNLNAS ANMIAWSGIG
LTMNYGGAPG PLIMDRYPYT LPYSGVRWDF SKYVPQVVVI NLGTNDFSTS FADKTKFVTA
YKNLISEVRR NYPDAHIFCC VGPMLWGTGL DLCRSYVTEV VNDCNRSGDL KVYFVEFPQQ
DGSTGYGEDW HPSIATHQLM AERLTAEIKN KLGW
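
The relationship between the two sequences above can be illustrated with a short translation sketch. It assumes translation table 11 (listed in the record), which assigns the same amino acids to codons as the standard code but allows alternative initiation codons such as GTG; an initiator codon is read as methionine. The function names `clean` and `translate_cds` and the `has_start` parameter are hypothetical helpers written for this example, and only the first 30 and last 15 bases of the gene sequence are used rather than the full sequence.

```python
from itertools import product

# Codon table built in the conventional T, C, A, G order.
# Table 11 uses the standard amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate_cds(dna, has_start=True):
    """Translate a coding sequence codon by codon.

    has_start: if True, the first codon is treated as an initiator and read
    as M (assumption: the fragment begins at the annotated start codon).
    A trailing stop codon is dropped from the result.
    """
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore an incomplete trailing codon
    protein = [CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3)]
    if protein and has_start:
        protein[0] = "M"
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First 30 bases of the gene sequence (starts with the GTG initiator).
head = translate_cds("GTGAAAAAAA TTGTTTCTTT GGTTTGTGTG")
# Last 15 bases of the gene sequence (ends with the TAA stop codon).
tail = translate_cds("AAGCTTGGAT GGTAA", has_start=False)

print(head)  # MKKIVSLVCV
print(tail)  # KLGW
```

The two fragments reproduce the first ten and last four residues of the protein sequence listed above. Applying the same function to the full gene sequence would be the natural next check.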