Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_0797 |
Symbol | |
ID | 4810415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | - |
Start bp | 961597 |
End bp | 964041 |
Gene Length | 2445 bp |
Protein Length | 814 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640106214 |
Product | glycoside hydrolase family protein |
Protein accession | YP_001037225 |
Protein GI | 125973315 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAAAAAAA TTGTTTCTTT GGTTTGTGTG CTTGTGATGC TGGTAAGCAT CTTAGGCTCG TTTTCAGTCG TAGCGGCATC ACCGGTAAAA GGCTTTCAGG TATCGGGAAC AAAGCTTTTG GATGCAAGCG GAAACGAGCT TGTAATGAGG GGCATGCGTG ATATTTCAGC AATAGATTTG GTTAAAGAAA TAAAAATCGG ATGGAATTTG GGAAATACTT TGGATGCTCC TACAGAGACT GCCTGGGGAA ATCCAAGGAC AACCAAGGCA ATGATAGAAA AGGTAAGGGA AATGGGCTTT AATGCCGTCA GAGTGCCTGT TACCTGGGAT ACGCACATCG GACCTGCTCC GGACTATAAA ATTGACGAAG CATGGCTGAA CAGAGTTGAG GAAGTGGTAA ACTATGTTCT TGACTGCGGT ATGTACGCGA TCATAAATGT TCACCATGAC AATACATGGA TTATACCTAC ATATGCCAAT GAGCAAAGGA GTAAAGAAAA ACTTGTAAAA GTTTGGGAAC AAATAGCAAC CCGTTTTAAA GATTATGACG ACCATTTGTT GTTTGAGACA ATGAACGAAC CGAGAGAAGT AGGTTCACCT ATGGAATGGA TGGGCGGAAC GTATGAAAAC CGAGATGTGA TAAACAGATT TAATTTGGCG GTTGTTAATA CCATCAGAGC AAGCGGCGGA AATAACGATA AAAGATTCAT ACTGGTTCCG ACCAATGCGG CAACCGGCCT GGATGTTGCA TTAAACGACC TTGTCATTCC GAACAATGAC AGCAGAGTCA TAGTATCCAT ACATGCTTAT TCACCGTATT TCTTTGCTAT GGATGTCAAC GGAACTTCAT ATTGGGGAAG TGACTATGAC AAGGCTTCTC TTACAAGTGA ACTTGATGCT ATTTACAACA GATTTGTGAA AAACGGAAGG GCTGTAATTA TCGGAGAATT CGGAACCATT GACAAGAACA ACCTGTCTTC AAGGGTGGCT CATGCCGAGC ACTATGCAAG AGAAGCAGTT TCAAGAGGAA TTGCTGTTTT CTGGTGGGAT AACGGCTATT ACAATCCGGG TGATGCAGAG ACTTATGCAT TGCTGAACAG AAAAACTCTC TCATGGTATT ATCCTGAAAT TGTCCAGGCT CTTATGAGAG GTGCCGGCGT TGAACCTTTA GTTTCACCGA CTCCTACACC TACATTAATG CCGACCCCCT CGCCCACGGT GACAGCAAAT ATTTTGTACG GTGACGTAAA CGGGGACGGA AAAATAAATT CTACAGACTG TACAATGCTA AAGAGATATA TTTTGCGTGG CATAGAAGAA TTCCCAAGTC CTAGCGGAAT TATAGCCGCT GACGTAAATG CGGATCTGAA AATCAATTCC ACCGACTTGG TATTGATGAA AAAATATCTA CTGCGCTCAA TAGACAAATT TCCTGCGGAG GATTCTCAAA CACCTGATGA AGACAATCCG GGCATTTTGT ATAACGGAAG ATTCGATTTT TCAGATCCGA ACGGTCCGAA ATGCGCCTGG TCCGGCAGCA ATGTTGAGCT GAATTTTTAC GGCACGGAAG CAAGTGTGAC TATCAAATCC GGCGGTGAGA ACTGGTTCCA GGCTATTGTA GACGGCAATC CTCTTCCTCC TTTTTCGGTT AACGCTACTA CCTCTACCGT AAAGCTTGTA AGCGGTCTTG CAGAAGGAGC TCATCATCTT GTATTGTGGA AGAGGACAGA GGCATCCTTG GGAGAAGTTC AGTTCCTTGG GTTTGATTTT GGTTCAGGAA AGCTTCTTGC CGCACCGAAG CCTTTGGAAA GAAAGATTGA GTTTATCGGA GACTCCATCA CATGTGCATA CGGAAATGAA GGAACAAGCA AGGAGCAGTC TTTTACACCG AAAAATGAAA ACAGCTATAT GTCTTATGCG GCAATTACAG CCCGTAATTT GAATGCAAGT GCAAATATGA TTGCGTGGTC CGGAATCGGA CTTACCATGA ACTACGGCGG AGCCCCCGGA CCTCTTATAA TGGACCGTTA TCCTTATACC CTTCCTTACA GCGGAGTCAG ATGGGATTTT AGCAAATATG TGCCTCAGGT TGTTGTAATC AATCTTGGTA CCAATGATTT TTCTACATCA TTTGCAGATA AAACAAAGTT TGTAACGGCA TATAAAAACC TTATAAGTGA AGTTCGCAGG AACTATCCGG ATGCCCATAT ATTCTGCTGT GTCGGTCCGA TGCTTTGGGG AACGGGCCTG GATTTGTGCC GCAGTTATGT TACGGAAGTT GTAAATGATT GTAACAGAAG CGGGGATTTA AAGGTGTATT TTGTTGAGTT TCCGCAGCAG GACGGAAGCA CCGGATACGG AGAAGACTGG CATCCAAGTA TTGCCACCCA CCAGCTGATG GCTGAGCGGC TTACTGCGGA AATAAAAAAC AAGCTTGGAT GGTAA
|
Protein sequence | MKKIVSLVCV LVMLVSILGS FSVVAASPVK GFQVSGTKLL DASGNELVMR GMRDISAIDL VKEIKIGWNL GNTLDAPTET AWGNPRTTKA MIEKVREMGF NAVRVPVTWD THIGPAPDYK IDEAWLNRVE EVVNYVLDCG MYAIINVHHD NTWIIPTYAN EQRSKEKLVK VWEQIATRFK DYDDHLLFET MNEPREVGSP MEWMGGTYEN RDVINRFNLA VVNTIRASGG NNDKRFILVP TNAATGLDVA LNDLVIPNND SRVIVSIHAY SPYFFAMDVN GTSYWGSDYD KASLTSELDA IYNRFVKNGR AVIIGEFGTI DKNNLSSRVA HAEHYAREAV SRGIAVFWWD NGYYNPGDAE TYALLNRKTL SWYYPEIVQA LMRGAGVEPL VSPTPTPTLM PTPSPTVTAN ILYGDVNGDG KINSTDCTML KRYILRGIEE FPSPSGIIAA DVNADLKINS TDLVLMKKYL LRSIDKFPAE DSQTPDEDNP GILYNGRFDF SDPNGPKCAW SGSNVELNFY GTEASVTIKS GGENWFQAIV DGNPLPPFSV NATTSTVKLV SGLAEGAHHL VLWKRTEASL GEVQFLGFDF GSGKLLAAPK PLERKIEFIG DSITCAYGNE GTSKEQSFTP KNENSYMSYA AITARNLNAS ANMIAWSGIG LTMNYGGAPG PLIMDRYPYT LPYSGVRWDF SKYVPQVVVI NLGTNDFSTS FADKTKFVTA YKNLISEVRR NYPDAHIFCC VGPMLWGTGL DLCRSYVTEV VNDCNRSGDL KVYFVEFPQQ DGSTGYGEDW HPSIATHQLM AERLTAEIKN KLGW
|
| |