Gene Cthe_0115 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0115 
Symbol 
ID4808741 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp147032 
End bp149011 
Gene Length1980 bp 
Protein Length659 aa 
Translation table11 
GC content41% 
IMG OID640105526 
Productglycogen debranching enzyme, putative 
Protein accessionYP_001036549 
Protein GI125972639 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3408] Glycogen debranching enzyme 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type
[TIGR01561] glycogen debranching enzyme, archaeal type, putative 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTTTG GAAAATCCAG CTGGAGAACT TATGAGCAAG GGATTCAACG GGAATGGCTT 
CTAACCAACG GCATAGGAGG ATTTGCATCA TCCACGATTA TAGGTTCGAA CACCCGAAGG
TACCATGGTC TGTTGGTGGC GGCACTGAAA CCTCCTGTAA GCAGGCACTT GATATTGTCC
AAAATTGATG AATGTGTAAC TGTTGACAAT GAATCGTTCA ATTTGTTTTC TTATGAAGTG
CCGGGTTTTA TAATGCATGG ATATCATCAT CTGGAACGAT TTGAATATGA TATGCTTCCT
AAATATATAT ATAGAGTAAA GGATGTATAT GTTGAAAAGG AAATATGCAT GGTTTACGGT
GAAAACACCG TTGTTGTGGT TTACCACGTC ATAAACGGGA CCGGAAGAAC CAATTTGAGA
CTTACTCCTC TTGTGAATTT CCGGGACTAT CATTTTAACT CCGGCAGAGC ATATATGAGT
TTTGAAAGAA AATTTGAAAA AGATGTTTTA ACGGTGCGAC CTTCCTGCTA CGACATTGAC
ATTAAGCTGT ACTCAACTGA TGGGAAATTT ACGGAGCTTG ACGATACCTG GTTTTACAAC
ATGGATTATG CTGTGGAAAG GGAAAGAGGT TTGGCCTCCA CCGAGGATCA TTATATACCC
GGGTATTATG ATATTGAAAT AAAACCTTTG GAGGAAAAAT ACATAACCTT TGTAGCTACC
GTCGAAAAAA ACGTAAATCA CAGAGACGGG CTTAAAATAA TAGAAAAAGA AAAAAAGAGG
CTTAAAAAAC TGGCAAGTAA AGCGGGATAC AAGGATGAGC TGGCAAAAAA ACTGGTTTTG
GCGGCAGACA AATTTATAGT ACACAGGGAA TCCACTAATG CGAAAACCGT AATTGCAGGA
TATCCGTGGT TTACGGATTG GGGAAGGGAT ACCATGATAT CTCTTGCGGG ACTTACGCTG
GCAACCCGGC GTTTTGAGGA TGCAAAGGAG ATTCTTTATA CTTTTTCAAA ATATGTCAAA
GACGGACTTA TACCGAACAT GTTCCCCGAT GCGGGACATG AGCCGCCGTA CAACACCGTT
GATGCCGCTC TGTGGTACTT TGAAGCTGTA AACAAGTATG TAAATTACAC CCATGATTAT
AAGTTCATAA AAGAAAATAT ATACAGTGGA TTGAAACAGA TAATTGAGTA CTACTCAAAA
GGAACCCACT TTAATATAAA AGCTGATGAG GATTATTTAA TCAGTGCCGG AGATGAACAT
ACCCAGCTTA CATGGATGGA TGCGAAAGTG GGTGACTGGG TGGTTACTCC GCGTCATGGA
AAAGCTGTGG AGATAAATGC TTTGTGGTAT AATGCCTTAA AAATTATGTC GCAGCTTTCG
AAACATTTTA AAGAAGATGC CGGATTATAC GATGAAATGG CCGAAAAAGT CAAGAAGTCT
TTTGAAGAGA AGTTCTGGAA TGAAGAGAAA AAATGTCTTT ATGACTGTCT TACACCAAAT
TATAAAGATG ATAAAGTAAG GCCGAACCAG ATATTGGCTG TAAGTCTTTC CTATCCGGTA
ATAGAAGGAG AAAAGGCAAA GAGTGTTGTG GATATTGTTT TTAAAGAGCT CTATACCGCT
TATGGCTTAA GGAGTCTGTC ACCTAAAGAA AAGGAATATG TGGGAGTGTA TATGGGTGAT
CAATACAGAA GGGACGGAGC ATACCATCAG GGCACGGTCT GGACATGGCC TTTGGGCCAC
TTTATAACTG CGTATCTCAG GGTAAACAAT TATTCGCCGG AGGCCAGGAA AATGGCTTTA
AGATTTATTG AGCCCTTTAA GGACCATCTT CAGGATGCCT GTCTTGGTTC TGTTTCCGAG
ATATTTGACG GTAATGAGCC GTTGATACCA AGAGGCTGTT TTGCACAGGC CTGGAGTGTG
GCAGAGATAC TCAGAGCCTA TGTTGAGGAC GTGATGCAGT TGGTACAGCT AAAACACTAG
 
Protein sequence
MNFGKSSWRT YEQGIQREWL LTNGIGGFAS STIIGSNTRR YHGLLVAALK PPVSRHLILS 
KIDECVTVDN ESFNLFSYEV PGFIMHGYHH LERFEYDMLP KYIYRVKDVY VEKEICMVYG
ENTVVVVYHV INGTGRTNLR LTPLVNFRDY HFNSGRAYMS FERKFEKDVL TVRPSCYDID
IKLYSTDGKF TELDDTWFYN MDYAVERERG LASTEDHYIP GYYDIEIKPL EEKYITFVAT
VEKNVNHRDG LKIIEKEKKR LKKLASKAGY KDELAKKLVL AADKFIVHRE STNAKTVIAG
YPWFTDWGRD TMISLAGLTL ATRRFEDAKE ILYTFSKYVK DGLIPNMFPD AGHEPPYNTV
DAALWYFEAV NKYVNYTHDY KFIKENIYSG LKQIIEYYSK GTHFNIKADE DYLISAGDEH
TQLTWMDAKV GDWVVTPRHG KAVEINALWY NALKIMSQLS KHFKEDAGLY DEMAEKVKKS
FEEKFWNEEK KCLYDCLTPN YKDDKVRPNQ ILAVSLSYPV IEGEKAKSVV DIVFKELYTA
YGLRSLSPKE KEYVGVYMGD QYRRDGAYHQ GTVWTWPLGH FITAYLRVNN YSPEARKMAL
RFIEPFKDHL QDACLGSVSE IFDGNEPLIP RGCFAQAWSV AEILRAYVED VMQLVQLKH