Gene Cthe_0388 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0388 
Symbol 
ID4808465 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp482647 
End bp483681 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content42% 
IMG OID640105802 
Productalcohol dehydrogenase GroES-like protein 
Protein accessionYP_001036819 
Protein GI125972909 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases 
TIGRFAM ID[TIGR01202] 2-desacetyl-2-hydroxyethyl bacteriochlorophyllide A dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00875101 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACAAT CGGTTATGGT ATCTCCTGGA AAGATTGAGT TCCATGAGGT GGAAAAACCG 
GAACTAAAAC CTGGACAAGT ACTAATAAAG ATTATGAGAA TTGGCATATG TGGTTCGGAT
ATTCATGTAA ATCATGGAAA ACATCCTTTC ACAAAGTATC CGGTAACTCA AGGACATGAA
GTAAGTGGAA AAATCGTCGA AGTTGCAGAA GATGTGGAGC ATCTGAAAGT TGGCCAGAAA
GTGACTATAG AGCCTCAGGT TGTATGTGGT AAATGTCATC CGTGCCGTAC AGGAAAATAT
AACCTCTGTG AGGAACTAAA AGTTATGGGA TTCCAGACCG TTGGCGCAGG CAGCGAATAT
TTTGCTGTTG ATGCGAAGAA TGTAACGACT GTCCCTGACC ATCTTTCTTA TGACGAGGCC
GCGATGATTG AGCCTTTGGC TGTTACCGTT CATGCGGCAA ACAGAGTCGG TGATGTGAAG
GATAAGGACA TTGTAGTAAT AGGTGCGGGT CCTATAGGAA TTTTGCTGGT TCAGACCCTG
AAAGCAAAAG GTGCCCGCAA GGTAATGGTT ACGGATGTAA GCGATTATCG TTTGGAATTG
GCGCTCAAGT GTGGTGCTGA TTTTGCAGTC AATACCAAAA AAGAAGATTT CGGAGAAGCG
ATGCTTCGCT GTTTTGGACC TGATAAAGCA GATGTTATAT ATGATTGTGC GGGCAACAAT
ACAACAATGG AACAAGCCAT AAAACATTCC CGCAAGGGAA GCATAATTGT TCTGGTTGCA
GTATTTGAAG GTATGGCAAC AGTCGATCTT GCAACTCTTA ACGATAAGGA ACTGGATTTG
AATACGACAA TGATGTATCG GCATGAGGAC TATGTGGAAG CCATTGAGCT TGTTGAAGCG
GGTAAAGTAA AATTAACCCC TCTGATGAGC AAGCATTTTG CTTTTAGAGA TTGGGAAAAA
GCCTATGAGT ATATTGACAA TAATCGTGAA ATTACTATGA AAGTCCTCAT TGATGTTAAT
AACGATGATG ATTAA
 
Protein sequence
MLQSVMVSPG KIEFHEVEKP ELKPGQVLIK IMRIGICGSD IHVNHGKHPF TKYPVTQGHE 
VSGKIVEVAE DVEHLKVGQK VTIEPQVVCG KCHPCRTGKY NLCEELKVMG FQTVGAGSEY
FAVDAKNVTT VPDHLSYDEA AMIEPLAVTV HAANRVGDVK DKDIVVIGAG PIGILLVQTL
KAKGARKVMV TDVSDYRLEL ALKCGADFAV NTKKEDFGEA MLRCFGPDKA DVIYDCAGNN
TTMEQAIKHS RKGSIIVLVA VFEGMATVDL ATLNDKELDL NTTMMYRHED YVEAIELVEA
GKVKLTPLMS KHFAFRDWEK AYEYIDNNRE ITMKVLIDVN NDDD