Gene Cthe_2445 details


Gene Information

Locus tag: Cthe_2445
Symbol:
ID: 4809824
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2915703
End bp: 2916746
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 45%
IMG OID: 640107859
Product: alcohol dehydrogenase GroES-like protein
Protein accession: YP_001038840
Protein GI: 125974930
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
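
The Start bp, End bp, Gene Length and Protein Length fields are mutually consistent if the coordinates are read as 1-based and inclusive and the coding sequence ends in one untranslated stop codon. Both are assumptions about this record's conventions, not something the page states. A minimal Python sketch of that check, with variable names chosen here for illustration:

```python
# Values copied from the Start bp and End bp fields of this record.
start_bp = 2915703
end_bp = 2916746

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1044 347"
```

The result matches the reported Gene Length (1044 bp) and Protein Length (347 aa).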


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.682207
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGGAA AAATGAAGGT TTGTGTGTTG ACAGGCAAAG AAAAGCTGGA GTGGGTAGAA 
CGCGATATTC CACAGCCGGG AAGGGGGGAA TTGCAGATCA AGCTGAAACA TGTGGGCGTG
TGCGGTTCAG ACTTACATTT TTACAAAGAA GGACGTCTTG CAAACTGGGA ACTTGACGGA
CCGCTGGCAT TGGGACATGA ACCCGGAGGA ATTGTATCAG CCATAGGTGA AGGTGTTGAA
GGCTTTGAGA TTGGTGACAA GGTGGCATTG GAACCGGGAG TGCCGTGCGG TGAATGTGAA
GACTGTAGAA AAGGACATTA CAATTTGTGC AAACATATCA AATTTATGGC CATTCCTCAT
GAAAAAGACG GTGTGTTTGC CGAGTACTGC GTTCATAGTG CAAGCATGTG CTACAAACTG
CCTGAGAATG TTGACACAAT GGAAGGTGGC CTTATGGAAC CGCTTTCTGT TGCGCTTCAT
GCTACTGAAC TGTCAAACGC TAAAATAGGC GAAACAGCTA TTGTTTTAGG AAGTGGCTGC
ATTGGTTTGT GCACAGTTAT GGCTCTTAAG GCACGGGGAG TAAGTGAAAT TTATGTTACT
GATGTGGTGG ACAAGAGGCT TGAGAAGGCA CTTGAAGTCG GAGCAACCCG GGTATTTAAC
AGTCAGAGGG AAGACATTGT GGAGTTTGCA AAAACTCTGC CCGGCGGCGG TGCGGATCAG
GTTTACGAAT GTGCGGGAAG CCGTGTGACA ACTCTGCAGA CATGCAAACT GATCAAACGT
GCCGGAAAAG TCACATTGGT TGGTGTGTCA CCGGAGCCTG TTTTGGAACT GGATATCGCC
ACTTTGAACG CAATGGAAGG TACCGTATAT TCCGTATATC GTTACAGAAA TATGTATCCT
ATTGCAATAG CAGCCGTGTC TTCAGGAGTA ATTCCGCTTA AGAAAATTGT ATCCCATGTG
TTCGATTTTA AGGATTGCAT AGAGGCGATT GAATACAGTA CAAATCACAA GGATGAAGTT
ATAAAATCTG TTATTAAATT TTAA
 
Protein sequence
MKGKMKVCVL TGKEKLEWVE RDIPQPGRGE LQIKLKHVGV CGSDLHFYKE GRLANWELDG 
PLALGHEPGG IVSAIGEGVE GFEIGDKVAL EPGVPCGECE DCRKGHYNLC KHIKFMAIPH
EKDGVFAEYC VHSASMCYKL PENVDTMEGG LMEPLSVALH ATELSNAKIG ETAIVLGSGC
IGLCTVMALK ARGVSEIYVT DVVDKRLEKA LEVGATRVFN SQREDIVEFA KTLPGGGADQ
VYECAGSRVT TLQTCKLIKR AGKVTLVGVS PEPVLELDIA TLNAMEGTVY SVYRYRNMYP
IAIAAVSSGV IPLKKIVSHV FDFKDCIEAI EYSTNHKDEV IKSVIKF
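
The protein sequence can be related back to the gene sequence by translating it codon by codon. The Python sketch below is one way to do that, not the method used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are allowed; since this gene starts with ATG, no special start handling is needed. The helper names `clean`, `translate` and `gc_percent` are hypothetical and introduced here only for illustration.

```python
# Codon table in the conventional T, C, A, G ordering.
# Table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to display the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in a DNA sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three blocks of the gene sequence shown above.
first_blocks = "ATGAAAGGAA AAATGAAGGT TTGTGTGTTG"
print(translate(first_blocks))  # prints "MKGKMKVCVL"
```

The output matches the first block of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.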