Gene EcSMS35_0387 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0387 
SymboladhC 
ID6143543 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp402177 
End bp403286 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content54% 
IMG OID641615283 
ProductS-(hydroxymethyl)glutathione dehydrogenase/class III alcohol dehydrogenase 
Protein accessionYP_001742490 
Protein GI170682377 
COG category[C] Energy production and conversion 
COG ID[COG1062] Zn-dependent alcohol dehydrogenases, class III 
TIGRFAM ID[TIGR02818] S-(hydroxymethyl)glutathione dehydrogenase/class III alcohol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.41926 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCAC GTGCTGCCGT TGCATTTGCT CCCGGTAAAC CGCTGGAAAT CGTTGAAATT 
GACGTTGCAC CACCGAAAAA AGGTGAAGTG CTGATTAAAG TCACCCATAC CGGCGTTTGC
CATACCGACG CATTTACCCT CTCCGGCGAT GACCCGGAAG GTGTATTCCC GGTAGTTCTC
GGTCACGAAG GGGCCGGCGT TGTGGTTGAA GTCGGTGAAG GCGTAACCAG CGTCAAACCT
GGCGACCATG TGATCCCGCT TTACACCGCA GAGTGCGGCG AGTGTGAGTT CTGTCGTTCC
GGCAAAACCA ACCTCTGTGT TGCGGTTCGC GAAACCCAGG GTAAAGGCCT GATGCCAGAC
GGCACCACCC GTTTTTCTTA CAACGGGCAG CCGCTTTATC ACTACATGGG ATGCTCTACA
TTCAGTGAAT ACACCGTGGT CGCGGAAGTG TCTCTGGCCA AAATTAATCC AGAGGCAAAC
CACGAACACG TCTGCCTGCT GGGCTGTGGC GTGACCACCG GTATTGGCGC GGTACACAAC
ACGGCTAAAG TCCAGCCAGA TGATTCTGTT GCCGTATTTG GTCTTGGCGC GATTGGTCTG
GCAGTGGTTC AGGGCGCGCG TCAGGCGAAA GCGGGACGGA TTATCGCTAT CGATACGAAC
CCGAAGAAAT TCGATCTGGC GCGTCGCTTC GGTGCTACCG ACTGCATTAA CCCGAATGAC
TACGACAAAC CGATTAAAGA TGTCCTGCTG GATATCAACA AATGGGGTAT CGACCATACC
TTTGAATGCA TCGGTAACGT CAACGTGATG CGTGCGGCGC TGGAAAGTGC GCACCGCGGC
TGGGGTCAGT CGGTGATCAT CGGGGTAGCT GGTTCTGGTC AGGAAATCTC CACCCGTCCA
TTCCAGTTGG TCACCGGTCG CGTATGGAAA GGTTCCGCGT TTGGCGGCGT GAAAGGTCGT
TCCCAGTTAC CGGGTATGGT TGAAGATGCG ATGAAAGGCG ATATCGATCT GGAACCGTTT
GTCACGCATA CCATGAGCCT GGATGAAATT AATGACGCCT TTGACCTGAT GCATGAAGGC
AAATCCATTC GAACCGTAAT TCGTTACTGA
 
Protein sequence
MKSRAAVAFA PGKPLEIVEI DVAPPKKGEV LIKVTHTGVC HTDAFTLSGD DPEGVFPVVL 
GHEGAGVVVE VGEGVTSVKP GDHVIPLYTA ECGECEFCRS GKTNLCVAVR ETQGKGLMPD
GTTRFSYNGQ PLYHYMGCST FSEYTVVAEV SLAKINPEAN HEHVCLLGCG VTTGIGAVHN
TAKVQPDDSV AVFGLGAIGL AVVQGARQAK AGRIIAIDTN PKKFDLARRF GATDCINPND
YDKPIKDVLL DINKWGIDHT FECIGNVNVM RAALESAHRG WGQSVIIGVA GSGQEISTRP
FQLVTGRVWK GSAFGGVKGR SQLPGMVEDA MKGDIDLEPF VTHTMSLDEI NDAFDLMHEG
KSIRTVIRY