Gene ECH74115_0431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0431 
SymboladhC 
ID6968833 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp440159 
End bp441268 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content53% 
IMG OID643384483 
ProductS-(hydroxymethyl)glutathione dehydrogenase/class III alcohol dehydrogenase 
Protein accessionYP_002268997 
Protein GI209400668 
COG category[C] Energy production and conversion 
COG ID[COG1062] Zn-dependent alcohol dehydrogenases, class III 
TIGRFAM ID[TIGR02818] S-(hydroxymethyl)glutathione dehydrogenase/class III alcohol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATCAC GTGCTGCCGT TGCATTTGCT CCCGGTAAAC CGCTGGAAAT CGTTGAAATT 
GACGTTGCAC CACCGAAAAA AGGTGAAGTG CTGATTAAAG TCACCCATAC CGGCGTTTGC
CATACCGACG CATTTACCCT CTCCGGCGAT GACCCGGAAG GTGTATTCCC GGTGGTTCTC
GGTCACGAAG GGGCTGGCGT TGTGGTTGAA GTCGGTGAAG GCGTAACCAG CGTCAAACCA
GGCGACCATG TGATCCCGCT TTACACCGCA GAGTGCGGCG AGTGTGAGTT CTGTCGTTCT
GGCAAAACTA ACCTCTGTGT TGCGGTTCGC GAAACCCAGG GTAAAGGCCT GATGCCAGAC
GGCACCACCC GTTTTTCTTA CAATGGGCAG CCGCTTTATC ACTACATGGG GTGTTCTACA
TTCAGTGAAT ACACCGTAGT CGCGGAAGTG TCTCTGGCAA AAATTAATCC AGAAGCAAAC
CATGAACACG TCTGCCTGCT GGGCTGTGGC GTGACCACCG GTATTGGCGC GGTACACAAC
ACAGCTAAAG TCCAGCCAGG GGATTCTGTT GCTGTGTTTG GTCTTGGCGC GATTGGTCTG
GCAGTGGTTC AGGGCGCGCG TCAGGCGAAA GCGGGACGGA TTATCGCTAT CGATACCAAC
CCGAAGAAAT TCGATCTGGC GCGTCGCTTC GGTGCCACCG ACTGCATTAA CCCGAATGAC
TACGACAAAC CGATTAAAGA TGTCCTGCTG GATATCAACA AATGGGGTAT CGACCATACC
TTTGAATGCA TCGGTAATGT CAACGTGATG CGTGCGGCGC TGGAAAGTGC GCACCGCGGC
TGGGGTCAGT CGGTGATCAT CGGGGTCGCG GGTTCCGGTC AGGAAATCTC CACCCGTCCA
TTCCAGTTGG TCACTGGTCG CGTATGGAAA GGTTCCGCGT TTGGCGGCGT GAAAGGTCGT
TCCCAGTTAC CGGGTATGGT TGAAGATGCG ATGAAAGGTG ATATCGATCT GGAACCGTTT
GTCACGCATA CCATGAGCCT GGATGAAATT AATGACGCCT TCGACCTGAT GCATGAAGGC
AAATCCATTC GAACCGTAAT TCGTTACTGA
 
Protein sequence
MKSRAAVAFA PGKPLEIVEI DVAPPKKGEV LIKVTHTGVC HTDAFTLSGD DPEGVFPVVL 
GHEGAGVVVE VGEGVTSVKP GDHVIPLYTA ECGECEFCRS GKTNLCVAVR ETQGKGLMPD
GTTRFSYNGQ PLYHYMGCST FSEYTVVAEV SLAKINPEAN HEHVCLLGCG VTTGIGAVHN
TAKVQPGDSV AVFGLGAIGL AVVQGARQAK AGRIIAIDTN PKKFDLARRF GATDCINPND
YDKPIKDVLL DINKWGIDHT FECIGNVNVM RAALESAHRG WGQSVIIGVA GSGQEISTRP
FQLVTGRVWK GSAFGGVKGR SQLPGMVEDA MKGDIDLEPF VTHTMSLDEI NDAFDLMHEG
KSIRTVIRY