Gene EcHS_A0421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0421 
SymboladhC 
ID5594614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp443745 
End bp444854 
Gene Length1110 bp 
Protein Length369 aa 
Translation table11 
GC content54% 
IMG OID640919606 
ProductS-(hydroxymethyl)glutathione dehydrogenase/class III alcohol dehydrogenase 
Protein accessionYP_001457191 
Protein GI157159873 
COG category[C] Energy production and conversion 
COG ID[COG1062] Zn-dependent alcohol dehydrogenases, class III 
TIGRFAM ID[TIGR02818] S-(hydroxymethyl)glutathione dehydrogenase/class III alcohol dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones58 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATCAC GTGCTGCCGT TGCATTTGCT CCCGGTAAAC CGCTGGAAAT CGTTGAAATT 
GACGTTGCAC CACCGAAAAA AGGTGAAGTG CTGATTAAAG TCACCCATAC CGGCGTTTGC
CATACCGACG CATTTACCCT CTCTGGGGAT GACCCGGAAG GTGTATTCCC GGTGGTTCTC
GGTCACGAAG GGGCCGGCGT TGTGGTTGAA GTCGGTGAAG GCGTAACCAG CGTCAAACCT
GGCGACCATG TGATCCCGCT TTACACCGCG GAGTGCGGCG AGTGTGAGTT CTGTCGTTCT
GGCAAAACTA ACCTCTGTGT TGCGGTTCGC GAAACCCAGG GTAAAGGCTT GATGCCAGAC
GGCACCACCC GTTTTTCTTA CAACGGGCAG CCGCTTTATC ACTACATGGG ATGCTCAACA
TTCAGTGAAT ACACCGTGGT CGCGGAAGTG TCTCTGGCCA AAATTAATCC AGAAGCAAAC
CATGAACACG TCTGCCTGCT GGGCTGTGGC GTGACCACCG GTATTGGCGC GGTGCACAAC
ACAGCTAAAG TCCAGCCAGG TGATTCTGTT GCCGTGTTTG GTCTTGGCGC GATTGGTCTG
GCAGTGGTTC AGGGCGCGCG TCAGGCGAAA GCGGGACGGA TTATCGCTAT CGATACCAAC
CCGAAGAAAT TCGATCTGGC GCGTCGCTTC GGTGCTACCG ACTGCATTAA CCCGAATGAC
TACGACAAAC CGATTAAAGA TGTCCTGCTG GATATCAACA AATGGGGTAT CGACCATACC
TTTGAATGCA TCGGTAACGT CAACGTGATG CGTGCGGCGC TGGAAAGTGC GCACCGCGGC
TGGGGTCAGT CGGTGATCAT CGGGGTGGCA GGTTCTGGTC AGGAAATCTC CACCCGTCCA
TTCCAGTTGG TCACTGGTCG CGTATGGAAA GGTTCCGCGT TTGGCGGCGT GAAAGGTCGT
TCCCAGTTAC CGGGTATGGT TGAAGATGCG ATGAAAGGTG ATATCGATCT GGAACCGTTT
GTCACGCATA CCATGAGCCT TGATGAAATT AATGACGCCT TCGACCTGAT GCATGAAGGC
AAATCCATTC GAACCGTAAT TCGTTACTGA
 
Protein sequence
MKSRAAVAFA PGKPLEIVEI DVAPPKKGEV LIKVTHTGVC HTDAFTLSGD DPEGVFPVVL 
GHEGAGVVVE VGEGVTSVKP GDHVIPLYTA ECGECEFCRS GKTNLCVAVR ETQGKGLMPD
GTTRFSYNGQ PLYHYMGCST FSEYTVVAEV SLAKINPEAN HEHVCLLGCG VTTGIGAVHN
TAKVQPGDSV AVFGLGAIGL AVVQGARQAK AGRIIAIDTN PKKFDLARRF GATDCINPND
YDKPIKDVLL DINKWGIDHT FECIGNVNVM RAALESAHRG WGQSVIIGVA GSGQEISTRP
FQLVTGRVWK GSAFGGVKGR SQLPGMVEDA MKGDIDLEPF VTHTMSLDEI NDAFDLMHEG
KSIRTVIRY