Gene EcHS_A4525 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A4525 
Symbol 
ID5591569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp4531319 
End bp4532338 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content54% 
IMG OID640923621 
Productzinc-binding dehydrogenase family oxidoreductase 
Protein accessionYP_001461062 
Protein GI157163744 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value0.0564298 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAATGA TAAAAAGCTA TGCCGCAAAA GAAGCGGGCG GCGAACTGGA AGTTTATGAG 
TACGATCCCG GTGAGCTGAA GCCACAAGAT GTTGAAGTGC AGGTGGATTA CTGCGGGATC
TGCCATTCCG ATCTGTCGAT GATCGATAAC GAATGGGGAT TTTCACAATA TCCGCTGGTT
GCCGGGCATG AGGTGATTGG TCGCGTGGTG GCGCTCGGGA GTGCCGCGCA GGATAAAGGT
TTGCAGGTCG GTCAGCGTGT CGGGATTGGC TGGACAGCGC GTAGCTGTGG TCACTGCGAC
GCCTGTATTA GCGGAAATCA GATCAACTGT GAGCAAGGTG CGGTGCCAAC AATTATGAAT
CGCGGAGGTT TTGCCGAGAA GTTGCGTGTA GACTGGCAAT GGGTTATTCC ACTGCCGGAA
AATATCGACA TTGAATCTGC CGGGCCGCTG TTGTGCGGCG GTATCACGGT CTTTAAACCA
CTGTTGATGC ACCATATCAC TGCTACCAGC CGCGTTGGGG TAATTGGTAT TGGCGGGCTG
GGGCATATCG CTATAAAACT TCTGCACGCA ATGGGATGTG AGGTGACGGC CTTTAGTTCT
AATCCGGCGA AAGAGCAGGA ATTGCTGGCG ATGGGTGCCG ATAAAGTGGT GAATAGCCGC
GATCCGCAGG CACTGAAAGC ACTGGCGGGG CAGTTTGATC TCATTATCAA TACCGTGAAC
GTCAGCCTCG ACTGGCAGCC TTATTTTGAG GCGCTGACGT ACGGCGGTAA TTTCCACACT
GTCGGTGCGG TTCTCACGCC GCTGTCTGTT CCGGCCTTTA CGTTAATTGC GGGCGATCGC
AGCGTCTCTG GCTCTGCTAC CGGCACGCCT TATGAACTGC GTAAGCTGAT GCGCTTTGCC
GCCCGCAGCA AGGTTGCGCC GACAACCGAA CTGTTCCCGA TGTCGAAAAT TAACGACGCC
ATCCAGCATG TGCGCGACGG TAAGGCGCGT TACCGCGTGG TGTTGAAAGC CGATTTTTGA
 
Protein sequence
MSMIKSYAAK EAGGELEVYE YDPGELKPQD VEVQVDYCGI CHSDLSMIDN EWGFSQYPLV 
AGHEVIGRVV ALGSAAQDKG LQVGQRVGIG WTARSCGHCD ACISGNQINC EQGAVPTIMN
RGGFAEKLRV DWQWVIPLPE NIDIESAGPL LCGGITVFKP LLMHHITATS RVGVIGIGGL
GHIAIKLLHA MGCEVTAFSS NPAKEQELLA MGADKVVNSR DPQALKALAG QFDLIINTVN
VSLDWQPYFE ALTYGGNFHT VGAVLTPLSV PAFTLIAGDR SVSGSATGTP YELRKLMRFA
ARSKVAPTTE LFPMSKINDA IQHVRDGKAR YRVVLKADF