Gene ECH74115_2090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2090 
Symbol 
ID6970393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1988671 
End bp1989681 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content53% 
IMG OID643385991 
Productalcohol dehydrogenase 
Protein accessionYP_002270480 
Protein GI209396014 
COG category[R] General function prediction only 
COG ID[COG1064] Zn-dependent alcohol dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGCTG CAGTTGTTAC GAAGGATCAT CATGTTGACG TTACGGATAA AACACTGCGC 
TCACTGAAAC ATGGCGAAGC CCTGCTGAAA ATGGAGTGTT GTGGTGTATG TCATACCGAT
CTTCATGTTA AGAATGGCGA TTTTGGTGAC AAAACCGGCG TAATTCTGGG CCATGAAGGG
ATTGGTGTGG TGGCAGAAGT GGGTCCAGGT GTCACCTCAT TAAAACCAGG CGATCGTGCC
AGCGTGGCGT GGTTCTACGA AGGATGCGGT CATTGCGAAT ACTGTAACAG TGGTAACGAA
ACGCTCTGCC GTTCAGTTAA AAATGCCGGA TACAGCGTTG ATGGCGGGAT GGCGGAAGAG
TGCATCGTGG TCGCCGATTA CGCGGTAAAA GTGCCAGATG GTCTGGACTC GGCGGCGGCC
AGCAGCATTA CCTGTGCGGG GGTCACCACC TACAAAGCCG TTAAGCTGTC AAAAATTCGT
CCCGGGCAGT GGATTGCTAT CTACGGTCTT GGCGGTCTGG GTAACCTCGC CCTGCAATAC
GCGAAGAATG TCTTTAACGC GAAAGTGATC GCCATTGATG TCAATGATGA GCAGTTAAAA
CTGGCAACCG AAATGGGTGC AGATTTAGCG ATTAACTCAC GCACCGAAGA CGCCGCCAAA
ATTGTGCAGG AGAAAACCGG TGGCGCTCAC GCTGCGGTGG TAACAGCAGT AGCTAAAGCT
GCGTTTAACT CGGCAGTTGA TGCTGTCCGT GCAGGCGGTC GTGTTGTGGC TGTCGGTCTG
CCGCCGGAGT CTATGAGCCT GGATATCCCA CGTCTTGTGC TGGATGGCAT TGAGGTGGTC
GGTTCGCTGG TCGGCACGCG CCAGGATCTA ACTGAAGCCT TCCAGTTTGC CGCCGAAGGT
AAAGTGGTGC CGAAAGTCGC CCTGCGTCCG TTAGCGGACA TCAACACCAT CTTTACCGAG
ATGGAAGAAG GCAAAATCCG TGGCCGTATG GTGATTGATT TCCGCCGCTA A
 
Protein sequence
MKAAVVTKDH HVDVTDKTLR SLKHGEALLK MECCGVCHTD LHVKNGDFGD KTGVILGHEG 
IGVVAEVGPG VTSLKPGDRA SVAWFYEGCG HCEYCNSGNE TLCRSVKNAG YSVDGGMAEE
CIVVADYAVK VPDGLDSAAA SSITCAGVTT YKAVKLSKIR PGQWIAIYGL GGLGNLALQY
AKNVFNAKVI AIDVNDEQLK LATEMGADLA INSRTEDAAK IVQEKTGGAH AAVVTAVAKA
AFNSAVDAVR AGGRVVAVGL PPESMSLDIP RLVLDGIEVV GSLVGTRQDL TEAFQFAAEG
KVVPKVALRP LADINTIFTE MEEGKIRGRM VIDFRR