Gene ECH74115_0031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0031 
SymbolispH 
ID6970368 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp30652 
End bp31602 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content55% 
IMG OID643384112 
Product4-hydroxy-3-methylbut-2-enyl diphosphate reductase 
Protein accessionYP_002268635 
Protein GI209398644 
COG category[I] Lipid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0761] Penicillin tolerance protein 
TIGRFAM ID[TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming) 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGATCC TGTTGGCCAA CCCACGTGGT TTTTGTGCCG GGGTAGACCG CGCTATCAGC 
ATTGTTGAAA ACGCGCTTGC CATTTACGGC GCACCGATAT ATGTCCGTCA CGAAGTGGTG
CATAACCGCT ACGTGGTCGA TAGCCTGCGC GAGCGTGGAG CTATCTTTAT TGAGCAGATC
AGCGAAGTGC CGGACGGCGC GATCCTGATC TTCTCCGCAC ATGGTGTTTC TCAGGCGGTA
CGTAACGAAG CGAAAAGCCG TGATTTGACG GTATTCGACG CCACCTGTCC GCTGGTGACC
AAAGTGCATA TGGAAGTCGC CCGCGCCAGC CGTCGTGGCG AAGAGTCTAT TCTCATCGGT
CACGCCGGGC ACCCGGAAGT GGAAGGGACG ATGGGGCAGT ACAGCAACCC TGAAGGGGGA
ATGTATCTGG TCGAATCGCC TGACGATGTG TGGAAACTGA CGGTCAAAAA CGAAGAGAAG
CTCTCCTTTA TGACCCAAAC CACGCTGTCG GTAGATGACA CGTCTGATGT GATCGACGCG
CTGCGTAAAC GCTTCCCGAA AATTGTCGGT CCGCGCAAAG ATGACATCTG CTACGCCACG
ACTAACCGTC AGGAAGCGGT ACGCGCCCTG GCAGAACAGG CGGAAGTTGT GTTGGTGGTC
GGTTCGAAAA ACTCCTCCAA CTCCAACCGT CTGGCGGAGC TGGCCCAGCG TATGGGCAAA
CGCGCGTTTT TGATTGACGA TGCGAAAGAT ATCCAGGAAG AGTGGGTGAA AGAGGTTAAA
TGCGTCGGCG TGACTGCGGG CGCATCGGCT CCGGATATTC TGGTGCAGAA TGTGGTGGCA
CGTTTGCAGC AGCTGGGTGG TGGTGAAGCC ATTCCGCTGG AAGGCCGTGA AGAAAATATT
GTTTTCGAAG TGCCGAAAGA GCTGCGTGTC GATATTCGTG AAGTCGATTA A
 
Protein sequence
MQILLANPRG FCAGVDRAIS IVENALAIYG APIYVRHEVV HNRYVVDSLR ERGAIFIEQI 
SEVPDGAILI FSAHGVSQAV RNEAKSRDLT VFDATCPLVT KVHMEVARAS RRGEESILIG
HAGHPEVEGT MGQYSNPEGG MYLVESPDDV WKLTVKNEEK LSFMTQTTLS VDDTSDVIDA
LRKRFPKIVG PRKDDICYAT TNRQEAVRAL AEQAEVVLVV GSKNSSNSNR LAELAQRMGK
RAFLIDDAKD IQEEWVKEVK CVGVTAGASA PDILVQNVVA RLQQLGGGEA IPLEGREENI
VFEVPKELRV DIREVD