Gene EcSMS35_0027 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0027 
SymbolispH 
ID6146607 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp31424 
End bp32374 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content55% 
IMG OID641614928 
Product4-hydroxy-3-methylbut-2-enyl diphosphate reductase 
Protein accessionYP_001742144 
Protein GI170684275 
COG category[I] Lipid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0761] Penicillin tolerance protein 
TIGRFAM ID[TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.300377 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGATCC TGTTGGCCAA CCCGCGTGGT TTTTGTGCCG GGGTAGACCG CGCTATCAGC 
ATTGTTGAAA ACGCGCTGGC CATTTACGGC GCACCGATAT ATGTCCGTCA CGAAGTGGTG
CATAACCGCT ACGTGGTCGA TAGCCTGCGC GAGCGTGGGG CTATCTTTAT TGAGCAGATT
AGCGAAGTAC CGGACGGCGC GATCCTGATT TTCTCCGCAC ACGGTGTTTC TCAGGCGGTA
CGTAACGAAG CGAAAAGCCG TGATTTGACG GTATTCGACG CCACCTGTCC GCTGGTGACC
AAAGTGCATA TGGAAGTCGC CCGCGCCAGT CGCCGTGGCG AAGAATCTAT TCTCATCGGC
CACGCCGGTC ACCCGGAAGT GGAAGGGACA ATGGGTCAGT ACAGCAACCC GGAAGGGGGA
ATGTATCTGG TCGAATCGCC AGACGATGTG TGGAAACTGA CGGTCAAAAA CGAAGAGAAG
CTCTCCTTTA TGACCCAAAC CACGCTGTCG GTGGATGACA CGTCTGATGT GATCGACGCG
CTGCGTAAAC GCTTCCCGAA AATTGTCGGT CCGCGCAAAG ATGACATCTG TTACGCCACG
ACTAACCGTC AGGAAGCGGT ACGCGCCCTG GCAGAACAGG CGGAAGTTGT GCTGGTGGTC
GGTTCGAAAA ACTCCTCCAA CTCCAACCGT CTGGCGGAGC TGGCCCAACG TATGGGCAAA
CGCGCGTTTT TGATTGACGA TGCGAAAGAT ATCCAGGAAG AGTGGGTGAA AGAGGTTAAA
TGCGTCGGCG TGACTGCGGG CGCATCGGCT CCGGATATTC TGGTGCAGAA TGTGGTGGCA
CGTTTGCAGC AGCTGGGTGG TGGTGAAGCC ATTCCGCTGG AAGGCCGTGA AGAAAACATT
GTTTTCGAAG TGCCGAAAGA GCTGCGTGTC GATATTCGTG AAGTCGATTA A
 
Protein sequence
MQILLANPRG FCAGVDRAIS IVENALAIYG APIYVRHEVV HNRYVVDSLR ERGAIFIEQI 
SEVPDGAILI FSAHGVSQAV RNEAKSRDLT VFDATCPLVT KVHMEVARAS RRGEESILIG
HAGHPEVEGT MGQYSNPEGG MYLVESPDDV WKLTVKNEEK LSFMTQTTLS VDDTSDVIDA
LRKRFPKIVG PRKDDICYAT TNRQEAVRAL AEQAEVVLVV GSKNSSNSNR LAELAQRMGK
RAFLIDDAKD IQEEWVKEVK CVGVTAGASA PDILVQNVVA RLQQLGGGEA IPLEGREENI
VFEVPKELRV DIREVD