Gene EcolC_3626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_3626 
SymbolispH 
ID6067531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3970995 
End bp3971945 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content55% 
IMG OID641603043 
Product4-hydroxy-3-methylbut-2-enyl diphosphate reductase 
Protein accessionYP_001726566 
Protein GI170021612 
COG category[I] Lipid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0761] Penicillin tolerance protein 
TIGRFAM ID[TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming) 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.887259 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGATCC TGTTGGCCAA CCCGCGTGGT TTTTGTGCCG GGGTAGACCG CGCTATCAGC 
ATTGTTGAAA ACGCGCTGGC CATTTACGGC GCACCGATAT ATGTCCGTCA CGAAGTGGTG
CATAACCGCT ACGTGGTCGA TAGCCTGCGC GAGCGTGGGG CTATCTTTAT TGAGCAGATT
AGCGAAGTGC CGGACGGCGC AATCCTGATT TTCTCCGCAC ACGGTGTTTC TCAGGCGGTA
CGTAACGAAG CGAAAAGCCG TGATTTGACG GTATTTGACG CCACCTGCCC GCTGGTGACC
AAAGTGCATA TGGAAGTCGC CCGCGCTAGT CGCCGTGGCG AAGAATCTAT TCTCATCGGT
CACGCCGGGC ACCCGGAAGT GGAAGGCACG ATGGGTCAGT ACAGCAACCC GGAAGGGGGA
ATGTATCTGG TTGAATCACC AGACGATGTG TGGAAACTGA CGGTCAAAAA CGAAGAGAAA
CTCTCCTTTA TGACCCAGAC CACGCTGTCG GTGGATGACA CGTCTGATGT GATCGACGCG
CTGCGTAAAC GCTTCCCGAA AATTGTCGGT CCGCGCAAAG ATGACATCTG CTACGCCACG
ACTAACCGAC AGGAAGCGGT ACGTGCCCTG GCCGAACAGG CGGAAGTTGT GCTGGTGGTC
GGTTCGAAAA ACTCCTCTAA CTCCAACCGT CTGGCGGAGC TGGCCCAGCG TATGGGCAAA
AGCGCGTTTT TGATTGATGA TGCGAAAGAT ATCCAGGAAG AGTGGGTGAA AGAGGTTAAA
TGCGTCGGCG TGACTGCGGG CGCATCGGCT CCGGATATTC TGGTGCAGAA TGTGGTGGCA
CGTTTGCAAC AGCTGGGCGG TGGTGAAGCC ATTCCGCTGG AAGGACGTGA AGAAAACATT
GTTTTCGAAG TGCCGAAAGA GCTGCGTGTC GATATTCGTG AAGTCGATTA A
 
Protein sequence
MQILLANPRG FCAGVDRAIS IVENALAIYG APIYVRHEVV HNRYVVDSLR ERGAIFIEQI 
SEVPDGAILI FSAHGVSQAV RNEAKSRDLT VFDATCPLVT KVHMEVARAS RRGEESILIG
HAGHPEVEGT MGQYSNPEGG MYLVESPDDV WKLTVKNEEK LSFMTQTTLS VDDTSDVIDA
LRKRFPKIVG PRKDDICYAT TNRQEAVRAL AEQAEVVLVV GSKNSSNSNR LAELAQRMGK
SAFLIDDAKD IQEEWVKEVK CVGVTAGASA PDILVQNVVA RLQQLGGGEA IPLEGREENI
VFEVPKELRV DIREVD