Gene EcHS_A0031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0031 
SymbolispH 
ID5593742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp29536 
End bp30486 
Gene Length951 bp 
Protein Length316 aa 
Translation table11 
GC content54% 
IMG OID640919219 
Product4-hydroxy-3-methylbut-2-enyl diphosphate reductase 
Protein accessionYP_001456814 
Protein GI157159496 
COG category[I] Lipid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0761] Penicillin tolerance protein 
TIGRFAM ID[TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming) 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value0.423956 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGATCC TGTTGGCCAA CCCGCGTGGT TTTTGTGCCG GGGTAGACCG CGCTATCAGC 
ATTGTTGAAA ACGCGCTGGC CATTTACGGC GCACCGATAT ATGTCCGTCA CGAAGTGGTA
CATAACCGCT ATGTGGTCGA TAGCTTGCGT GAGCGTGGGG CTATCTTTAT TGAGCAGATT
AGCGAAGTAC CGGACGGCGC GATCCTGATT TTCTCCGCAC ACGGTGTTTC TCAGGCGGTA
CGTAACGAAG CAAAAAGTCG CGATTTGACG GTATTTGACG CGACCTGTCC GCTGGTGACC
AAAGTGCATA TGGAAGTCGC CCGCGCTAGT CGCCGTGGCG AAGAATCTAT TCTCATCGGT
CACGCCGGGC ACCCGGAAGT GGAAGGCACG ATGGGTCAGT ACAGCAACCC GGAAGGGGGA
ATGTATCTGG TTGAATCACC AGACGATGTG TGGAAACTGA CGGTCAAAAA CGAAGAGAAA
CTCTCCTTTA TGACCCAGAC CACGCTGTCG GTGGATGACA CGTCTGATGT GATCGACGCG
CTGCGTAAAC GCTTCCCGAA AATTGTCGGT CCGCGCAAAG ATGACATCTG CTACGCCACG
ACTAACCGAC AGGAAGCGGT ACGTGCCCTG GCCGAACAGG CGGAAGTTGT GCTGGTGGTC
GGTTCGAAAA ACTCCTCTAA CTCCAACCGT CTGGCGGAGC TGGCCCAGCG TATGGGCAAA
AGCGCGTTTT TGATTGATGA TGCGAAAGAT ATCCAGGAAG AGTGGGTGAA AGAGGTTAAA
TGCGTCGGCG TGACTGCGGG CGCATCGGCT CCGGATATTC TGGTGCAGAA TGTGGTGGCA
CGTTTGCAAC AGCTGGGCGG TGGTGAAGCC ATTCCGCTGG AAGGACGTGA AGAAAACATT
GTTTTCGAAG TGCCGAAAGA GCTGCGTGTC GATATTCGTG AAGTCGATTA A
 
Protein sequence
MQILLANPRG FCAGVDRAIS IVENALAIYG APIYVRHEVV HNRYVVDSLR ERGAIFIEQI 
SEVPDGAILI FSAHGVSQAV RNEAKSRDLT VFDATCPLVT KVHMEVARAS RRGEESILIG
HAGHPEVEGT MGQYSNPEGG MYLVESPDDV WKLTVKNEEK LSFMTQTTLS VDDTSDVIDA
LRKRFPKIVG PRKDDICYAT TNRQEAVRAL AEQAEVVLVV GSKNSSNSNR LAELAQRMGK
SAFLIDDAKD IQEEWVKEVK CVGVTAGASA PDILVQNVVA RLQQLGGGEA IPLEGREENI
VFEVPKELRV DIREVD