Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0031 |
Symbol | ispH |
ID | 5593742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 29536 |
End bp | 30486 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640919219 |
Product | 4-hydroxy-3-methylbut-2-enyl diphosphate reductase |
Protein accession | YP_001456814 |
Protein GI | 157159496 |
COG category | [I] Lipid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0761] Penicillin tolerance protein |
TIGRFAM ID | [TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 0.423956 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGATCC TGTTGGCCAA CCCGCGTGGT TTTTGTGCCG GGGTAGACCG CGCTATCAGC ATTGTTGAAA ACGCGCTGGC CATTTACGGC GCACCGATAT ATGTCCGTCA CGAAGTGGTA CATAACCGCT ATGTGGTCGA TAGCTTGCGT GAGCGTGGGG CTATCTTTAT TGAGCAGATT AGCGAAGTAC CGGACGGCGC GATCCTGATT TTCTCCGCAC ACGGTGTTTC TCAGGCGGTA CGTAACGAAG CAAAAAGTCG CGATTTGACG GTATTTGACG CGACCTGTCC GCTGGTGACC AAAGTGCATA TGGAAGTCGC CCGCGCTAGT CGCCGTGGCG AAGAATCTAT TCTCATCGGT CACGCCGGGC ACCCGGAAGT GGAAGGCACG ATGGGTCAGT ACAGCAACCC GGAAGGGGGA ATGTATCTGG TTGAATCACC AGACGATGTG TGGAAACTGA CGGTCAAAAA CGAAGAGAAA CTCTCCTTTA TGACCCAGAC CACGCTGTCG GTGGATGACA CGTCTGATGT GATCGACGCG CTGCGTAAAC GCTTCCCGAA AATTGTCGGT CCGCGCAAAG ATGACATCTG CTACGCCACG ACTAACCGAC AGGAAGCGGT ACGTGCCCTG GCCGAACAGG CGGAAGTTGT GCTGGTGGTC GGTTCGAAAA ACTCCTCTAA CTCCAACCGT CTGGCGGAGC TGGCCCAGCG TATGGGCAAA AGCGCGTTTT TGATTGATGA TGCGAAAGAT ATCCAGGAAG AGTGGGTGAA AGAGGTTAAA TGCGTCGGCG TGACTGCGGG CGCATCGGCT CCGGATATTC TGGTGCAGAA TGTGGTGGCA CGTTTGCAAC AGCTGGGCGG TGGTGAAGCC ATTCCGCTGG AAGGACGTGA AGAAAACATT GTTTTCGAAG TGCCGAAAGA GCTGCGTGTC GATATTCGTG AAGTCGATTA A
|
Protein sequence | MQILLANPRG FCAGVDRAIS IVENALAIYG APIYVRHEVV HNRYVVDSLR ERGAIFIEQI SEVPDGAILI FSAHGVSQAV RNEAKSRDLT VFDATCPLVT KVHMEVARAS RRGEESILIG HAGHPEVEGT MGQYSNPEGG MYLVESPDDV WKLTVKNEEK LSFMTQTTLS VDDTSDVIDA LRKRFPKIVG PRKDDICYAT TNRQEAVRAL AEQAEVVLVV GSKNSSNSNR LAELAQRMGK SAFLIDDAKD IQEEWVKEVK CVGVTAGASA PDILVQNVVA RLQQLGGGEA IPLEGREENI VFEVPKELRV DIREVD
|
| |