Gene Information |
Locus tag | EcSMS35_0027 |
Symbol | ispH |
ID | 6146607 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 31424 |
End bp | 32374 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641614928 |
Product | 4-hydroxy-3-methylbut-2-enyl diphosphate reductase |
Protein accession | YP_001742144 |
Protein GI | 170684275 |
COG category | [I] Lipid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0761] Penicillin tolerance protein |
TIGRFAM ID | [TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming) |
| |
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.300377 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGATCC TGTTGGCCAA CCCGCGTGGT TTTTGTGCCG GGGTAGACCG CGCTATCAGC ATTGTTGAAA ACGCGCTGGC CATTTACGGC GCACCGATAT ATGTCCGTCA CGAAGTGGTG CATAACCGCT ACGTGGTCGA TAGCCTGCGC GAGCGTGGGG CTATCTTTAT TGAGCAGATT AGCGAAGTAC CGGACGGCGC GATCCTGATT TTCTCCGCAC ACGGTGTTTC TCAGGCGGTA CGTAACGAAG CGAAAAGCCG TGATTTGACG GTATTCGACG CCACCTGTCC GCTGGTGACC AAAGTGCATA TGGAAGTCGC CCGCGCCAGT CGCCGTGGCG AAGAATCTAT TCTCATCGGC CACGCCGGTC ACCCGGAAGT GGAAGGGACA ATGGGTCAGT ACAGCAACCC GGAAGGGGGA ATGTATCTGG TCGAATCGCC AGACGATGTG TGGAAACTGA CGGTCAAAAA CGAAGAGAAG CTCTCCTTTA TGACCCAAAC CACGCTGTCG GTGGATGACA CGTCTGATGT GATCGACGCG CTGCGTAAAC GCTTCCCGAA AATTGTCGGT CCGCGCAAAG ATGACATCTG TTACGCCACG ACTAACCGTC AGGAAGCGGT ACGCGCCCTG GCAGAACAGG CGGAAGTTGT GCTGGTGGTC GGTTCGAAAA ACTCCTCCAA CTCCAACCGT CTGGCGGAGC TGGCCCAACG TATGGGCAAA CGCGCGTTTT TGATTGACGA TGCGAAAGAT ATCCAGGAAG AGTGGGTGAA AGAGGTTAAA TGCGTCGGCG TGACTGCGGG CGCATCGGCT CCGGATATTC TGGTGCAGAA TGTGGTGGCA CGTTTGCAGC AGCTGGGTGG TGGTGAAGCC ATTCCGCTGG AAGGCCGTGA AGAAAACATT GTTTTCGAAG TGCCGAAAGA GCTGCGTGTC GATATTCGTG AAGTCGATTA A
Protein sequence | MQILLANPRG FCAGVDRAIS IVENALAIYG APIYVRHEVV HNRYVVDSLR ERGAIFIEQI SEVPDGAILI FSAHGVSQAV RNEAKSRDLT VFDATCPLVT KVHMEVARAS RRGEESILIG HAGHPEVEGT MGQYSNPEGG MYLVESPDDV WKLTVKNEEK LSFMTQTTLS VDDTSDVIDA LRKRFPKIVG PRKDDICYAT TNRQEAVRAL AEQAEVVLVV GSKNSSNSNR LAELAQRMGK RAFLIDDAKD IQEEWVKEVK CVGVTAGASA PDILVQNVVA RLQQLGGGEA IPLEGREENI VFEVPKELRV DIREVD
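
The Start bp, End bp, Gene Length and Protein Length fields in this record are related by simple arithmetic. The following is a minimal sketch of that check, assuming coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (consistent with the trailing TAA in the gene sequence). The function name `expected_lengths` is hypothetical and not part of any database API.

```python
def expected_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon encodes one amino acid.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


# Values taken from the record above.
gene_length, protein_length = expected_lengths(31424, 32374)
print(gene_length, protein_length)  # 951 316
```

Both results agree with the Gene Length (951 bp) and Protein Length (316 aa) fields listed in the record.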