Gene Information |
Locus tag | B21_00032 |
Symbol | ispH |
ID | 8115476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli BL21 |
Kingdom | Bacteria |
Replicon accession | NC_012892 |
Strand | + |
Start bp | 30349 |
End bp | 31299 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 644846327 |
Product | hypothetical protein |
Protein accession | YP_002997900 |
Protein GI | 251783596 |
COG category | [I] Lipid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0761] Penicillin tolerance protein |
TIGRFAM ID | [TIGR00216] (E)-4-hydroxy-3-methyl-but-2-enyl pyrophosphate reductase (IPP and DMAPP forming) |
|
|
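The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count, assuming a complete CDS with one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record above.
length_bp = gene_length(30349, 31299)
print(length_bp)                  # 951
print(protein_length(length_bp))  # 316
```

Under those assumptions the computed values agree with the listed Gene Length (951 bp) and Protein Length (316 aa).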
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00595799 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGATCC TGTTGGCCAA CCCGCGTGGT TTTTGTGCCG GGGTAGACCG CGCTATCAGC ATTGTTGAAA ACGCGCTGGC CATTTACGGC GCACCGATAT ATGTCCGTCA CGAAGTGGTA CATAACCGCT ATGTGGTCGA TAGCTTGCGT GAGCGTGGGG CTATCTTTAT TGAGCAGATT AGCGAAGTAC CGGACGGCGC GATCCTGATT TTCTCCGCAC ACGGTGTTTC TCAGGCGGTA CGTAACGAAG CAAAAAGCCG CGATTTGACG GTGTTTGATG CCACCTGTCC GCTGGTGACC AAAGTGCATA TGGAAGTCGC CCGCGCCAGC CGTCGTGGCG AAGAGTCGAT TCTCATCGGC CACGCCGGGC ATCCGGAAGT GGAAGGTACG ATGGGCCAGT ACAGTAACCC GGAAGGGGGA ATGTATCTGG TCGAATCGCC GGACGATGTG TGGAAACTGA CGGTCAAAAA CGAAGAGAAG CTCTCCTTTA TGACCCAGAC CACGCTGTCG GTGGATGACA CGTCTGATGT GATCGACGCG CTGCGTAAAC GCTTCCCGAA AATTGTCGGT CCGCGCAAAG ATGACATCTG CTACGCCACG ACTAACCGTC AGGAAGCGGT ACGCGCCCTG GCAGAACAGG CGGAAGTTGT GTTGGTGGTC GGTTCGAAAA ACTCCTCCAA CTCCAACCGT CTGGCGGAGC TGGCCCAGCG TATGGGCAAA CGCGCGTTTT TGATTGACGA TGCGAAAGAT ATCCAGGAAG AGTGGGTGAA AGAGGTTAAA TGCGTCGGCG TGACTGCGGG CGCATCGGCT CCGGATATTC TGGTGCAAAA TGTGGTGGCA CGTTTGCAGC AGCTGGGCGG TGGTGAAGCC ATTCCGCTGG AAGGCCGTGA AGAAAACATT GTTTTCGAAG TGCCGAAAGA GCTGCGTGTC GATATTCGTG AAGTCGATTA A
|
Protein sequence | MQILLANPRG FCAGVDRAIS IVENALAIYG APIYVRHEVV HNRYVVDSLR ERGAIFIEQI SEVPDGAILI FSAHGVSQAV RNEAKSRDLT VFDATCPLVT KVHMEVARAS RRGEESILIG HAGHPEVEGT MGQYSNPEGG MYLVESPDDV WKLTVKNEEK LSFMTQTTLS VDDTSDVIDA LRKRFPKIVG PRKDDICYAT TNRQEAVRAL AEQAEVVLVV GSKNSSNSNR LAELAQRMGK RAFLIDDAKD IQEEWVKEVK CVGVTAGASA PDILVQNVVA RLQQLGGGEA IPLEGREENI VFEVPKELRV DIREVD
|
| |
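The gene sequence above is printed in space-separated blocks of ten bases, so it usually needs cleaning before analysis. The following is a minimal sketch of how that could be done, together with a GC-fraction calculation and a simple sanity check for a complete CDS. All function names are hypothetical. The ATG start check is a simplification: translation table 11 also permits alternative start codons, although this particular sequence begins with ATG.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: length divisible by 3, ATG start, recognized stop codon."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence in this record.
fragment = clean_sequence("ATGCAGATCC TGTTGGCCAA")
print(fragment)               # ATGCAGATCCTGTTGGCCAA
print(gc_fraction(fragment))  # 0.5
```

Passing the full Gene sequence field through `clean_sequence` and then `gc_fraction` is one way to compare against the GC content value listed in the record.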