Gene Information |
Locus tag | ECH74115_2422 |
Symbol | aroH |
ID | 6971417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 2294443 |
End bp | 2295489 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 643386292 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_002270774 |
Protein GI | 209398312 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
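The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions are consistent with the values listed in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2294443, 2295489)
print(length_bp)                  # 1047
print(protein_length(length_bp))  # 348
```

Both results agree with the Gene Length (1047 bp) and Protein Length (348 aa) fields in the record.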
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 80 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGAA CTGACGAACT CCGTACTGCG CGTATTGAGA GCCTGGTAAC GCCCGCCGAA CTCGCGCTAC GGTATCCCGT AACGCCTGGC GTCGCCACCC ATGTCACCGA CTCCCGCCGC AGAATTGAAA AAATACTCAA TGGTGAAGAT AAGCGACTGT TGGTCATTAT TGGCCCCTGC TCGATCCACG ATCTCACCGC TGCAATGGAG TACGCCACCC GTCTGCAGTC GCTGCGCAAC CAGTACCAGT CACGGCTGGA AATCGTAATG CGCACCTATT TTGAAAAACC ACGAACTGTT GTCGGCTGGA AAGGACTAAT CTCCGATCCA GATTTAAACG GCAGCTATCG GATAAATCAC GGTCTGGAGC TGGCGCGCAA ATTACTTTTA CAGGTAAATG AGCTGGGCGT CCCAACCGCG ACTGAGTTCC TCGATATGGT GACCGGTCAG TTTATTGCTG ATTTAATCAG TTGGGGCGCG ATTGGCGCAC GTACTACCGA AAGTCAGATC CACCGTGAAA TGGCTTCGGC ACTCTCCTGT CCGGTAGGTT TTAAAAATGG TACCGATGGC AATACGCGGA TTGCTGTGGA TGCTATCCGC GCAGCCCGCG CCAGCCATAT GTTCCTCTCG CCAGACAAAA ATGGTCAGAT GACCATCTAT CAGACCAGCG GCAACCCGTA TGGCCACATT ATTATGCGTG GCGGCAAAAA ACCGAATTAT CATGCCGATG ATATCGCCGC AGCCTGCGAT ACGCTGCACG AGTTTGATTT ACCTGAACAT CTGGTGGTGG ATTTCAGCCA CGGTAACTGC CAGAAGCAGC ACCGTCGCCA GTTAGAAGTT TGTGAGGATA TTTGTCAGCA AATCCGCAAT GGCTCTACGG CGATTGCTGG AATTATGGCG GAAAGTTTCC TGCGCGAAGG AACGCAAAAA ATCGTCGGCG GTCAGCCGCT CACTTACGGT CAATCCATTA CCGACCCGTG TCTGGGCTGG GAAGATACCG AACGCCTGGT CGAAAAACTA GCCTCTGCGG TAGATACCCG CTTCTGA
Protein sequence | MNRTDELRTA RIESLVTPAE LALRYPVTPG VATHVTDSRR RIEKILNGED KRLLVIIGPC SIHDLTAAME YATRLQSLRN QYQSRLEIVM RTYFEKPRTV VGWKGLISDP DLNGSYRINH GLELARKLLL QVNELGVPTA TEFLDMVTGQ FIADLISWGA IGARTTESQI HREMASALSC PVGFKNGTDG NTRIAVDAIR AARASHMFLS PDKNGQMTIY QTSGNPYGHI IMRGGKKPNY HADDIAAACD TLHEFDLPEH LVVDFSHGNC QKQHRRQLEV CEDICQQIRN GSTAIAGIMA ESFLREGTQK IVGGQPLTYG QSITDPCLGW EDTERLVEKL ASAVDTRF
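The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the method used to produce the record: it builds the codon table by hand, ignores the whitespace that groups the bases into blocks of ten, and stops at the first stop codon. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code, so the standard table is used here; alternative start codons are not handled. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAACAGAA CTGACGAACT CCGTACTGCG"))  # MNRTDELRTA
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function should give the full protein, provided the sequence contains only the characters A, C, G, and T.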