Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0422 |
Symbol | mhpA |
ID | 6968471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 430810 |
End bp | 432474 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643384474 |
Product | 3-(3-hydroxyphenyl)propionate hydroxylase |
Protein accession | YP_002268988 |
Protein GI | 209398053 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAATAC AACACCCTGA CATCCAGCCT GCTGTTAACC ATAGCGTTCA GGTGGCGATC GCTGGTGCCG GTCCGGTCGG GCTGATGATG GCGAACTATC TCGGCCAGAT GGGCATTGAC GTGCTGGTGG TGGAGAAACT CGATAAGTTG ATCGACTACC CGCGTGCGAT TGGTATTGAT GACGAGGCGC TGCGCACCAT GCAGTCGGTC GGCCTGGTCG ATAATGTTCT GCCGCATACT ACGCCGTGGC ACGCGATGCG TTTTCTCACC CCGAAGGGCC GCTGTTTTGC TGATATTCAG CCAATGACCG ATGAATTTGG CTGGCCGCGC CGTAACGCCT TTATTCAGCC GCAGGTCGAT GCGGTGATGC TGGAAGGGGT GTCGCGTTTT CCGAATGTGC GCTGCTTGTT TTCCCGCGAG CTGGAGGCCT TCAGTCAGCA AGATGACGAA GTGACCTTGC ACCTGAAAAC GGCAGAAGGG CTGCGGGAAA TAGTCAAAGC CCAGTGGCTG GTGGCCTGTG ATGGTGGGGC AAGTTTTGTC CGTCGCACCC TGAATGTGCC GTTTGAAGGT AAAACTGCGC CAAATCAGTG GATTGTGGTA GATATCGCCA ACGATCCGTT AAGTACGCCG CATATCTATT TGTGTTGCGA TCCGGTGCGC CCGTATGTTT CTGCCGCGCT ACCTCATGCG GTACGTCGCT TTGAATTTAT GGTGATGCCG GGAGAAACCG AAGAACAGCT GCGTGAGCCG CAAAATATGC GCAAGCTGTT AAGCAAAGTG CTGCCTAATC CGGACAATGT TGAATTGATT CGCCAGCGTG TCTACACCCA CAACGCGCGA CTGGCGCAAC GTTTCCGTAT TGATCGCGTA CTGCTGGCGG GCGATGCCGC GCACATCATG CCGGTATGGC AGGGGCAGGG CTATAACAGT GGTATGCGCG ACGCCTTTAA CCTCGCATGG AAACTGGCGT TGGTTATCCA GGGGAAAGCC CGCGATGCGC TGCTCGATAC CTATCAACAA GAACGTCGCG ATCACGCCAA AGCGATGATT GACCTGTCCG TGACGGCGGG CAACGTGCTG GCTCCGCCGA AACGCTGGCA GGGTACGTTA CGTGACGGCG TTTCCTGGCT GCTGAATTAT CTGCCGCCAG TAAAACGCTA CTTCCTCGAA ATGCGCTTCA AGCCGATGCC GCAATATTAC GGCGGTGCGC TGGTGCGTGA GGGCGAAGCG AAGCACTCTC CGGTCGGCAA GATGTTTATT CAGCCGAAAG TCACGCTGGA AAACGGCGAC GTGACGCTGC TCGATAACGC GATCGGCGCG AACTTCGCGG TAATTGGCTG GGGATGCAAT CCACTGTGGG GGATGAGCGA CGAGCAAATC CAGCAGTGGC GCGCGTTGAG CACACGCTTC ATTCAGGTGG TGCCGGAAGT GCAAATTCAT ACCGCACAGG ATAACCACGA CGGCGTACTA CGCGTGGGGG ATACGCAAGG TCGCCTGCGT AGCTGGTTCG CGCAACACAA TGCTTCGCTG GTGGTGATGC GCCCGGATCG CTTTGTTGCC GCCACCGCCA TTCCGCAAAC ACTGGGCAAG ACCCTGAATA AACTGGCGTC GGTGATGACG CTGACCCGCC CTGATGCCGA CGTTTCTGTC GAAAAGGTAG CCTGA
|
Protein sequence | MAIQHPDIQP AVNHSVQVAI AGAGPVGLMM ANYLGQMGID VLVVEKLDKL IDYPRAIGID DEALRTMQSV GLVDNVLPHT TPWHAMRFLT PKGRCFADIQ PMTDEFGWPR RNAFIQPQVD AVMLEGVSRF PNVRCLFSRE LEAFSQQDDE VTLHLKTAEG LREIVKAQWL VACDGGASFV RRTLNVPFEG KTAPNQWIVV DIANDPLSTP HIYLCCDPVR PYVSAALPHA VRRFEFMVMP GETEEQLREP QNMRKLLSKV LPNPDNVELI RQRVYTHNAR LAQRFRIDRV LLAGDAAHIM PVWQGQGYNS GMRDAFNLAW KLALVIQGKA RDALLDTYQQ ERRDHAKAMI DLSVTAGNVL APPKRWQGTL RDGVSWLLNY LPPVKRYFLE MRFKPMPQYY GGALVREGEA KHSPVGKMFI QPKVTLENGD VTLLDNAIGA NFAVIGWGCN PLWGMSDEQI QQWRALSTRF IQVVPEVQIH TAQDNHDGVL RVGDTQGRLR SWFAQHNASL VVMRPDRFVA ATAIPQTLGK TLNKLASVMT LTRPDADVSV EKVA
|
| |