Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A0411 |
Symbol | mhpA |
ID | 5591963 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 433117 |
End bp | 434781 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640919596 |
Product | 3-(3-hydroxyphenyl)propionate hydroxylase |
Protein accession | YP_001457181 |
Protein GI | 157159863 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 0.0836277 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAATAC AACACCCTGA CATCCAGCCT GCTGTTAACC ATAGCGTTCA GGTGGCGATC GCTGGTGCCG GTCCGGTTGG GCTGATGATG GCGAACTATC TCGGTCAGAT GGGCATTGAC GTGCTGGTGG TGGAGAAACT CGATAAGTTG ATCGACTACC CGCGTGCGAT TGGTATTGAT GACGAGGCGC TGCGCACCAT GCAGTCGGTC GGCCTGGTCG AGAATGTTCT GCCGCACACT ACACCGTGGC ACGCGATGCG TTTTCTCACC CCAAAAGGCC GCTGTTTTGC TGATATTCAG CCAATGACCG ATGAATTTGG CTGGCCGCGC CGTAACGCCT TTATTCAGCC ACAGGTCGAT GCGGTGATGC TGGAAGGGTT GTCGCGTTTT CCGAATGTGC GCTGCTTGTT TGCCCGCGAG CTGGAGGCCT TCAGCCAGCA AAATGACGAA GTGACCTTGC ACCTGAAAAC GGCAGAAGGG CAGCGGGAAA CGGTCAAAGC CCAGTGGCTG GTAGCCTGTG ACGGTGGAGC AAGTTTTGTC CGTCGCACTC TGAATGTGCC GTTTGAAGGT AAAACTGCGC CAAATCAGTG GATTGTGGTA GATATCGCCA ACGATCCGTT AAGTACGCCG CATATCTATT TGTGTTGCGA TCCGGTGCGC CCGTATGTTT CTGCCGCGCT GCCTCATGCG GTACGTCGCT TTGAATTTAT GGTGATGCCG GGAGAAACCG AAGAGCAGCT GCGTGAGCCG CAAAATATGC GCAAGCTGTT AAGCAAAGTG CTGCCTAATC CGGACAATGT TGAATTGATT CGCCAGCGTG TCTACACCCA CAACGCGCGA CTGGCGCAAC GTTTCCGTAT TGATCGCGTA CTGCTGGCGG GCGATGCCGC GCACATCATG CCGGTATGGC AGGGGCAGGG CTATAACAGT GGTATGCGCG ACGCCTTTAA CCTCGCATGG AAACTGGCGT TGGTTATCCA GGGGAAAGCC CGCGATGCGC TGCTCGATAC CTATCAACAA GAACGTCGCG ATCACGCCAA AGCGATGATT GACCTGTCCG TGACGGCGGG CAACGTGCTG GCTCCGCCGA AACGCTGGCA GGGTACGTTA CGTGACGGCG TTTCCTGGCT GTTGAATTAT CTGCCGCCAG TAAAACGCTA CTTCCTCGAA ATGCGCTTCA AGCCGATGCC GCAATATTAC GGCGGTGCGC TGATGCGTGA GGGCGAAGCG AAGCACTCTC CGGTCGGCAA GATGTTTATT CAGCCGAAAG TCACGCTGGA AAACGGCGAC GTGACGCTGC TCGATAACGC GATCGGCGCG AACTTCGCGG TAATTGGCTG GGGATGCAAT CCACTGTGGG GGATGAGCGA CGAGCAAATC CAGCAGTGGC GCGCGTTGGG CACACGCTTC ATTCAGGTGG TGCCGGAAGT GCAAATTCAT ACCGCACAGG ATAACCACGA CGGCGTACTA CGCGTGGGCG ATACGCAAGG TCGCCTGCGT AGCTGGTTCG CGCAACACAA TGCTTCGCTG GTGGTGATGC GCCCGGATCG CTTTGTTGCC GCCACCGCCA TTCCGCAAAC CCTGGGCAAG ACCCTGAATA AACTGGCGTC GGTGATGACG CTGACCCGCC CTGATGCCGA CGTTTCTGTC GAAAAGGTAG CCTGA
|
Protein sequence | MAIQHPDIQP AVNHSVQVAI AGAGPVGLMM ANYLGQMGID VLVVEKLDKL IDYPRAIGID DEALRTMQSV GLVENVLPHT TPWHAMRFLT PKGRCFADIQ PMTDEFGWPR RNAFIQPQVD AVMLEGLSRF PNVRCLFARE LEAFSQQNDE VTLHLKTAEG QRETVKAQWL VACDGGASFV RRTLNVPFEG KTAPNQWIVV DIANDPLSTP HIYLCCDPVR PYVSAALPHA VRRFEFMVMP GETEEQLREP QNMRKLLSKV LPNPDNVELI RQRVYTHNAR LAQRFRIDRV LLAGDAAHIM PVWQGQGYNS GMRDAFNLAW KLALVIQGKA RDALLDTYQQ ERRDHAKAMI DLSVTAGNVL APPKRWQGTL RDGVSWLLNY LPPVKRYFLE MRFKPMPQYY GGALMREGEA KHSPVGKMFI QPKVTLENGD VTLLDNAIGA NFAVIGWGCN PLWGMSDEQI QQWRALGTRF IQVVPEVQIH TAQDNHDGVL RVGDTQGRLR SWFAQHNASL VVMRPDRFVA ATAIPQTLGK TLNKLASVMT LTRPDADVSV EKVA
|
| |