Gene Information |
Locus tag | EcSMS35_0378 |
Symbol | mhpA |
ID | 6144565 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 392829 |
End bp | 394493 |
Gene Length | 1665 bp |
Protein Length | 554 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641615274 |
Product | 3-(3-hydroxyphenyl)propionate hydroxylase |
Protein accession | YP_001742481 |
Protein GI | 170683478 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
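The coordinate and length fields above are related by simple arithmetic. With inclusive 1-based coordinates, the gene length is End bp minus Start bp plus 1, and a CDS that ends in a stop codon encodes one fewer residue than it has codons. The sketch below checks this for the values in this record. The function names are hypothetical, and the inclusive-coordinate and terminal-stop-codon conventions are assumptions that the record does not state explicitly.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(392829, 394493)
print(length)                  # compare with the "Gene Length" field
print(protein_length(length))  # compare with the "Protein Length" field
```

For this record the two results agree with the listed Gene Length (1665 bp) and Protein Length (554 aa).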
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0214512 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 53 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAATAC AACACCCTGA CATCCAGCCT GCTGTTAACC ATAGCGTTCA GGTGGCGATC GCTGGTGCCG GTCCGGTCGG GCTGATGATG GCGAACTATC TCGGCCAGAT GGGCATTGAC GTGCTGGTGG TGGAGAAACT CGATAAGTTG ATCGACTACC CGCGTGCGAT TGGTATTGAT GACGAGGCGC TGCGCACCAT GCAGTCGGTT GGCCTGGTCG ATAATGTTCT GCCGCACACT ACGCCGTGGC ACGCGATGCG TTTTCTCACC CCAAAAGGTC GCTGTTTTGC TGATATTCAG CCAATGACCG ATGAATTTGG CTGGCCGCGC CGTAACGCCT TTATTCAGCC TCAGGTCGAT GCGGTGATGC TGGAAGGATT GTCGCGTTTT CCGAATGTGC GCTGCCTGTT TTCCCGCGAG CTGGAGGCCT TCAGCCAGCA AGATGACGAA GTGACCTTGC ACCTAAAAAC GGAAGAAGGG CAGCGGGAAA CGGTCAAAGC CCAGTGGCTG GTGGCCTGTG ATGGTGGGGC AAGTTTTGTC CGTCGCACCC TGAATGTGCC GTTTGAAGGT AAAACTGCGC CAAATCAGTG GATTGTGGTA GATATCGCCA ACGATCCGTT AAGTACGCCG CATATCTATT TGTGTTGTGA TCCGGTGCGC CCGTATGTTT CTGCCGCGCT GCCTCATGCG GTACGTCGCT TTGAATTTAT GGTGATGCCG GGAGAAACCG AAGAACAGCT GCGTGAGCCG CAAAATATGC GCAAGCTGTT AAGCAAAGTG CTGCCTAATC CGGACAATGT TGAATTGATT CGCCAGCGTG TCTACACCCA CAACGCGCGA CTGGCGCAAC GTTTTCGTAT TGATCGCGTA CTGCTGGCGG GCGATGCCGC GCACATCATG CCGGTGTGGC AGGGGCAGGG CTATAACAGT GGTATGCGCG ACGCCTTTAA CCTCGCATGG AAACTGGCGT TGGTTATCCA GGGGAAAGCC CGCGATGCGC TGCTCGATAC CTATCAACAA GAACGTCGCG ATCACGCCAA AGCGATGATT GACCTGTCCG TGACGGCGGG CAACGTGCTG GCTCCGCCGA AACGCTGGCA GGGTACGTTA CGTGACGGCG TTTCCTGGCT GTTGAATTAT CTGCCGCCAG TAAAACGCTA CTTCCTCGAA ATGCGCTTCA AGCCGATGCC GCAATATTAC GGCGGTGCGC TGGTGCGTGA GGGCGAAGCG AAGCACTCTC CGGTCGGCAA GATGTTTATT CAGCCGAAAG TCACGCTGGA AAACGGCGAC GTGACGCTGC TCGATAACGC GATCGGCGCG AACTTCGCGG TAATTGGCTG GGGATGCAAT CCACTGTGGG GGATGAGCGA CGAGCAAATC CAGCAGTGGC GCGCGTTGGG CACACGCTTC ATTCAGGTGG TGCCGGAAGT GCAAATTCAT ACCGCACAGG ATAACCACGA CGGCGTACTA CGCGTGGGCG ATACGCAAGG TCGCCTGCGT AGCTGGTTCG CGCAACACAA TGCTTCGCTG GTGGTGATGC GCCCGGATCG CTTTGTTGCC GCCACCGCCA TTCCGCAAAC CCTGGGCAAG ACCCTGAATA AACTGGCGTC GGTGATGACG CTGACCCGCC CTGATGCCGA CGTTTCTGTC GAAAAGGTAG CCTGA
|
Protein sequence | MAIQHPDIQP AVNHSVQVAI AGAGPVGLMM ANYLGQMGID VLVVEKLDKL IDYPRAIGID DEALRTMQSV GLVDNVLPHT TPWHAMRFLT PKGRCFADIQ PMTDEFGWPR RNAFIQPQVD AVMLEGLSRF PNVRCLFSRE LEAFSQQDDE VTLHLKTEEG QRETVKAQWL VACDGGASFV RRTLNVPFEG KTAPNQWIVV DIANDPLSTP HIYLCCDPVR PYVSAALPHA VRRFEFMVMP GETEEQLREP QNMRKLLSKV LPNPDNVELI RQRVYTHNAR LAQRFRIDRV LLAGDAAHIM PVWQGQGYNS GMRDAFNLAW KLALVIQGKA RDALLDTYQQ ERRDHAKAMI DLSVTAGNVL APPKRWQGTL RDGVSWLLNY LPPVKRYFLE MRFKPMPQYY GGALVREGEA KHSPVGKMFI QPKVTLENGD VTLLDNAIGA NFAVIGWGCN PLWGMSDEQI QQWRALGTRF IQVVPEVQIH TAQDNHDGVL RVGDTQGRLR SWFAQHNASL VVMRPDRFVA ATAIPQTLGK TLNKLASVMT LTRPDADVSV EKVA
|
| |
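The sequences above are printed in space-separated blocks of 10, so they need to be joined before any analysis. The following is a minimal sketch of how the gene sequence could be cleaned and checked against the GC content field. The helper names are hypothetical, and the start/stop check is a simplification: it accepts only ATG as a start codon, although translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean_sequence(text: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def has_start_and_stop(seq: str) -> bool:
    """True if seq is a whole number of codons, starts with ATG, ends with a stop."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence, used here only as a short excerpt.
excerpt = clean_sequence("ATGGCAATAC AACACCCTGA")
print(len(excerpt), gc_percent(excerpt))
```

Applying `gc_percent` to the full cleaned gene sequence gives a value to compare with the GC content field, and `has_start_and_stop` confirms that the CDS begins with ATG and ends with a stop codon.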