Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4834 |
Symbol | mhpA |
ID | 5673175 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5775501 |
End bp | 5777294 |
Gene Length | 1794 bp |
Protein Length | 597 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641243690 |
Product | 3-(3-hydroxyphenyl)propionate hydroxylase |
Protein accession | YP_001509106 |
Protein GI | 158316598 |
COG category | [C] Energy production and conversion [H] Coenzyme transport and metabolism |
COG ID | [COG0654] 2-polyprenyl-6-methoxyphenol hydroxylase and related FAD-dependent oxidoreductases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.447643 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00803206 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | GTGACCGCGC CCGAGCCGGC CGAGGTCGAG GCCGAGGTCG AGGCCGAGGC TGAGGCCATG GCCGACGTCG TCGTCGTCGG CTACGGGCCG GTCGGCCAGG TAACGGCGAT TCTGCTGGCG CGGCGCGGCT GGCGGGTCAC AGTCCTGGAG CGATGGCCCC GGCCGTACCC GATGCCGCGC GCGGTCTCCT TCGACGGCGA GTCGGCCCGC ATCCTCGCCG CGGCCGGCGT CGGCCCGGCG ATGACCGAGT TCGGTGAGCC GTCCCGCGAC TACACCTGGC GCAACGCCGC GGGTGACGTG CTGCTGCACG TCGACGTCCC CGAGCGCGGG CGTTCCGGCT GGCCGGACTC GACGTCGATG TACCAGCCGG CACTGGAGGA GGCGCTCGCG GAGCGGGGCT CCCGACTGCC GGGCCTGCGG GTGCTGCGCG GGCACCGGGT GGTCGCGCTG ACCGAACGGG ACGGCCACGT CGAGCTGACG GCCGTCCTCG ACGGCTCGCC GCCGGACCGC ACCCTACGCC TCCGCGCCCG TTGGGTGGTG GGCTGCGACG GCGCGAACAG CTTCGTGCGG ACGAGCCTGG GCGTGGAGAC CACCGACTTC GCCTACTTCA ACGACTGGCT GACCTGTGAC GTGGTCCTGC GCGAACCGGC GGTGCACAGG CCCAACAACC TCCAGATCTG CGACCCGACC CGGCCCCGCA CGGCCGTGTC CGCCGGGCCG GGGCACCGGC GGTGGGAGTT CATGCGGCTA CCCGGGGAAC CGGCCGAGGA GTTCGGCCGG ACGGAGTCGG CGTGGCGGCT GCTCGGTCTG TTCGGCCTGC ACCCGGGCAA CGCCACACTG GAACGGCACG CCGTCTACAC GTTCCAGGCC AGGTACGTCG ACCGCTGGCG GGTGGGGCGG GTGCTGCTCG CCGGGGACGC CGCGCACCTG ATGCCGCCGT TCGCCGGGCA GGGCATGTGC TCGGGCTTCC GCGACGCGGC GAATCTCTCC TGGAAGCTCG ACCTGGTGCT CGCCGGAACC GCCCCGGCTG AGCTGCTGGA CACCTACACC CTGGAGCGGC GGGCGCACGT CCAGCACGCG ATCGGCATGT CGATGGACCT CGGGCGGGTG ATCTGCGAGA CCGACCCGGC CGCGGCCCGG GACCGGGACG AGGTCATGAT CGCGGTGCGT GCCCGGGGGC TGCGGGAGGA CCGCGCGCAG TCCGCCGTCG AGCCGCTGAC CGCCGGGTTC CTGCGCGGCG GGGCCGCCGC ACACGGCGGT GGCCCCCAGC GTCCCCCGCA GCCTTCCCGC CCGGGGCGTC CGGGGCCGCC GTCGGTTCCG GTGGGTGACC TCGTGCCGCA GGGACGAGTT CGGGTCGGTG ACCGCACCGG GCTGTTCGAC GAGTTCGTCG AGCCCGGGTT CGTCCTGCTG GCCACCTGGC CGGCCGGCGG GCATCTCCAT CTCCGCCCCG AGACGGACGC CGCTTTTGCG CTTCTGGGCG GACGGGTGGT CAATGTTGTC CCCGCCACCG ATGAGGAGCC TCCTGCCGTC GTGGCCGGGA GCGCGGACGC GAGCGCAAGA ATCACCGTCG TCGACGTCGA CGACGTCTAT CTCCGCTATC TCGCCCTCGC CGGCGCCGAT GCCGTTCTGG TGCGGCCGGA CTTCTACCTC TTCGGCCGTG TCCCGGCGGC GGCACCGCCG GCCGCGGACA CCCGGGCCGG CGGTACAGCG GTACAGCGGT CGGATTCCGT CGAGGAACTC GTCGGTGACC TGCTGGCCGC GCTCGACGCC CCGATTCCCG TCGGGAAGCA TTAG
|
Protein sequence | MTAPEPAEVE AEVEAEAEAM ADVVVVGYGP VGQVTAILLA RRGWRVTVLE RWPRPYPMPR AVSFDGESAR ILAAAGVGPA MTEFGEPSRD YTWRNAAGDV LLHVDVPERG RSGWPDSTSM YQPALEEALA ERGSRLPGLR VLRGHRVVAL TERDGHVELT AVLDGSPPDR TLRLRARWVV GCDGANSFVR TSLGVETTDF AYFNDWLTCD VVLREPAVHR PNNLQICDPT RPRTAVSAGP GHRRWEFMRL PGEPAEEFGR TESAWRLLGL FGLHPGNATL ERHAVYTFQA RYVDRWRVGR VLLAGDAAHL MPPFAGQGMC SGFRDAANLS WKLDLVLAGT APAELLDTYT LERRAHVQHA IGMSMDLGRV ICETDPAAAR DRDEVMIAVR ARGLREDRAQ SAVEPLTAGF LRGGAAAHGG GPQRPPQPSR PGRPGPPSVP VGDLVPQGRV RVGDRTGLFD EFVEPGFVLL ATWPAGGHLH LRPETDAAFA LLGGRVVNVV PATDEEPPAV VAGSADASAR ITVVDVDDVY LRYLALAGAD AVLVRPDFYL FGRVPAAAPP AADTRAGGTA VQRSDSVEEL VGDLLAALDA PIPVGKH
|
| |