Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mpe_A0347 |
Symbol | phhA |
ID | 4786838 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylibium petroleiphilum PM1 |
Kingdom | Bacteria |
Replicon accession | NC_008825 |
Strand | + |
Start bp | 382027 |
End bp | 382914 |
Gene Length | 888 bp |
Protein Length | 295 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 640088902 |
Product | phenylalanine 4-monooxygenase |
Protein accession | YP_001019544 |
Protein GI | 124265540 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3186] Phenylalanine-4-hydroxylase |
TIGRFAM ID | [TIGR01267] phenylalanine-4-hydroxylase, monomeric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.72687 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAACG ACTTCCATGA CGGCGTGAAC AAGGGCGTGG CACCGGTCAC CTACGGGCAG GGCGACCGGC CGCCGCGCGG CGACTACTCG CGCGCGCGGG CCGACTACAG CTGCGAGCAG GACATGGCCC GCTACACCGA GGCCGACCAC GAGACCTACC GCCGTCTCTA TGCCCGCCAG CTGCGGCAGC TGCCTGGCCT GGCCTGCCAG GCATTCATCG ACGCTGTCGA GCAGCTCGGT GCCCCCGACC GCATCCCGCG CTTCAGTGAC ATCTCCGCGC GGCTGTCGAA GGCCACCGGC TGGCAGATCG TCGGCGTGCC CGGCCTGATC CCCGAGGAGG CCTTCTTCGC GCTGCTCGCC CAACGGAAGT TCCCGGTCAC CGACTGGATC CGCACCCCCG AGGAGTTCGA CTACGTCGTC GAGCCCGATG TCTTCCACGA CCTGTTCGGC CACGTGCCCC TGCTGTTCAA CCCGGTGTTC GCCGACTACA TGCAGGCCTA TGGCGCCGGC GGGCTCAAGG CCAGCCGGCT CGACGCCTGC GAGCTGCTGG CGCGCCTGTA CTGGTACACG GTGGAGTTCG GGCTGATCGA CACGCCGCAG GGCCTGCGCG CCTACGGGGC CGGCATCCTG TCGAGCGCCG GCGAGCTGCG CCACGCGGTC CTGTCCCCCG AGCCGCAGCG CATTGCCTTC GATCTGCAGC GGCTGATGCG CACGCTCTAC AAGATCGACA GCTACCAGGC CGGCTACTTC GTGATCGACA GCTTCCGCCA GCTGTTCGAC GCCACCGCGC CGGACTTCAC GCCGGTCTAT GCGGCAGTGC GCCAGCAGCC GCTGGTGGAG GCCGGCATCG TGCTGGACGG CGAGCGCTGC TTCACGCCAG CCGGCTGA
|
Protein sequence | MKNDFHDGVN KGVAPVTYGQ GDRPPRGDYS RARADYSCEQ DMARYTEADH ETYRRLYARQ LRQLPGLACQ AFIDAVEQLG APDRIPRFSD ISARLSKATG WQIVGVPGLI PEEAFFALLA QRKFPVTDWI RTPEEFDYVV EPDVFHDLFG HVPLLFNPVF ADYMQAYGAG GLKASRLDAC ELLARLYWYT VEFGLIDTPQ GLRAYGAGIL SSAGELRHAV LSPEPQRIAF DLQRLMRTLY KIDSYQAGYF VIDSFRQLFD ATAPDFTPVY AAVRQQPLVE AGIVLDGERC FTPAG
|
| |