Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_A3259 |
Symbol | phhA |
ID | 3748405 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | - |
Start bp | 92403 |
End bp | 93323 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637761522 |
Product | phenylalanine 4-monooxygenase |
Protein accession | YP_367505 |
Protein GI | 78064736 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3186] Phenylalanine-4-hydroxylase |
TIGRFAM ID | [TIGR01267] phenylalanine-4-hydroxylase, monomeric form |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.452524 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTCCCA ACCGCCACGT GCAGAGCCAG ATCATGTCCA CCGTCGTCAC CGCGAAACTG CAAGAGCAGT TCGATGCGGG CCTCGAAACC CGCGCCGATT TCACCATCGA CCAGCCGCTC GCGCGTTACG GCCAGGTCGA CCACGCGGTG TGGAAGCAGC TCTATGCACG CCAGTCGGCG CTGCTGCGCG GGCGCGCGTG CGACGCATTC GTCGCGGGGC TCGGGAAGAT CGACCTGCCG GCCGATCGCG TGCCGTCGTT CGCCGACGTG AACCGCCAGT TGAAGCCGGC AACGGGCTGG GAAATCGTCG CGGTGCCGGG CCTCGTGCCC GACCGCGTGT TCTTCGAGCA TCTCGCGAAC CGGCGCTTCC CCGTCACCTG GTGGATGCGC CGCCCCGACC AGCTCGACTA CCTGCAGGAA CCCGACTGCT TCCACGACCT GTTCGGCCAC GTGCCGCTGC TGATCAACCC GGTGTTCGCC GACTACATGC AGGCGTACGG CCGCACCGCG CTCGCGGTCG CCGACGACGA AGCGGCGCTG GCGCGCCTCG CGCGGCTCTA CTGGTATACG GTGGAATTCG GGCTGATCCG CGATCCGCGC GGCACGAACG GGTTGTCGAT CTACGGTGCC GGGATCGTGT CGAGCAAGGG CGAAAGCCTG TACAGCCTCG AAAGCGCGGC GCCGAACCGG CTCGGCTTCG ACCTCGAGCG CGTGATGCGC ACGAAGTACC GGATCGACAC GTTCCAGAAG ACCTACTTCG TGATCGACGA TTTCGCGCAG CTGTTCGCGC TCGCCGATGT CGACGGCCGT GCACTGGCCG ACCGGCTGGC AGCGCTGCCC GAGTTCGCGG CCGGCGCGGT GCTCGATACC GATCGCGTGC TGCATCGCGG CACGGGCGAA GGCTGGTCCG CCGACGCATG A
|
Protein sequence | MVPNRHVQSQ IMSTVVTAKL QEQFDAGLET RADFTIDQPL ARYGQVDHAV WKQLYARQSA LLRGRACDAF VAGLGKIDLP ADRVPSFADV NRQLKPATGW EIVAVPGLVP DRVFFEHLAN RRFPVTWWMR RPDQLDYLQE PDCFHDLFGH VPLLINPVFA DYMQAYGRTA LAVADDEAAL ARLARLYWYT VEFGLIRDPR GTNGLSIYGA GIVSSKGESL YSLESAAPNR LGFDLERVMR TKYRIDTFQK TYFVIDDFAQ LFALADVDGR ALADRLAALP EFAAGAVLDT DRVLHRGTGE GWSADA
|
| |