Gene BURPS1710b_0206 details


Gene Information

Locus tag: BURPS1710b_0206
Symbol: phhA
ID: 3691168
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 1710b
Kingdom: Bacteria
Replicon accession: NC_007434
Strand:
Start bp: 214145
End bp: 215155
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 66%
IMG OID: 637726662
Product: phenylalanine 4-monooxygenase
Protein accession: YP_331622
Protein GI: 76808615
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3186] Phenylalanine-4-hydroxylase
TIGRFAM ID: [TIGR01267] phenylalanine-4-hydroxylase, monomeric form
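
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 214145
END_BP = 215155
GENE_LENGTH_BP = 1011
PROTEIN_LENGTH_AA = 336


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```

Under these assumptions, 215155 - 214145 + 1 gives 1011 bp, and 1011 / 3 - 1 gives 336 residues, matching the listed values.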


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
TTGATGCGCA ATTATCGCAT TTATCCATAC ACAATCGCCC ATTTTGCGCC CAAACCGCGC 
GAGCCCCGCT CTACACTTGC ATGCATCCCC ACCACTGATG CGAAGCAGGG CTACCCCATG
TCCACGGTCG TTACCGCGAA ACTGAAGGAA CAGTTCGACG CGGGCCTCGA AACCCGCGCC
GATTTCACCA TCGATCAGCC GCTCGCCCGC TACGGCGACG TCGACCACGC GGTGTGGACG
CAGTTGTATA CGCGGCAGGC GGCGCTGCTG CGCGGCCGTG CATGCGACGC GTTCATCGAG
GGCCTCGCGC GCATCGGCCT CGCGCCCGAT CGCGTGCCGT CGTTCGCCGA CGTGAACCGG
CGGCTCGAGC CCGCAACCGG CTGGCGCATC GTCGCGGTGC CGGGCCTCGT GCCGGACGCC
GTTTTCTTCG AGCATCTCGC GAACCGGCGG TTTCCGGTCA CCTGGTGGAT GCGCCGCCCG
GACCAGCTCG ATTATCTACA GGAGCCGGAC TGCTTCCACG ATCTGTTCGG CCACGTGCCG
CTGCTGATCG ATCCCGTATT CGCCGACTAC ATGCACGCAT ACGGCCGCGC GGCGCTTCGC
GTCGCCGACG ACGCAAGCGC GCTCGCGCTC CTTGCGCGCC TCTATTGGTA TACGGTCGAA
TTCGGCCTGA TTCGCGACAC GCGCGGCGAA AACGGGCTGC GGATCTACGG CGCGGGCATC
GTGTCGAGCA AGGGCGAAAC GCTCTACAGC CTCGAAAGCA CGTCGCCGAA CCGGATCGGC
TTCGATCTCG AACGCGTGAT GCGGACCCGA TACCGGATCG ACACGTTCCA GAAGACCTAC
TTCGTGATCG ACGATTTCGC GCAACTCTTC GCGCTCGCCG ACCTCGACGC GCGCGCGCTC
GCCGCGCGGC TCGCCGGCGC GCCCGAGCAC GCGGCGGGCG CGGTGCTTGA CGGCGATCAT
GTGCTCACGC GCGGCACCGG TGAAGGCTGG GCAGCCGATG CAGACGCTTG A
 
Protein sequence
MMRNYRIYPY TIAHFAPKPR EPRSTLACIP TTDAKQGYPM STVVTAKLKE QFDAGLETRA 
DFTIDQPLAR YGDVDHAVWT QLYTRQAALL RGRACDAFIE GLARIGLAPD RVPSFADVNR
RLEPATGWRI VAVPGLVPDA VFFEHLANRR FPVTWWMRRP DQLDYLQEPD CFHDLFGHVP
LLIDPVFADY MHAYGRAALR VADDASALAL LARLYWYTVE FGLIRDTRGE NGLRIYGAGI
VSSKGETLYS LESTSPNRIG FDLERVMRTR YRIDTFQKTY FVIDDFAQLF ALADLDARAL
AARLAGAPEH AAGAVLDGDH VLTRGTGEGW AADADA
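
The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. As a minimal sketch, the helpers below strip the whitespace and compute a GC fraction, which could be compared against the GC content field of the record. The function names are hypothetical, and only the first line of the gene sequence is used here as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence as printed in the record.
first_line = "TTGATGCGCA ATTATCGCAT TTATCCATAC ACAATCGCCC ATTTTGCGCC CAAACCGCGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

To check the whole gene, paste all lines of the gene sequence into one string and pass it through the same two functions.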