Gene Bphy_1081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBphy_1081 
Symbol 
ID6242579 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia phymatum STM815 
KingdomBacteria 
Replicon accessionNC_010622 
Strand
Start bp1217200 
End bp1219086 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content62% 
IMG OID642592860 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_001857316 
Protein GI186475846 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG1082] Sugar phosphate isomerases/epimerases
[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.796659 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.416218 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCCGCT CCATTGCCAC TGTGTCGGTT AGCGGCACGC TCGTCGAGAA GCTCGCTGCG 
ATTCGCGCGG CGGGTTTCGA TGGCGTCGAG ATCTTCGAAA ACGATCTGCT GTATTTCGAC
GGCTCGCCAG CAGATATCCG CAAGCGTTGC GCCGATCTCG GTCTGCAGAT CATGCTGTTC
CAGCCGTTCC GCGATTTCGA AGGCGTCTCG AAGGAGCGGC TTGCGCAGAA CCTTAACCGC
GCAAAACGCA AGTTTGATCT GATGCACGAA CTCGGCACCG ATCTGGTGCT CGTGTGCAGC
AACGTCAACG CAAACGTGAT TGCGGACGAT GCACTGATCG TCGATCAGCT CGGCGAGCTT
GCATCGCTCG CCGAACGCGA AGGTGTGCGG GTCGGCTTCG AAGCCCTCGC GTGGGGCAAG
TATGTGAATT CGTATCGCCA CGCGTGGCGG CTCGTCGATG CCGTCAATCA TCCGAGCCTC
GGGCTGATAC TGGACAGCTT CCATACGCTG TCGATCGACG ATCCTGTCGA TCCCATTGCC
GACATTCCGG GCGATCGGAT TGTGTTCGTG CAGATCGCCG ATGCGCCGCT GCACAAGATG
GACGTTCTCG AATGGAGCCG TCATTACCGG TGCTTTCCGG GGCAGGGCGA CCTCGATGTC
GCCCGCTTCG CGGACCGGGT ACTGGCGACG GGCTATCAGG GCCCCTTTTC GCTGGAGATC
TTCAACGACG GCTTCCGCGC CGCGCCGACG GCGGCGACCG CCGCAGACGG CTATCGCTCG
CTGCTGTACC TCGAAGAGCA GGCGGCGCAT CAGCGCACGG GGCAGAGCGC GCAGCCGCTG
TTCACGCCGC CTGCGCCGCC CGCGCATTCG GGCTTTCAGT TCATTGAGTT CGCCGTCGAT
TCGACCAGCG CGCCGCGTCT TGCGCAACGC TTTGCCGAAG CGGGATTTCA CACGGCGGGC
AAGCATCGCT CGAAGGACGT GACGCTATAT CAGCAAGGCG ATGCGTCGAT CGTGCTGAAC
GCGGAGACGG ATTCATTCGC GAGCGAGTTT TTTCATCGGC ACGGTTTGTC GCTTTGCGCG
TCTGCGTTTC AGGTCGACGA TGCCGCGCGT GTTTTCGAAC GCGCGACGTC GTTCGGCTAC
GCACCGTTTT CGGGCCGTGT CGGTCCCAAC GAGCGCGTCG TGCCCGGTGT GCGTGCACCG
GACGGCAGCC TGCATTACTT CGTCGACGCG CGCCCCGACG AGCCAACGCT TTACGAAGCC
GATTTCGTGC TCGACGCGCA AGCCAACGGG CAGCCAGCCG GGCCGGGCGA TCTGACGCGT
ATCGATCACG TCTGCCTCGA CTTGCCCGCC GATACGCTCG ACACGTGGGT GCTGTATTTC
AAGGCAGTGT TCGGCTTTGA AGCCGAATCG GCCTGGTTGC TTCCCGATCC TTATGGACTG
GTGCGCAGCC GTGCGGTGCG CAGTGCCGAC GGATCGGTGC GGATCGTTCT CAATGCTTCC
GTGGACGGGC GTACGTCGAC GGCGCAATCG CTGCATATAT ATCGCGGTTC TGGCCTGAAT
CATGTGGCGT TTGTCACTGA CGACATCTTC GCGGCCGTCG AACAGTTGCG CGCACGCGAC
GTTCGGCTAT TACGAATACC GTTGAACTAT TACGATGACC TTGAAGCGCG ATACGATTTC
GCCGACGACA TGATTGCCAG GATGAAAGAC GCTCACGTGC TCTATGACCG GGATGCGCAG
GGCGGCGAAT TCTTCCATGT GTACACCGAG CAGATGGACG GACGCTTCTT CCTGGAGGTC
GTGCAGCGCA AGGGCGGCTA CGACGGATAC GGTGCAGTAA ACGCACCTGT CAGGCTCGCC
GCTCAAGCGC AGCGCAAGCA TGGCTAG
 
Protein sequence
MLRSIATVSV SGTLVEKLAA IRAAGFDGVE IFENDLLYFD GSPADIRKRC ADLGLQIMLF 
QPFRDFEGVS KERLAQNLNR AKRKFDLMHE LGTDLVLVCS NVNANVIADD ALIVDQLGEL
ASLAEREGVR VGFEALAWGK YVNSYRHAWR LVDAVNHPSL GLILDSFHTL SIDDPVDPIA
DIPGDRIVFV QIADAPLHKM DVLEWSRHYR CFPGQGDLDV ARFADRVLAT GYQGPFSLEI
FNDGFRAAPT AATAADGYRS LLYLEEQAAH QRTGQSAQPL FTPPAPPAHS GFQFIEFAVD
STSAPRLAQR FAEAGFHTAG KHRSKDVTLY QQGDASIVLN AETDSFASEF FHRHGLSLCA
SAFQVDDAAR VFERATSFGY APFSGRVGPN ERVVPGVRAP DGSLHYFVDA RPDEPTLYEA
DFVLDAQANG QPAGPGDLTR IDHVCLDLPA DTLDTWVLYF KAVFGFEAES AWLLPDPYGL
VRSRAVRSAD GSVRIVLNAS VDGRTSTAQS LHIYRGSGLN HVAFVTDDIF AAVEQLRARD
VRLLRIPLNY YDDLEARYDF ADDMIARMKD AHVLYDRDAQ GGEFFHVYTE QMDGRFFLEV
VQRKGGYDGY GAVNAPVRLA AQAQRKHG