Gene Bmul_4226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBmul_4226 
Symbol 
ID5769591 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia multivorans ATCC 17616 
KingdomBacteria 
Replicon accessionNC_010086 
Strand
Start bp1260762 
End bp1261889 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content67% 
IMG OID641318527 
Product4-hydroxyphenylpyruvate dioxygenase 
Protein accessionYP_001584199 
Protein GI161520772 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins 
TIGRFAM ID[TIGR01263] 4-hydroxyphenylpyruvate dioxygenase 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.453133 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGCA ATCCCCATCC ACTGTCCAGC GATACGCCGC CCGTCGACGA TCCGGCCGCC 
AATCCGCTCG GGATGGCGGG GATCGAGTTC GTCGAGTTCG CGGCGCCGGT GCCGGACGTG
CTCGCGCAGC GTTTCGAACA GCTCGGCTTC AAGGCGATCG CGCGGCACGT CAGCAAGGCC
GTGACGCTGT ATCGGCAGGG GCAGATGCAT TTCCTGATCA ACGCTGAACC GGACTCGTTC
GCCGCGCGCT ATGCGGAGGA ATACGGGATG GGAGTCTGCG CGATCGGCAT CCGCGTCGCG
AACGCGCGGC GCTCGTTCGA ACGCGCGATC GAACTGGGCG CGTGGGCGTT CGAGGGCGAG
CGCGTCGGCG TCGGTGAGCT GAAGATTCCG GCGATCCAGG GCATCGGCGA TTCGCATCTG
TACTTCGTCG ATCGCTGGCG CGGGCGCAAC GGCCAGCGCG GCGGCGTGGG CGACATCTCG
ATCTTCGACA TCGATTTCCG GCCGATCGAC ATCGCCACCG CGCACACCGA TCTCGATTTC
GCGGGCACCG GGCTGCAGCA GGTCGACCAT TTCACGCAGA CGGTCGGCGC CGGGCGCATG
CAGGAGTGGC TCGACTTCTA CCACGATCTG CTGCACTTCC GCGAGATTCA CGAAATCGAT
GCGCACTGGC ACGTGTCGGA GGAGTCGCGC GTGATGGTGT CGCCGTGCGG CGCGCTGCGG
ATTCCGGTGT ACGAGGAAGG CACGCGCCGC ACCGAGCTGA TGCACGCGTA TCTGCCCGAC
CATCCGGGCG AGGGCGTCCA GCACATCGCG CTCGCGACGG ACGACATCCT CGCGTCGGTC
GACGCGTTGC GGGCGAACGG CGTCGAGTTC ATCGAGCCGC CCGCGCGCTA CTACGACGAG
GTGGACCAGC GGTTGCCGGG GCACGGCGTC GATCTGGACG CGCTGCGCCG TCGTGCGGTA
TTGATCGACG GCGAGATCGG CGAAGACGGC GTGCCGCGCC TGTTCTTCCA GACGTTCGTC
AAGCGCCGGC CCGGCGAAAT CTTCTTCGAG ATCGTGCAGC GCAAGGGGCA TCACGGTTTC
GGCGAAGGCA ATCTCGCGGC GCTCGCGCGC GCTCGCGACG CAAGCTGA
 
Protein sequence
MSGNPHPLSS DTPPVDDPAA NPLGMAGIEF VEFAAPVPDV LAQRFEQLGF KAIARHVSKA 
VTLYRQGQMH FLINAEPDSF AARYAEEYGM GVCAIGIRVA NARRSFERAI ELGAWAFEGE
RVGVGELKIP AIQGIGDSHL YFVDRWRGRN GQRGGVGDIS IFDIDFRPID IATAHTDLDF
AGTGLQQVDH FTQTVGAGRM QEWLDFYHDL LHFREIHEID AHWHVSEESR VMVSPCGALR
IPVYEEGTRR TELMHAYLPD HPGEGVQHIA LATDDILASV DALRANGVEF IEPPARYYDE
VDQRLPGHGV DLDALRRRAV LIDGEIGEDG VPRLFFQTFV KRRPGEIFFE IVQRKGHHGF
GEGNLAALAR ARDAS