Gene Bmul_0223 details

Gene Information

Locus tag: Bmul_0223
Symbol:
ID: 5765722
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia multivorans ATCC 17616
Kingdom: Bacteria
Replicon accession: NC_010084
Strand:
Start bp: 250987
End bp: 252084
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 63%
IMG OID: 641312626
Product: 4-hydroxyphenylpyruvate dioxygenase
Protein accession: YP_001578415
Protein GI: 161523403
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG3185] 4-hydroxyphenylpyruvate dioxygenase and related hemolysins
TIGRFAM ID: [TIGR01263] 4-hydroxyphenylpyruvate dioxygenase


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.376735
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000014808
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCAGATCC CCACCTGGGA CAACCCCGTC GGCACCGACG GCTTCGAATT CATCGAATAC 
ACGGCACCGG ACCCGAAGGC GCTCGGACAA CTGTTCGAAC GGATGGGTTT CACCGCGATC
GCGCGTCACC GCCACAAGGA CGTGACGGTC TACCGCCAGG GCGACATCAA CTTCATCATC
AACGCCGAGC CCGACTCGTT CGCGCAGCGC TTCGCCCGGC TGCACGGCCC GTCGATCTGC
GCGATCGCGT TTCGCGTGCA GGACGCCGCG AAGGCGTACA AGCGCGCGCT CGAACTCGGC
GCATGGGGCT TCGACAACAA GACGGGCCCG ATGGAGCTGA ACATCCCGGC GATCAAGGGC
ATCGGCGATT CGCTGATCTA CTTCGTCGAC CGCTGGCGCG GCAAGAACGG CGCGCAGCCG
GGCGCGATCG GCGACATCAG CATCTACGAC GTCGACTTCG AGCCGATCCC CGGCGCCGAG
CCGAACCCGG TCGGTCACGG GCTCACGTAC ATCGACCACC TGACGCACAA CGTGCATCGC
GGCCGCATGC AGGAATGGGC GGAGTTCTAC GAACGCCTGT TCAACTTCCG CGAAGTGCGC
TACTTCGACA TCGAAGGCAA GGTGACGGGT GTGAAGTCGA AGGCGATGAC GTCGCCGTGC
GGCAAGATCC GCATCCCGAT CAACGAGGAA GGCTCGGACA CCGCCGGCCA GATCCAGGAA
TACCTGGACG CGTACCACGG CGAAGGCATT CAGCACATCG CGCTCGGCAC CAACAACATC
TATGGGGCGG TCGACGGCCT GCGCAGCAAG GAAGTGAAGC TGCTCGACAC GATCGACACC
TATTACGAAC TGGTCGATCG CCGCGTGCCG AACCACGGCG AATCGCTGGA AGAGCTGAAG
AAGCGCAAGA TCCTGATCGA CGGCGCACGC GACGATCTGC TGCTGCAGAT CTTCACCGAA
AACCAGATCG GGCCGATCTT CTTCGAGATC ATCCAGCGCA AGGGCAATCA GGGCTTCGGC
GAGGGCAACT TCAAGGCGCT GTTCGAATCG ATCGAGCTCG ATCAGATCCG CCGCGGCGTC
GTGCAGGACA AGGCCTGA
 
Protein sequence
MQIPTWDNPV GTDGFEFIEY TAPDPKALGQ LFERMGFTAI ARHRHKDVTV YRQGDINFII 
NAEPDSFAQR FARLHGPSIC AIAFRVQDAA KAYKRALELG AWGFDNKTGP MELNIPAIKG
IGDSLIYFVD RWRGKNGAQP GAIGDISIYD VDFEPIPGAE PNPVGHGLTY IDHLTHNVHR
GRMQEWAEFY ERLFNFREVR YFDIEGKVTG VKSKAMTSPC GKIRIPINEE GSDTAGQIQE
YLDAYHGEGI QHIALGTNNI YGAVDGLRSK EVKLLDTIDT YYELVDRRVP NHGESLEELK
KRKILIDGAR DDLLLQIFTE NQIGPIFFEI IQRKGNQGFG EGNFKALFES IELDQIRRGV
VQDKA