Gene Bphy_5352 details

Gene Information

Locus tag: Bphy_5352
Symbol:
ID: 6246837
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia phymatum STM815
Kingdom: Bacteria
Replicon accession: NC_010623
Strand:
Start bp: 2491991
End bp: 2493358
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 61%
IMG OID: 642597073
Product: benzoate 1,2-dioxygenase, large subunit
Protein accession: YP_001861476
Protein GI: 186474134
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID: [TIGR03229] benzoate 1,2-dioxygenase, large subunit
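
The length fields above can be cross-checked against the coordinates. The following is a minimal Python sketch, not part of the database record; the function names are hypothetical, and it assumes 1-based inclusive coordinates and a gene length that includes the stop codon (which is not counted in the protein length).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2491991, 2493358)
print(length)                     # 1368
print(protein_length_aa(length))  # 455
```

Both values agree with the Gene Length and Protein Length fields of this record.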


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 34
Fosmid unclonability p-value: 0.587151
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCCCA TCTACCCGGA CCGCAGTCCG GCACTCAAGC ACATCGACGA TTACCTCGTC 
GAAGACACGG CCCGCGGCGA CTACCGCCTG CATCGCAGCG CGTTCACCGA CGAAGCGCTG
TTCGAGCTTG AAATGCAGCA TATTTTCGAA GGCAACTGGA TCTATCTCGC ACACGAAAGC
CAGATCCCGT CGAACAACGA TTACTTCACG ACGACGATGG GTCGTCAGCC CGTCGTCATC
ACGCGCAATC GCCAGGGTGA ACTGAGCGCG CTCGTCAATT CGTGCACGCA TCGCGGCGCG
ATGCTGTGCC GCCATAAACG CGGCAACAAG GCGACTTTCA CCTGCCCGTT TCATGGCTGG
ACCTTCAACA ACAGCGGCAA GCTGCTGAAA GTGAAAGACC CTGCCGACGC GGGTTATCCC
GAGTGCTTCA ATCACGAGGG CTCACATGAT CTGAAAAAAG TCGCGCGGTT CGAAAGCTAT
CGCGGCTTTC TGTTCGGCAG CCTGAATCCC GATGTGAAAC CGCTCGCCGA GTATCTCGGC
GAGGCCGCGC GCATCATCGA CATGATCGTC GACCAGTCGG AGCAAGGTCT CGAAGTGCTG
CGCGGGTCGT CGACCTATAC GTATGAAGGC AACTGGAAGC TGACGGCGGA AAACGGCGCA
GACGGTTATC ACGTGTCGGC CGTGCACTGG AACTACGCGG CGACCACGAA CCAGCGCAAG
GAAAAGAACG CCGCCGAAGA CAAGATCCGC GCGATGGATG CGGGCGGCTG GGGCCGTCAG
GGCGGTGGCT TCTACGCGTT CGAACACGGC CATATGCTGC TGTGGTCGCG GTGGGCGAAC
CCGGAAGACC GCCCGAACTT CAATCGCCGC GACGAATTCG CCGCGCGTTG CGGCGGCGAG
ACCGCGGACT GGATGATCCA GAACTCGCGC AATCTGTGTC TTTATCCCAA CGTGTATCTG
ATGGATCAGT TCGGCTCGCA GATTCGCGTG CTGAAGCCGC TGTCCGTCAA CAAGACGGAA
GTGACCATCT ATTGCATCGC GCCCAAGGGC GAATCCGACG ATGCGCGGGC GCGCCGCATC
CGCCAATACG AAGACTTCTT CAACGTGAGC GGCATGGCGA CCCCCGACGA TCTCGAAGAA
TTCCGCGCGT GCCAGCAAGG CTATGCGGCG CGTTCCGTCG AATGGAACGA CATGTGCCGC
GGCGCGACGC ACTGGATCGA GGGCCCGGAC GAAGGCGCGA AGAAGATCGG CCTCAAGCCG
CTGCTCTCCG GCGTGAAGAC GGAAGACGAA GGCCTGTACA CGATCCAGCA CCGCTACTGG
GTCGAAACGA TCCGCGATGC CCTCGCCAGG GAAAGGAGCG CAGCATGA
 
Protein sequence
MIPIYPDRSP ALKHIDDYLV EDTARGDYRL HRSAFTDEAL FELEMQHIFE GNWIYLAHES 
QIPSNNDYFT TTMGRQPVVI TRNRQGELSA LVNSCTHRGA MLCRHKRGNK ATFTCPFHGW
TFNNSGKLLK VKDPADAGYP ECFNHEGSHD LKKVARFESY RGFLFGSLNP DVKPLAEYLG
EAARIIDMIV DQSEQGLEVL RGSSTYTYEG NWKLTAENGA DGYHVSAVHW NYAATTNQRK
EKNAAEDKIR AMDAGGWGRQ GGGFYAFEHG HMLLWSRWAN PEDRPNFNRR DEFAARCGGE
TADWMIQNSR NLCLYPNVYL MDQFGSQIRV LKPLSVNKTE VTIYCIAPKG ESDDARARRI
RQYEDFFNVS GMATPDDLEE FRACQQGYAA RSVEWNDMCR GATHWIEGPD EGAKKIGLKP
LLSGVKTEDE GLYTIQHRYW VETIRDALAR ERSAA
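
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this in Python on the first and last lines of the gene sequence only; it is an illustration, not the database's own pipeline, and the names `translate`, `gc_percent`, and `clean` are hypothetical. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in the permitted start codons), and it strips the 10-base block spacing used in the listing.

```python
from itertools import product

# Standard codon assignments, enumerated in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace (block spacing, line breaks) and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon, 'X' an unknown codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence as listed in this record.
FIRST_LINE = "ATGATCCCCA TCTACCCGGA CCGCAGTCCG GCACTCAAGC ACATCGACGA TTACCTCGTC"
LAST_LINE = "GTCGAAACGA TCCGCGATGC CCTCGCCAGG GAAAGGAGCG CAGCATGA"

print(translate(FIRST_LINE))  # MIPIYPDRSPALKHIDDYLV
print(translate(LAST_LINE))   # VETIRDALARERSAA*
```

The two outputs match the start and end of the protein sequence above, with the trailing `*` marking the TGA stop codon. Passing the full gene sequence to `gc_percent` would give the value to compare against the GC content field.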