Gene BURPS1106A_A2581 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2581 
SymbolbenA 
ID4904300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2534768 
End bp2536135 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content67% 
IMG OID640145684 
Productbenzoate 1,2 dioxygenase, alpha subunit 
Protein accessionYP_001076611 
Protein GI126455683 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID[TIGR03229] benzoate 1,2-dioxygenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCCGA TCGACCCGGA CCGCCGTGCG ACGCCGCGCC CCATCGACGA TTTCCTCGTC 
GAAGACAAGG CGCGCGGCGA CTACCGGCTG CACCGCAGCG CGTTCACCGA CGAAATGCTG
TTCGAGCTCG AGATGAAGCA CATCTTCGAA GGCAACTGGA TCTATCTCGC GCACGAGAGC
CAGCTCCCGA ACGCGAACGA TTACTACACG ACCACGATCG GCCGCCAGCC GATCGTGATC
GCGCGCAACC GCCACGGCGA GCTGAACGCG TTCGTCAACG CCTGCACGCA CCGCGGCGCG
ATGCTGTGCC GCCACAAGCG CGGCAACCGC GCGAGCTACA CGTGCCCGTT CCACGGCTGG
ACGTTCAGCA ACGGCGGCAA GCTGCTCAAA GTAAAGGACC CCGAAGGAGC CGGCTACCCG
GACTGCTTCA ACCGCGACGG CTCGCACGAT CTGAAGAAAG TCGCGCGCTT CGAGAACTAT
CGCGGCTTCC TGTTCGGCAG CCTGAACCCC GAAGTCGAGC CGCTCGCCGC GCATCTCGGC
GATGCCGCGC GCATCATCGA CATGATCGTC GATCAGTCGG CGGACGGCCT CGAGGTGCTG
CGCGGCTCGT CGACGTACAC GTACGAAGGC AACTGGAAGC TCACCGCCGA GAACGGCGCG
GACGGCTACC ACGTATCGGC CGTTCACTGG AACTACGCGG CGACCGTCAA CCACCGCAAG
ACCGACGCGC AGCACGAAGA CACGATCCGC GCGATGGACG CGGGCAACTG GGGCCGGCAG
GGCGGCGGCT TCTACGCGTT CGATCACGGC CACATGCTGC TGTGGACGCG CTGGGCGAAC
CCGGAGGACC GGCCGAACTT CGATCGCCGC GACGAATTCG CCGCGCGCTG CGGCGGCGAC
GTCGCCGACT GGATGATCCG GAACTCGCGC AACCTGTGCC TGTACCCGAA CGTCTATCTG
ATGGACCAGT TCGGCTCGCA GATCCGCGTG CTGCGCCCGC TCGCCGTCGA TCGCACCGAG
GTCACGATCT ACTGCATCGC GCCGAAGGGC GAGGCGCCCG ACGCGCGCGC GCGGCGCATC
CGCCAGTACG AGGATTTCTT CAACGCGAGC GGAATGGCGA CGCCCGACGA TCTCGAGGAA
TTCCGCGCGT GCCAGCAGGG CTACGCGGGC CGCGCGGTCG AATGGAACGA CATGTGCCGC
GGCGCCTCGC ACTGGATCGA GGGCCCCGAC GAAGCGGCGC GCCGGATCGG CATCCGCCCG
CTGATGAGCG GCGTGAAGAC CGAAGACGAA GGGCTGTACA CGGTCCAGCA CCGCTACTGG
ATCGCGACGA TGAAGCAGGC GCTCGCCGCC GAAAGGAGCG GCGCATGA
 
Protein sequence
MIPIDPDRRA TPRPIDDFLV EDKARGDYRL HRSAFTDEML FELEMKHIFE GNWIYLAHES 
QLPNANDYYT TTIGRQPIVI ARNRHGELNA FVNACTHRGA MLCRHKRGNR ASYTCPFHGW
TFSNGGKLLK VKDPEGAGYP DCFNRDGSHD LKKVARFENY RGFLFGSLNP EVEPLAAHLG
DAARIIDMIV DQSADGLEVL RGSSTYTYEG NWKLTAENGA DGYHVSAVHW NYAATVNHRK
TDAQHEDTIR AMDAGNWGRQ GGGFYAFDHG HMLLWTRWAN PEDRPNFDRR DEFAARCGGD
VADWMIRNSR NLCLYPNVYL MDQFGSQIRV LRPLAVDRTE VTIYCIAPKG EAPDARARRI
RQYEDFFNAS GMATPDDLEE FRACQQGYAG RAVEWNDMCR GASHWIEGPD EAARRIGIRP
LMSGVKTEDE GLYTVQHRYW IATMKQALAA ERSGA