Gene BURPS668_A2725 details


Gene Information

Locus tag: BURPS668_A2725
Symbol: benA
ID: 4888482
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2607593
End bp: 2608960
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 67%
IMG OID: 640132661
Product: benzoate 1,2 dioxygenase, alpha subunit
Protein accession: YP_001063717
Protein GI: 126442319
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID: [TIGR03229] benzoate 1,2-dioxygenase, large subunit
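
The start, end, and length fields of this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2607593, 2608960)
print(length_bp, protein_length(length_bp))  # prints "1368 455"
```

Both values agree with the Gene Length and Protein Length fields of the record.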


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCCCGA TCGACCCGGA CCGCCGTGCG ACGCCGCGCC CCATCGACGA TTTCCTCGTC 
GAAGACAAGG CGCGCGGCGA CTACCGGCTG CACCGCAGCG CGTTCACCGA CGAAATGCTG
TTCGAGCTCG AGATGAAGCA CATCTTCGAA GGCAACTGGA TCTATCTCGC GCACGAGAGC
CAGCTCCCGA ACGCGAACGA TTACTACACG ACCACGATCG GCCGCCAGCC GATCGTGATC
GCGCGCAACC GCCACGGCGA GCTGAACGCG TTCGTCAACG CCTGCACGCA CCGCGGCGCG
ATGCTGTGCC GCCACAAGCG CGGCAACCGC GCGAGCTACA CGTGCCCGTT CCACGGCTGG
ACGTTCAGCA ACGGCGGCAA GCTGCTCAAA GTGAAGGACC CCGAAGGAGC CGGCTACCCG
GACTGCTTCA ACCGCGACGG CTCGCACGAT CTGAAGAAAG TCGCGCGCTT CGAGAACTAT
CGCGGCTTCC TGTTCGGCAG CCTGAACCCC GAAGTCGAGC CGCTCGCCGC GCATCTCGGC
GATGCCGCGC GCATCATCGA CATGATCGTC GATCAGTCGG CGGACGGCCT CGAGGTGCTG
CGCGGCTCGT CGACGTACAC GTACGAAGGC AACTGGAAGC TCACCGCCGA GAACGGCGCG
GACGGCTACC ACGTATCGGC CGTTCACTGG AACTACGCGG CGACCGTCAA CCACCGCAAG
ACCGACGCGC AGCACGAAGA CACGATCCGC GCGATGGACG CGGGCAACTG GGGCCGGCAG
GGCGGCGGCT TCTACGCGTT CGATCACGGC CACATGCTGC TGTGGACGCG CTGGGCGAAC
CCGGAGGACC GGCCGAACTT CGATCGCCGC GACGAATTCG CCGCGCGCTG CGGCGGCGAC
GTCGCCGACT GGATGATCCG GAACTCGCGC AACCTGTGTC TGTACCCGAA TGTCTATCTG
ATGGACCAGT TCGGCTCGCA GATCCGCGTG CTGCGCCCGC TCGCCGTCGA TCGCACCGAG
GTCACGATCT ACTGCATCGC GCCGAAGGGC GAGGCGCCCG ACGCGCGCGC GCGGCGCATC
CGCCAGTACG AGGATTTCTT CAACGCGAGC GGAATGGCGA CGCCCGACGA TCTCGAGGAA
TTCCGCGCGT GCCAGCAGGG CTACGCGGGC CGCGCGGTCG AATGGAACGA CATGTGCCGC
GGCGCCTCGC ACTGGATCGA GGGCCCCGAC GAAGCGGCGC GCCGGATCGG CATCCGCCCG
CTGATGAGCG GCGTGAAGAC CGAGGACGAA GGGCTGTACA CGGTCCAGCA CCGCTACTGG
ATCGCGACGA TGAAGCAGGC GCTCGCCGCC GAAAGGAGCG GCGCATGA
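
The gene sequence is displayed in blocks of 10 bases, 60 per line, so it has to be joined before any analysis. The sketch below shows one way to strip that layout and compute a GC percentage like the one reported in the record. The helper names `clean_sequence` and `gc_percent` are illustrative assumptions, not database functions.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence above.
first_line = "ATGATCCCGA TCGACCCGGA CCGCCGTGCG ACGCCGCGCC CCATCGACGA TTTCCTCGTC"
seq = clean_sequence(first_line)
print(len(seq))  # prints 60
```

Passing the full 1368 bp sequence through `clean_sequence` and then `gc_percent` gives a value that can be compared with the GC content field.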
 
Protein sequence
MIPIDPDRRA TPRPIDDFLV EDKARGDYRL HRSAFTDEML FELEMKHIFE GNWIYLAHES 
QLPNANDYYT TTIGRQPIVI ARNRHGELNA FVNACTHRGA MLCRHKRGNR ASYTCPFHGW
TFSNGGKLLK VKDPEGAGYP DCFNRDGSHD LKKVARFENY RGFLFGSLNP EVEPLAAHLG
DAARIIDMIV DQSADGLEVL RGSSTYTYEG NWKLTAENGA DGYHVSAVHW NYAATVNHRK
TDAQHEDTIR AMDAGNWGRQ GGGFYAFDHG HMLLWTRWAN PEDRPNFDRR DEFAARCGGD
VADWMIRNSR NLCLYPNVYL MDQFGSQIRV LRPLAVDRTE VTIYCIAPKG EAPDARARRI
RQYEDFFNAS GMATPDDLEE FRACQQGYAG RAVEWNDMCR GASHWIEGPD EAARRIGIRP
LMSGVKTEDE GLYTVQHRYW IATMKQALAA ERSGA
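
The protein sequence follows from the gene sequence by codon translation. The sketch below is a minimal translator, not the database's own method. It uses the standard amino-acid assignments, which translation table 11 shares with table 1 (the two differ only in permitted start codons). It does not handle alternative start codons, which is sufficient here because this gene begins with ATG. All names are illustrative.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G).
# Translation table 11 uses these same assignments; it differs from
# table 1 only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGATCCCGA TCGACCCGGA C"))  # prints "MIPIDPD"
print(translate("GAAAGGAGCG GCGCATGA"))      # prints "ERSGA"
```

The two fragments are the first 21 and last 18 bases of the gene sequence, and their translations match the first seven and last five residues of the protein sequence.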