Gene BTH_II0473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II0473 
Symbol 
ID3845402 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp562307 
End bp563674 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content65% 
IMG OID637837778 
Productbenzoate 1,2-dioxygenase, alpha subunit 
Protein accessionYP_438673 
Protein GI83716643 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit 
TIGRFAM ID[TIGR03229] benzoate 1,2-dioxygenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCCGA TCCATCCGGA TCGCCATCCG ACGCCGCGCA GCCTCGACGA GTTCCTCGTC 
GAAGACAAGG CCTGCGGCGA CTACCGGCTG CACCGCAGCG CGTTCACCGA CGAAATGCTG
TTCGAGCTCG AGATGAAGCA CATCTTCGAA GGCAACTGGA TCTACCTCGC GCACGAGAGC
CAGATCCCGA ACGCGAACGA TTACTACACG ACCACGATCG GCCGCCAGCC GATCGTGATC
GCGCGCAACC GCCAGGGCGA GCTGAACGCG TTCGTCAACG CGTGCACGCA TCGCGGCGCG
ATGCTGTGCC GGCACAAGCG AGGCAATCGC GCGAGCTACA CGTGCCCGTT CCACGGCTGG
ACGTTCAGCA ACAGCGGCAA GCTGCTCAAA GTGAAGGACC CCGAAGGAGC CGGCTATCCG
GACTGCTTCA ATCGCGACGG CTCGCACGAT CTGAAGAAGA TCGCGCGCTT CGAGAACTAT
CGCGGCTTCC TGTTCGGCAG CCTGAATCCG GACGTCGAGC CGCTCGCCGC GTACCTCGGC
GACGCCGCGC GCATCATCGA CATGATCGTC GATCAGTCGG CGGACGGCCT CGAAGTGCTG
CGCGGCTCGT CGACGTACAC ATACGAAGGC AACTGGAAGC TCACCGCCGA GAACGGCGCG
GACGGCTACC ACGTATCGGC CGTTCACTGG AACTACGCGG CGACCGTCAA TCACCGCAAG
ACGGACGCGC AGCACGAAGA CGCGATCCGC GCGATGGATG CGGGCAACTG GGGCCGCCAG
GGCGGCGGCT TTTATGCGTT CGATCACGGC CACATGCTGC TGTGGACGCG CTGGGCGAAT
CCGGAGGACC GGCCGAACTT CGATCGCCGC GACGAATTCG CCGCACGCTG CGGCGGCGAC
GTCGCCGACT GGATGATCCG GAACTCGCGC AACCTGTGCC TGTACCCGAA CGTCTATCTG
ATGGACCAGT TCGGCTCGCA GATCCGCGTG CTGCGCCCGC TCGCCGTCGA TCGCACCGAA
GTCACGATCT ACTGCATCGC GCCGAAGGAA GAAGCGCCCG ACGCGCGCGC GCGACGCATC
CGCCAATATG AGGATTTCTT CAACGCGAGC GGGATGGCGA CGCCCGACGA TCTCGAGGAA
TTCCGCGCAT GCCAGCAGGG CTACGCGGGC CGCGCGGTCG AATGGAACGA CATGAGCCGC
GGCGCTTCGC ACTGGATCGA GGGCCCCGAC GAAGCGGCGC GCCGGATCGG CATCCGGCCG
CTGATGAGCG GCGTGAAAAC CGAGGACGAG GGGCTCTACA CGGTCCAGCA CCGCTACTGG
ATCGCGACGA TGAAGCGCGC GCTCGCCGCC GAAAGGAGCC GCGCATGA
 
Protein sequence
MIPIHPDRHP TPRSLDEFLV EDKACGDYRL HRSAFTDEML FELEMKHIFE GNWIYLAHES 
QIPNANDYYT TTIGRQPIVI ARNRQGELNA FVNACTHRGA MLCRHKRGNR ASYTCPFHGW
TFSNSGKLLK VKDPEGAGYP DCFNRDGSHD LKKIARFENY RGFLFGSLNP DVEPLAAYLG
DAARIIDMIV DQSADGLEVL RGSSTYTYEG NWKLTAENGA DGYHVSAVHW NYAATVNHRK
TDAQHEDAIR AMDAGNWGRQ GGGFYAFDHG HMLLWTRWAN PEDRPNFDRR DEFAARCGGD
VADWMIRNSR NLCLYPNVYL MDQFGSQIRV LRPLAVDRTE VTIYCIAPKE EAPDARARRI
RQYEDFFNAS GMATPDDLEE FRACQQGYAG RAVEWNDMSR GASHWIEGPD EAARRIGIRP
LMSGVKTEDE GLYTVQHRYW IATMKRALAA ERSRA