Gene Bphy_3201 details


Gene Information

Locus tag: Bphy_3201
Symbol:
ID: 6244884
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia phymatum STM815
Kingdom: Bacteria
Replicon accession: NC_010623
Strand:
Start bp: 112803
End bp: 114113
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 59%
IMG OID: 642594995
Product: phthalate 4,5-dioxygenase
Protein accession: YP_001859407
Protein GI: 186472065
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
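
The coordinate and length fields in this record agree with each other, which a little arithmetic confirms. The sketch below assumes that the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; both conventions are inferred from the numbers, not stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, not counting the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(112803, 114113)
print(length)                  # 1311
print(protein_length(length))  # 436
```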


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 70
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGTTC CCACGCAGCC CGTGGTGTTC AAACCGAACT CGCGCGGTGC CGACCGCTAT 
CAATACCTGA CTCAGACTGA CGCGGGAACG CCCACGGGCG AACTGATGCG ACGTTACTGG
CAACCCGTCG CGCTTATTGA CTCGTTGCCG CCGGGGGCTG CGCCGCAGCC GATCCGGATT
CTGGGCGAAG ACCTTGTGCT CTTTCGCGAC GACAAGGATC GGGTCGGGCT TATCGACAGA
AAGTGCGCGC ATCGATGTAC TGACCTGGCG CTTGGGCGCG TCGAGGACGG TGGAATCCGT
TGTCCATATC ACGGATGGCT CTTCGACGTT GAAGGTCGTT GCCTGAGTCA ACCTGCTGAA
GCTTCTGCGA CCGCGAAGGA CCGGATCCGC ATGAAGTCGT ATCCGCTTCA TGAGGCGGCG
GGTGCATTCT GGGCATATAT GGGTCCGGGC GAGCCACCGC TTTTTCCGAA TTACCCCGCG
CTCGCAGGCG GCGCGGAGCA TTGCTACACA ACGCGATGGT TCGGTGACTG CAACTGGTTG
CAGGCGAGCG AGGGCAACAT CGATCCAGTT CACACTTCTT ACCTTCACCA GCTCGAACTG
TCTAGCGAGG ACATGAAAGC ACGCTGGGGT GTGTTCTCGA ATCAATCCCG TCCCGAACTG
GCTGTCGAGG ACACCCGATT CGGTGTCCGA CTGTACACGT TGCGCAAGAT TGACGGAACC
GAACGTTCAT CCATTCGAAT CACGAACTTC GTCATGCCCA ACGCGTGCGC AGTCGGAGGG
TTTGAAGGAT ACCTGGGCGA GGGGGGGCTG ACGATGCTTT GGGATGTCCC GATCGACGAC
CAGCATCACT GGCGGTGGGA ATTCATCTTC CATCGAAGCG GAAAGTTGAA CAAGGCCTCG
CTCGAAGCCC AGTATCAGTC GGAAAAGGAA GAAGGCGACC GGATGCGGCG CAAATGGGAG
GACCTTTACT CCCAGGATCG CGAATCGATG AAGGGAAAGG CGTATCTGGG GCTGGGCGAG
TGCTTCTCGG TACACGACAT TGCTATCACC CAGTCGCAAG GCACGATTCA TCAGCAGGCG
GACGAACACT TGTCGTCTTC GGATATCGCG ATCGTCCGTG CGCGCCGGAT GCTTGACGAA
GCCGCCCGGG TTGTTGCGGA AGGCGGCGAT CCGCGCGGCG TGGTTCGTAC AGATGCCGAC
AATGATTTCC GCGATATGGT CGTTGTAACG GGTGAAATCG AAAACGGCGA CTCGAAGGAA
GCCTATTGCG CTCGCTTCAC GGAAAGCCCG GATCTATTCC GTCCGCAATA G
 
Protein sequence
MNVPTQPVVF KPNSRGADRY QYLTQTDAGT PTGELMRRYW QPVALIDSLP PGAAPQPIRI 
LGEDLVLFRD DKDRVGLIDR KCAHRCTDLA LGRVEDGGIR CPYHGWLFDV EGRCLSQPAE
ASATAKDRIR MKSYPLHEAA GAFWAYMGPG EPPLFPNYPA LAGGAEHCYT TRWFGDCNWL
QASEGNIDPV HTSYLHQLEL SSEDMKARWG VFSNQSRPEL AVEDTRFGVR LYTLRKIDGT
ERSSIRITNF VMPNACAVGG FEGYLGEGGL TMLWDVPIDD QHHWRWEFIF HRSGKLNKAS
LEAQYQSEKE EGDRMRRKWE DLYSQDRESM KGKAYLGLGE CFSVHDIAIT QSQGTIHQQA
DEHLSSSDIA IVRARRMLDE AARVVAEGGD PRGVVRTDAD NDFRDMVVVT GEIENGDSKE
AYCARFTESP DLFRPQ
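
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch, not the database's own pipeline: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and it strips the spaces that group the sequence into blocks of ten. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three ten-base blocks of the gene sequence.
print(translate("ATGAACGTTC CCACGCAGCC CGTGGTGTTC"))  # MNVPTQPVVF
```

Passing the full gene sequence to `translate` should yield the protein sequence listed above, and `gc_fraction` can be compared against the GC content field.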