Gene Bxe_C1009 details


Gene Information

Locus tag: Bxe_C1009
Symbol:
ID: 4010522
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia xenovorans LB400
Kingdom: Bacteria
Replicon accession: NC_007953
Strand:
Start bp: 1046904
End bp: 1047965
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 11
GC content: 57%
IMG OID: 637953612
Product: vanillate demethylase
Protein accession: YP_556232
Protein GI: 91781025
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
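
The listed coordinates and lengths can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1046904
end_bp = 1047965

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# One codon per three bases; the last codon is assumed to be the stop codon,
# so it does not contribute an amino acid.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, protein_length, remainder)  # 1062 353 0
```

Under these assumptions the result agrees with the listed gene length (1062 bp) and protein length (353 aa).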


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.0048453
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAATTTT TGCGCAATAC CTGGTACGCC GCGGGCTGGT CGCAGGATCT TGCAGCCGGC 
GCAATGCTGG GGCGGACCAT GCTCAACGAA CAACTCGTGC TTTTTCGTGC GGACGACGGC
ACGGTATCGG CGCTATCGGA TATCTGTCCC CATCGTTTTG CCCCCTTGCA CCTCGGGAAG
ATCGTCGACG GCTGCCGCAT TCAGTGTGCT TATCACGCAC TCGAATTCGA TGGAACCGGC
GCGTGCGTGA AGAATCCGCA CGGCAAGCAG AAGATTCCGG CGGCGGCAAA GTTGCAGGCC
TATCCGGTCG TGGAAAAACA TTCGTTGATC TGGGTGTGGA TGGGTGAACA GGCGGCCGCC
GATCCGTCAG TCATTCCCGA CTTCAGCATG CTCGATCCTG ACTCGGGTTT TCAGGTAAGT
CGCCGCGACT GGCTTCACAT GGACGCGAGC TACGACCTGG TGGTCGACAA CCTGATGGAT
CTGAGTCACA CGGCGTTTCT CCACGACGGC ATCCTCGGCA GCAAATACAC GATCAAGGCG
GACACATCAC TGGAGCAAAC CGGTGAGACC GTGAAAGTTA CGCGCTTGAT GCCGAACGTC
CCTGTTCCCG GCTTCTTTGA TCTGATGTTC AATCGCGACG GCGGCATCGT CGACTATTGG
ACAGAGATCA GATGGAACCT GCCGGGCTGC CTGATGAACA ACACCGGTGT CACGCTTCCT
GGCGCACCGC GTTCAGAAGG GACAGGCGTC TATGGCATGC ATTTCCTGAC GCCGGAGACG
GATGTCAGCT GTTGGTATCA CTTCGCCGCA GTTCGCCAGA ACCCAAGAAC CTGGGGCGAA
CCTATCGACA CCGAGATAAA GGAAAAAATC TCCGATTTGC GCCGCTATGC GTTCGAGGAA
CAGGACCAGT GGATCATTAA AGCTCAGCAA CAAACGATCC TTCGTGCAAA AGGCAACTTG
CAGCCCGTTA GCCTCGAGAC CGACATTGGG ATTGAACGGT ATAAAAGAAT CCTGAAGGCC
GCGTTGGTTG CCGAACGATC GGGATCGACA ATGGCTGCCT GA
 
Protein sequence
MEFLRNTWYA AGWSQDLAAG AMLGRTMLNE QLVLFRADDG TVSALSDICP HRFAPLHLGK 
IVDGCRIQCA YHALEFDGTG ACVKNPHGKQ KIPAAAKLQA YPVVEKHSLI WVWMGEQAAA
DPSVIPDFSM LDPDSGFQVS RRDWLHMDAS YDLVVDNLMD LSHTAFLHDG ILGSKYTIKA
DTSLEQTGET VKVTRLMPNV PVPGFFDLMF NRDGGIVDYW TEIRWNLPGC LMNNTGVTLP
GAPRSEGTGV YGMHFLTPET DVSCWYHFAA VRQNPRTWGE PIDTEIKEKI SDLRRYAFEE
QDQWIIKAQQ QTILRAKGNL QPVSLETDIG IERYKRILKA ALVAERSGST MAA
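
As a rough consistency check between the two sequences, the gene sequence can be translated codon by codon and compared with the protein sequence. The sketch below is a minimal, hypothetical helper and not the database's own tooling. It uses the standard amino-acid assignments, which translation table 11 shares with the standard code, and it does not handle alternative start codons. The function names `translate` and `gc_percent` are illustrative.

```python
# Codon table built from the standard amino-acid string, ordered by TCAG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 and last 12 bases of the gene sequence listed above.
first_block = "ATGGAATTTT TGCGCAATAC CTGGTACGCC"
last_block = "ATGGCTGCCT GA"

print(translate(first_block))  # MEFLRNTWYA
print(translate(last_block))   # MAA
```

Both fragments match the start and end of the protein sequence shown above. Passing the full gene sequence to `translate` and `gc_percent` would extend the same check to the listed protein and GC content.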