Gene Bcen_0337 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcen_0337 
Symbol 
ID4090810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia cenocepacia AU 1054 
KingdomBacteria 
Replicon accessionNC_008060 
Strand
Start bp371579 
End bp372913 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content68% 
IMG OID638013596 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_620222 
Protein GI107021895 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCTTG ACCTGTCGAA ACCGGCGACC GCCGGCTACC TGAGCGGTTT CGCGAACGAA 
TTCGCGACCG AGGCGCTGCC CGGTGCGCTG CCGCACGGCC GCAACTCGCC GCAGCGCGCG
CCGTACGGGC TGTACGCGGA GCAGCTGTCG GGCACCGCGT TCACCGCGCC GCGCGGCCAC
AACCGCCGCT CATGGCTGTA CCGGATTCGC CCGGCCGCCG TGCACCGGCC GTTCGAGCCG
TATGCGGGCG CGCAGCGGCT CGTGTCGGAA TTCGGCGATT CGGCCGACGT GCCGCCGACG
CCGCCGAACC AGCTGCGCTG GGACCCGCTG CCGATGCCGG TCGAGCCGAC CGATTTCGTC
GACGGTCTCG TGACGATGGC CGGCAACGGG TCGGCCGCCG CGATGAACGG CTGCGCGATC
CACCTGTACG CGGCGAACCG TTCGATGCAG GACCGCTTTT TCTACAGCGC GGACGGCGAG
CTGCTGATCG TGCCGCAGCA GGGGCGGCTG TTCATCGCGA CCGAATTCGG CCGGCTCGAC
GTCGAGCCGT TCGAGATCGC GGTGATCCCG CGCGGCGTGC GTTTTGCCGT CGCGCTGCCG
GACGGCAACG CGCGCGGCTA TATCTGCGAG AACTTCGGTG CGCTGCTGCG CCTGCCGGAT
CTCGGCCCGA TCGGCTCGAA CGGGCTCGCG AACCCGCGCG ATTTCCTGAC GCCGCAGGCC
GCGTACGAGG ATCGCGAAGG CGCGTTCGAG CTGATCGCGA AGCTGAACGG CCGGCTCTGG
CGCGCGGATA TCGGCCATTC GCCGCTCGAC GTCGTCGCGT GGCACGGCAA CTACGCACCG
TATAAGTACG ATCTGCGGCT GTTCAACACG ATCGGCTCGA TCAGCTTCGA CCATCCCGAT
CCTTCGATCT TCCTCGTGCT GCAGGCGCAG AGCGACACGC CGGGCGTCGA CACGATCGAC
TTCGTGATCT TTCCGCCGCG CTGGCTCGCG GCCGAGGATA CGTTCCGCCC GCCGTGGTTC
CACCGCAACG TCGCGAGCGA GTTCATGGGG CTCGTGCACG GCGCGTACGA CGCGAAGGCC
GAAGGCTTCG TGCCGGGCGG CGCGAGCCTG CACAACTGCA TGTCGGGCCA CGGGCCCGAT
GCGGACACGT TCGAGAAGGC GTCCGCGAGC GACACGACGA AGCCGCACAA GGTCGACGCG
ACGATGGCGT TCATGTTCGA AACCCGCACG CTGATTCGGC CGACGCGCTA CGCGCTCGAC
ACCGCGCAGC TGCAGGCCGA CTACTTCGAA TGCTGGCAAG GCATCAAGAA ACACTTCAAT
CCGGAGCAAC GATGA
 
Protein sequence
MTLDLSKPAT AGYLSGFANE FATEALPGAL PHGRNSPQRA PYGLYAEQLS GTAFTAPRGH 
NRRSWLYRIR PAAVHRPFEP YAGAQRLVSE FGDSADVPPT PPNQLRWDPL PMPVEPTDFV
DGLVTMAGNG SAAAMNGCAI HLYAANRSMQ DRFFYSADGE LLIVPQQGRL FIATEFGRLD
VEPFEIAVIP RGVRFAVALP DGNARGYICE NFGALLRLPD LGPIGSNGLA NPRDFLTPQA
AYEDREGAFE LIAKLNGRLW RADIGHSPLD VVAWHGNYAP YKYDLRLFNT IGSISFDHPD
PSIFLVLQAQ SDTPGVDTID FVIFPPRWLA AEDTFRPPWF HRNVASEFMG LVHGAYDAKA
EGFVPGGASL HNCMSGHGPD ADTFEKASAS DTTKPHKVDA TMAFMFETRT LIRPTRYALD
TAQLQADYFE CWQGIKKHFN PEQR