Gene Bcenmc03_0789 details

Gene Information

Locus tag: Bcenmc03_0789
Symbol:
ID: 6122470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia cenocepacia MC0-3
Kingdom: Bacteria
Replicon accession: NC_010508
Strand:
Start bp: 875252
End bp: 876586
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 68%
IMG OID: 641637353
Product: homogentisate 1,2-dioxygenase
Protein accession: YP_001764089
Protein GI: 170732142
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
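
The Start bp, End bp, Gene Length and Protein Length fields are consistent with one another if the coordinates are read as 1-based and inclusive. The record does not state its coordinate convention, so that reading is an assumption. A minimal Python sketch of the arithmetic, with illustrative variable names:

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 875252
end_bp = 876586

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Three bases per codon; the last codon is assumed to be the stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the 1335 bp and 444 aa listed in the record.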


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.211762
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCTTG ACCTGTCGAA ACCGGCGACC GCCGGCTACC TGAGCGGTTT CGCGAACGAA 
TTCGCGACCG AGGCGCTGCC CGGCGCGCTG CCGCACGGCC GCAACTCGCC GCAGCGTGCG
CCGTACGGAC TGTACGCGGA GCAGCTGTCG GGCACCGCGT TCACCGCGCC GCGCGGCCAC
AACCGCCGCT CATGGCTGTA CCGGATTCGC CCGGCTGCCG TGCACCGGCC GTTCGAGCCG
TATGCGGGGG CGCAGCGGCT CGTGTCGGAA TTCGGCGATT CGGCCGACGT GCCGCCGACG
CCGCCGAACC AGCTGCGCTG GGACCCGCTG CCGATGCTGG TCGAGCCGAC CGATTTCGTC
GACGGCCTCG TGACGATGGC CGGCAACGGG TCGGCCGCCG CGATGAACGG CTGCGCGATC
CACCTGTATG CGGCGAACCG CTCGATGCAG GACCGCTTCT TCTACAGCGC GGACGGCGAG
CTGCTGATCG TGCCGCAGCA GGGGCGGCTG TTCATCGCGA CCGAATTCGG CCGGCTCGAT
GTCGAGCCGT TCGAGATCGC GGTGATCCCG CGCGGCGTGC GTTTTGCCGT CGCGCTGCCG
GACGGCGATG CGCGCGGCTA CATCTGCGAG AACTTCGGTG CGCTGCTGCG CCTGCCGGAT
CTCGGCCCGA TCGGCTCGAA CGGGCTCGCG AACCCGCGCG ATTTCCTGAC GCCGCAGGCC
GCGTACGAGG ATCGCGAAGG CGCGTTCGAG CTGATCGCGA AGCTGAACGG CCGGCTCTGG
CGCGCGGACA TTGGCCATTC GCCGCTCGAC GTCGTCGCGT GGCACGGCAA CTACGCGCCG
TATAAATACG ACCTTCGCCT GTTCAACACG ATCGGCTCGA TCAGCTTCGA CCATCCCGAT
CCGTCGATCT TCCTCGTGCT GCAGGCGCAG AGCGATACGC CGGGCGTCGA CACGATCGAC
TTCGTGATCT TCCCGCCGCG CTGGCTCGCG GCCGAGGATA CGTTCCGCCC GCCGTGGTTC
CACCGCAACG TCGCGAGCGA GTTCATGGGG CTCGTGCACG GCGCGTACGA CGCGAAGGCC
GAGGGCTTCG TGCCGGGCGG CGCGAGCCTG CATAACTGCA TGTCGGGCCA CGGGCCCGAT
GCGGACACGT TCGAGAAGGC GTCCGCGAGC GACACGACGA AGCCGCACAA GGTCGACGCG
ACGATGGCGT TCATGTTCGA AACCCGCACG CTGATCCGGC CGACGTGCTA CGCGCTCGAC
ACGGCGCAAC TGCAGGCCGA CTACTTCGAA TGCTGGCAAG GCATCAAGAA ACACTTCAAT
CCGGAGCAAA AGTGA
 
Protein sequence
MTLDLSKPAT AGYLSGFANE FATEALPGAL PHGRNSPQRA PYGLYAEQLS GTAFTAPRGH 
NRRSWLYRIR PAAVHRPFEP YAGAQRLVSE FGDSADVPPT PPNQLRWDPL PMLVEPTDFV
DGLVTMAGNG SAAAMNGCAI HLYAANRSMQ DRFFYSADGE LLIVPQQGRL FIATEFGRLD
VEPFEIAVIP RGVRFAVALP DGDARGYICE NFGALLRLPD LGPIGSNGLA NPRDFLTPQA
AYEDREGAFE LIAKLNGRLW RADIGHSPLD VVAWHGNYAP YKYDLRLFNT IGSISFDHPD
PSIFLVLQAQ SDTPGVDTID FVIFPPRWLA AEDTFRPPWF HRNVASEFMG LVHGAYDAKA
EGFVPGGASL HNCMSGHGPD ADTFEKASAS DTTKPHKVDA TMAFMFETRT LIRPTCYALD
TAQLQADYFE CWQGIKKHFN PEQK
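
The sequences above are printed in blocks of ten characters, so the spaces have to be removed before any processing. The following is a rough Python sketch, not the tool used to produce this record. It joins the first line of the gene sequence, translates it, and computes a GC fraction. It uses the amino-acid assignments that translation table 11 shares with the standard code. The function names `clean`, `translate` and `gc_fraction` are hypothetical, and special handling of alternative start codons is not modelled.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# The amino-acid string is the one used by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGACGCTTG ACCTGTCGAA ACCGGCGACC GCCGGCTACC TGAGCGGTTT CGCGAACGAA"

peptide = translate(first_line)
print(peptide)
print(round(gc_fraction(first_line), 2))
```

The translated peptide, MTLDLSKPATAGYLSGFANE, matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same functions would be the natural next step.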