Gene Bcen2424_0820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBcen2424_0820 
Symbol 
ID4449888 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia cenocepacia HI2424 
KingdomBacteria 
Replicon accessionNC_008542 
Strand
Start bp919540 
End bp920874 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content68% 
IMG OID639692844 
Producthomogentisate 1,2-dioxygenase 
Protein accessionYP_834466 
Protein GI116688843 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3508] Homogentisate 1,2-dioxygenase 
TIGRFAM ID[TIGR01015] homogentisate 1,2-dioxygenase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.623719 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCTTG ACCTGTCGAA ACCGGCGACC GCCGGCTACC TGAGCGGTTT CGCGAACGAA 
TTCGCGACCG AGGCGCTGCC CGGTGCGCTG CCGCACGGCC GCAACTCGCC GCAGCGCGCG
CCGTACGGGC TGTACGCGGA GCAGCTGTCG GGCACCGCGT TCACCGCGCC GCGCGGCCAC
AACCGCCGCT CATGGCTGTA CCGGATTCGC CCGGCCGCCG TGCACCGGCC GTTCGAGCCG
TATGCGGGCG CGCAGCGGCT CGTGTCGGAA TTCGGCGATT CGGCCGACGT GCCGCCGACG
CCGCCGAACC AGCTGCGCTG GGACCCGCTG CCGATGCCGG TCGAGCCGAC CGATTTCGTC
GACGGTCTCG TGACGATGGC CGGCAACGGG TCGGCCGCCG CGATGAACGG CTGCGCGATC
CACCTGTACG CGGCGAACCG TTCGATGCAG GACCGCTTTT TCTACAGCGC GGACGGCGAG
CTGCTGATCG TGCCGCAGCA GGGGCGGCTG TTCATCGCGA CCGAATTCGG CCGGCTCGAC
GTCGAGCCGT TCGAGATCGC GGTGATCCCG CGCGGCGTGC GTTTTGCCGT CGCGCTGCCG
GACGGCAACG CGCGCGGCTA TATCTGCGAG AACTTCGGTG CGCTGCTGCG CCTGCCGGAT
CTCGGCCCGA TCGGCTCGAA CGGGCTCGCG AACCCGCGCG ATTTCCTGAC GCCGCAGGCC
GCGTACGAGG ATCGCGAAGG CGCGTTCGAG CTGATCGCGA AGCTGAACGG CCGGCTCTGG
CGCGCGGATA TCGGCCATTC GCCGCTCGAC GTCGTCGCGT GGCACGGCAA CTACGCACCG
TATAAGTACG ATCTGCGGCT GTTCAACACG ATCGGCTCGA TCAGCTTCGA CCATCCCGAT
CCTTCGATCT TCCTCGTGCT GCAGGCGCAG AGCGACACGC CGGGCGTCGA CACGATCGAC
TTCGTGATCT TTCCGCCGCG CTGGCTCGCG GCCGAGGATA CGTTCCGCCC GCCGTGGTTC
CACCGCAACG TCGCGAGCGA GTTCATGGGG CTCGTGCACG GCGCGTACGA CGCGAAGGCC
GAAGGCTTCG TGCCGGGCGG CGCGAGCCTG CACAACTGCA TGTCGGGCCA CGGGCCCGAT
GCGGACACGT TCGAGAAGGC GTCCGCGAGC GACACGACGA AGCCGCACAA GGTCGACGCG
ACGATGGCGT TCATGTTCGA AACCCGCACG CTGATTCGGC CGACGCGCTA CGCGCTCGAC
ACCGCGCAGC TGCAGGCCGA CTACTTCGAA TGCTGGCAAG GCATCAAGAA ACACTTCAAT
CCGGAGCAAC GATGA
 
Protein sequence
MTLDLSKPAT AGYLSGFANE FATEALPGAL PHGRNSPQRA PYGLYAEQLS GTAFTAPRGH 
NRRSWLYRIR PAAVHRPFEP YAGAQRLVSE FGDSADVPPT PPNQLRWDPL PMPVEPTDFV
DGLVTMAGNG SAAAMNGCAI HLYAANRSMQ DRFFYSADGE LLIVPQQGRL FIATEFGRLD
VEPFEIAVIP RGVRFAVALP DGNARGYICE NFGALLRLPD LGPIGSNGLA NPRDFLTPQA
AYEDREGAFE LIAKLNGRLW RADIGHSPLD VVAWHGNYAP YKYDLRLFNT IGSISFDHPD
PSIFLVLQAQ SDTPGVDTID FVIFPPRWLA AEDTFRPPWF HRNVASEFMG LVHGAYDAKA
EGFVPGGASL HNCMSGHGPD ADTFEKASAS DTTKPHKVDA TMAFMFETRT LIRPTRYALD
TAQLQADYFE CWQGIKKHFN PEQR