Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcen_0337 |
Symbol | |
ID | 4090810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia cenocepacia AU 1054 |
Kingdom | Bacteria |
Replicon accession | NC_008060 |
Strand | + |
Start bp | 371579 |
End bp | 372913 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638013596 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_620222 |
Protein GI | 107021895 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGCTTG ACCTGTCGAA ACCGGCGACC GCCGGCTACC TGAGCGGTTT CGCGAACGAA TTCGCGACCG AGGCGCTGCC CGGTGCGCTG CCGCACGGCC GCAACTCGCC GCAGCGCGCG CCGTACGGGC TGTACGCGGA GCAGCTGTCG GGCACCGCGT TCACCGCGCC GCGCGGCCAC AACCGCCGCT CATGGCTGTA CCGGATTCGC CCGGCCGCCG TGCACCGGCC GTTCGAGCCG TATGCGGGCG CGCAGCGGCT CGTGTCGGAA TTCGGCGATT CGGCCGACGT GCCGCCGACG CCGCCGAACC AGCTGCGCTG GGACCCGCTG CCGATGCCGG TCGAGCCGAC CGATTTCGTC GACGGTCTCG TGACGATGGC CGGCAACGGG TCGGCCGCCG CGATGAACGG CTGCGCGATC CACCTGTACG CGGCGAACCG TTCGATGCAG GACCGCTTTT TCTACAGCGC GGACGGCGAG CTGCTGATCG TGCCGCAGCA GGGGCGGCTG TTCATCGCGA CCGAATTCGG CCGGCTCGAC GTCGAGCCGT TCGAGATCGC GGTGATCCCG CGCGGCGTGC GTTTTGCCGT CGCGCTGCCG GACGGCAACG CGCGCGGCTA TATCTGCGAG AACTTCGGTG CGCTGCTGCG CCTGCCGGAT CTCGGCCCGA TCGGCTCGAA CGGGCTCGCG AACCCGCGCG ATTTCCTGAC GCCGCAGGCC GCGTACGAGG ATCGCGAAGG CGCGTTCGAG CTGATCGCGA AGCTGAACGG CCGGCTCTGG CGCGCGGATA TCGGCCATTC GCCGCTCGAC GTCGTCGCGT GGCACGGCAA CTACGCACCG TATAAGTACG ATCTGCGGCT GTTCAACACG ATCGGCTCGA TCAGCTTCGA CCATCCCGAT CCTTCGATCT TCCTCGTGCT GCAGGCGCAG AGCGACACGC CGGGCGTCGA CACGATCGAC TTCGTGATCT TTCCGCCGCG CTGGCTCGCG GCCGAGGATA CGTTCCGCCC GCCGTGGTTC CACCGCAACG TCGCGAGCGA GTTCATGGGG CTCGTGCACG GCGCGTACGA CGCGAAGGCC GAAGGCTTCG TGCCGGGCGG CGCGAGCCTG CACAACTGCA TGTCGGGCCA CGGGCCCGAT GCGGACACGT TCGAGAAGGC GTCCGCGAGC GACACGACGA AGCCGCACAA GGTCGACGCG ACGATGGCGT TCATGTTCGA AACCCGCACG CTGATTCGGC CGACGCGCTA CGCGCTCGAC ACCGCGCAGC TGCAGGCCGA CTACTTCGAA TGCTGGCAAG GCATCAAGAA ACACTTCAAT CCGGAGCAAC GATGA
|
Protein sequence | MTLDLSKPAT AGYLSGFANE FATEALPGAL PHGRNSPQRA PYGLYAEQLS GTAFTAPRGH NRRSWLYRIR PAAVHRPFEP YAGAQRLVSE FGDSADVPPT PPNQLRWDPL PMPVEPTDFV DGLVTMAGNG SAAAMNGCAI HLYAANRSMQ DRFFYSADGE LLIVPQQGRL FIATEFGRLD VEPFEIAVIP RGVRFAVALP DGNARGYICE NFGALLRLPD LGPIGSNGLA NPRDFLTPQA AYEDREGAFE LIAKLNGRLW RADIGHSPLD VVAWHGNYAP YKYDLRLFNT IGSISFDHPD PSIFLVLQAQ SDTPGVDTID FVIFPPRWLA AEDTFRPPWF HRNVASEFMG LVHGAYDAKA EGFVPGGASL HNCMSGHGPD ADTFEKASAS DTTKPHKVDA TMAFMFETRT LIRPTRYALD TAQLQADYFE CWQGIKKHFN PEQR
|
| |