Gene Information |
Locus tag | BCZK0217 |
Symbol | |
ID | 3022540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | + |
Start bp | 232307 |
End bp | 233479 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637544392 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_081832 |
Protein GI | 52144997 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
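The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 232307
END_BP = 233479


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS, assuming its last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1173 390
```

Both results agree with the Gene Length (1173 bp) and Protein Length (390 aa) fields.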
|
|
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTATC GTCACATGGG GGAGCTACCT CATAAACGAC ATGTACAATT CCGTAAAAAA GATGGATCAC TTTATCGTGA ACAGGTAATG GGAACAAAAG GTTTTTCTGG TACGCAATCT ATTTTGTATC ATCATTATAT GCCAACGGAA GTAGGGCATG CGGCATTATC GCATTCTTGT CAGTTGCAGT ATGAAGAAGA TGTTGCTCTT TCTCATCGTC ACTTTCGCAC GAAAGAGAAT AAAAAAAGTG GTGATGCAGT AAGTGGCAGA AACTTTATAC TTGGAAATGA GGATTTATTA ATCGGAGTAG TGACTCCGAC AGAAAAAATG GATTATTTCT ACCGTAATGG TGATGGTGAT GAAATGTTGT TTGTCCATTA CGGAACAGGA AAAATTGAAA CAATGTTCGG AACGATTCAC TATCGAAAAG GTGATTATGT AATAATCCCA ATTGGAACGA TTTATCGTGT TATTCCAGAT GAAGGAGAGA CTAAGTTTCT TGTTGTAGAG GCAAATAGTC AAATTACAAC ACCGCGTCGC TACCGTAATG AATATGGACA ATTGTTAGAG CATAGCCCGT TTTGTGAGAG AGATATTCGT GGCCCGGAAA AATTAGAGAC ATATGATGAA AAAGGTGAGT TTGTCGTAAT GACAAAGTCG CGAGGATATA TGCATAAACA TGTTTTAGGA CACCATCCGT TAGATGTAGT TGGATGGGAT GGTTATTTAT ATCCTTGGGT CTTTAATGTA GAGGATTTTG AACCAATTAC AGGTCGTATT CATCAGCCAC CTCCAGTACA TCAAACGTTC GAGGGTCACA ATTTTGTTAT TTGTTCTTTC GTACCACGTT TATATGACTA TCATCCAGAA TCTATTCCGG CACCGTATTA TCATAGTAAC GTGAATAGTG ATGAAGTACT GTACTATGTA GAAGGTAACT TTATGAGCCG AAAAGGGGTG GAGGAAGGGT CTATTACACT TCATCCGAGC GGCATTCCTC ATGGGCCACA TCCTGGGAAA ACAGAGGCGA GTATAGGGAA AAAAGAAACG CTTGAATTAG CTGTTATGAT AGATACATTC CGTCCGCTTC GTATTGTAAA ACAAGCACAT GAAACAGAAG ATGAAAAATA TATGTATAGC TGGATTGAAG AGGGATCGTA TACTGTGAAA TAA
|
Protein sequence | MFYRHMGELP HKRHVQFRKK DGSLYREQVM GTKGFSGTQS ILYHHYMPTE VGHAALSHSC QLQYEEDVAL SHRHFRTKEN KKSGDAVSGR NFILGNEDLL IGVVTPTEKM DYFYRNGDGD EMLFVHYGTG KIETMFGTIH YRKGDYVIIP IGTIYRVIPD EGETKFLVVE ANSQITTPRR YRNEYGQLLE HSPFCERDIR GPEKLETYDE KGEFVVMTKS RGYMHKHVLG HHPLDVVGWD GYLYPWVFNV EDFEPITGRI HQPPPVHQTF EGHNFVICSF VPRLYDYHPE SIPAPYYHSN VNSDEVLYYV EGNFMSRKGV EEGSITLHPS GIPHGPHPGK TEASIGKKET LELAVMIDTF RPLRIVKQAH ETEDEKYMYS WIEEGSYTVK
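The relationship between the two sequences can be illustrated by translating the start of the gene sequence. This is a minimal sketch using only the standard library; it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in their permitted start codons). The names `CODON_TABLE` and `translate` are illustrative, and only the first 30 bases of the gene are used.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (third base varies fastest).
# Table 11 shares these assignments with the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
gene_prefix = "ATGTTTTATC GTCACATGGG GGAGCTACCT"
print(translate(gene_prefix))  # MFYRHMGELP
```

The output matches the first block of the protein sequence (`MFYRHMGELP`).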