Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCAH187_A0276 |
Symbol | |
ID | 7077372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus AH187 |
Kingdom | Bacteria |
Replicon accession | NC_011658 |
Strand | + |
Start bp | 236081 |
End bp | 237253 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643448786 |
Product | putative homogentisate 1,2-dioxygenase |
Protein accession | YP_002336346 |
Protein GI | 217957802 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000242533 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTATC GTCACATGGG GGAGCTACCT CATAAACGAC ATGTACAATT CCGTAAAAAA GATGGGTCGC TTTATCGTGA GCAGGTAATG GGAACAAAAG GTTTTTCTGG TACGCAATCT ATTTTGTACC ATCATTATAT GCCAACGGAA GTAGGGCATG CGGCATTATC TCATTCTTGT CAGTTGCAGT ATGAAGAAGA TGTTGCTCTT TCTCATCGTC ACTTTCGCAC GAAAGAAAAT AAAAAAAGTG GTGATGCAGT AAGCGGAAGA AACTTTATAC TTGGGAATGA GGATTTATTA ATCGGAGTAG TGAGCCCAAC AGAAAAAATG GACTATTTCT ACCGTAATGG TGATGGCGAT GAAATGTTGT TTGTCCATTA CGGAACAGGA AAAATTGAAA CGATGTTTGG AACGATTCAT TATCGAAAAG GCGATTATGT AACAATTCCA ATTGGAACGA TTTATCGTGT TATTCCAGAT GAAGGAGAGA CTAAGTTTCT TGTTGTAGAG GCAAATAGTC AAATTACAAC ACCGCGTCGC TATCGTAATG AATATGGACA ATTGCTAGAG CATAGTCCGT TTTGTGAGAG AGATATTCGT GGTCCTGAAA AATTAGAGAC GTATGATGAA AAAGGTGAGT TTGTCGTAAT GACAAAGTCG AGGGGATATA TGCATAAACA TGTTTTAGGA CATCATCCGT TAGATGTTGT TGGTTGGGAT GGCTATTTAT ATCCGTGGGT CTTTAATGTA GAGGACTTTG AACCAATTAC AGGTCGTATT CATCAGCCAC CTCCAGTACA TCAAACGTTC GAGGGTCACA ATTTCGTTAT TTGTTCTTTT GTACCACGTT TATATGACTA TCATCCAGAA TCTATTCCGG CACCGTATTA TCATAGTAAC GTGAATAGTG ATGAAGTACT GTACTATGTA GAAGGTAACT TTATGAGCCG AAAAGGTGTG GAGGAAGGGT CTATTACACT TCATCCGAGC GGCATTCCTC ACGGGCCACA TCCTGGGAAA ACAGAGGCGA GTATAGGGAA AAAAGAAACG CTTGAATTAG CTGTTATGAT AGATACATTC CGTCCGCTTC GTATTGTAAA ACAAGCACAT GAAACAGAAG ATGAAAAATA TATGTATAGC TGGATTGAAG AGGGATCATA TACTGTGAAA TAA
|
Protein sequence | MFYRHMGELP HKRHVQFRKK DGSLYREQVM GTKGFSGTQS ILYHHYMPTE VGHAALSHSC QLQYEEDVAL SHRHFRTKEN KKSGDAVSGR NFILGNEDLL IGVVSPTEKM DYFYRNGDGD EMLFVHYGTG KIETMFGTIH YRKGDYVTIP IGTIYRVIPD EGETKFLVVE ANSQITTPRR YRNEYGQLLE HSPFCERDIR GPEKLETYDE KGEFVVMTKS RGYMHKHVLG HHPLDVVGWD GYLYPWVFNV EDFEPITGRI HQPPPVHQTF EGHNFVICSF VPRLYDYHPE SIPAPYYHSN VNSDEVLYYV EGNFMSRKGV EEGSITLHPS GIPHGPHPGK TEASIGKKET LELAVMIDTF RPLRIVKQAH ETEDEKYMYS WIEEGSYTVK
|
| |