Gene Information |
Locus tag | BCG9842_B5068 |
Symbol | |
ID | 7183662 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | + |
Start bp | 218030 |
End bp | 219202 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 643548021 |
Product | putative homogentisate 1,2-dioxygenase |
Protein accession | YP_002443765 |
Protein GI | 218895354 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
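
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 218030
end_bp = 219202

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no residue.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, protein_length_aa)
```

Under those assumptions the result matches the reported Gene Length and Protein Length fields.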
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00282868 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTTATC GTCACATGGG AGAACTACCT CATAAACGAC ATGTACAATT TCGTAAAAAA GATGGATCGC TTTATCGTGA ACAGGTAATG GGAACAAAAG GTTTTTCTGG TACGCAGTCT ATTTTGTATC ATCACTATAT GCCAACAGAA GTAGGTCATT CTGCATTATC GCATTCTTGT CAGTTGCAGT ATGAAGAGGA TGTTGCTCTT TCTCATCGCC ACTTCCGAAC GAAAGAAAAT AAAAAAAGTG GTGATGCAAT AAGTGGACGA AATTTCATAC TTGGAAATGA AGATTTGTTA ATTGGAGTAG TGAGCCCAAC AGAAAAAATG GATTATTTCT ATCGTAATGG TGATGGCGAC GAAATGTTAT TTGTTCATTA TGGAACAGGG AAAATTGAAA CGATGTTTGG AACGATTCAC TATAGAAAAG GCGACTATGT AACGATCCCA ATTGGAACGA TTTATCGTGT TATTCCAGAT GAAGGAGAGA CTAAGTTTCT TGTTGTAGAG GCGAATAGCC AAATTACAAC GCCGCGTCGT TATCGAAATG AATACGGACA ATTGTTAGAG CATAGTCCGT TTTGTGAAAG AGATCTTCGT GGTCCAGAAA AATTAGAGAC CTATGATGAA AAAGGCGATT TTGTCGTAAT GACAAAATCA AGAGGTTATA TGCACAAACA TGTTTTAGGA CACCACCCGT TAGATGTTGT TGGATGGGAT GGCTATTTGT ATCCGTGGGT ATTTAATGTA GAGGATTTTG AACCAATTAC AGGGCGCATT CATCAGCCGC CGCCAGTACA TCAAACATTT GAAGGGCATA ATTTTGTTAT TTGCTCTTTC GTACCACGTT TATACGATTA TCATCCAGAG TCAATTCCGG CACCATATTA TCATAGTAAT GTTAATAGTG ATGAAGTTCT TTACTATGTA GAAGGAAACT TTATGAGTCG CAAAGGTGTG GAAGAAGGTT CTATTACACT TCATCCGAGC GGGATTCCCC ATGGGCCGCA TCCGGGGAAA ACAGAGGCAA GTATAGGGAA GAAAGAGACA CTTGAATTAG CTGTTATGAT AGACACATTC CGTCCGCTTC GTATTGTAAA ACAAGCACAT GAAACAGAAG ATGAAAAGTA TATGTATAGC TGGATTGAAC AAGGTTCATA TACTGTGAAA TAA
Protein sequence | MFYRHMGELP HKRHVQFRKK DGSLYREQVM GTKGFSGTQS ILYHHYMPTE VGHSALSHSC QLQYEEDVAL SHRHFRTKEN KKSGDAISGR NFILGNEDLL IGVVSPTEKM DYFYRNGDGD EMLFVHYGTG KIETMFGTIH YRKGDYVTIP IGTIYRVIPD EGETKFLVVE ANSQITTPRR YRNEYGQLLE HSPFCERDLR GPEKLETYDE KGDFVVMTKS RGYMHKHVLG HHPLDVVGWD GYLYPWVFNV EDFEPITGRI HQPPPVHQTF EGHNFVICSF VPRLYDYHPE SIPAPYYHSN VNSDEVLYYV EGNFMSRKGV EEGSITLHPS GIPHGPHPGK TEASIGKKET LELAVMIDTF RPLRIVKQAH ETEDEKYMYS WIEQGSYTVK
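
Both sequences are displayed in space-separated blocks of 10 characters, so they need the spaces removed before any analysis. The sketch below shows one way to do that and to run basic sanity checks on a coding sequence. The function names are hypothetical, and the start-codon check assumes ATG, which holds for this record but is a simplification because translation table 11 also permits alternative start codons.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in an ungapped DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


def looks_like_complete_cds(dna: str) -> bool:
    """Check for a length divisible by 3, an ATG start, and a stop codon at the end.

    Simplification: only ATG is accepted as a start codon.
    """
    stop_codons = {"TAA", "TAG", "TGA"}
    return len(dna) % 3 == 0 and dna[:3] == "ATG" and dna[-3:] in stop_codons


# Toy input in the same grouped layout (not the full gene sequence).
example = clean_sequence("ATGTTTTATC GTTAA")
print(example, looks_like_complete_cds(example), gc_fraction(example))
```

Pasting the full Gene sequence value into `clean_sequence` and passing the result to `gc_fraction` gives a figure that can be compared with the GC content field.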