Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCG9842_B1558 |
Symbol | hutI |
ID | 7182614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus G9842 |
Kingdom | Bacteria |
Replicon accession | NC_011772 |
Strand | - |
Start bp | 3586187 |
End bp | 3587458 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643551483 |
Product | imidazolonepropionase |
Protein accession | YP_002447153 |
Protein GI | 218898742 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.240138 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.0370079 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGACA CTTTACTAAT AAATATCGGT CAATTACTAA CAATGGATCA AGAAGATGGC TTGTTAAGGC GGGAAGCGAT GAACACGCTT CCTGTTATCG AAAACGGTGT GGTTGGAATT GAAAATGATG TAATCACTTT CGTTGGAACA GCGGAAGAAG CGAAAGGACT TCAGGCGAAA GAAGTTATTG ATTGCGGCGG GAAAATGGTT TCTCCTGGTC TTGTTGATCC GCATACTCAT CTTGTATTTG GTGGATCTCG CGAAAATGAA ATCGCACTAA AATTACAAGG AGTTCCGTAC TTAGAAATTT TAGAACAAGG CGGAGGTATT CTTTCAACTG TAAATGCAAC GAAACAGGCG TCGAAGGAAG AACTTGTTCA AAAAGCGAAA TTCCATTTAG ACCGTATGCT ATCTTTCGGA GTTACTACTG TAGAAGCGAA GAGCGGTTAC GGATTAGATG ATGAGACGGA ATGGAAACAA TTAGAGGCAA CAGCACAATT ACAAAAAGAG CATCCGATCG ATTTAGTTTC CACATTTTTA GGTGCTCATG CAGTTCCGAA AGAATATAAA GGTAGATCGA AAGAATTTTT ACAATGGATG TTAGACTTAT TGCCAGAAAT GAAAGAGAAG CAATTAGCGG AGTTCGTTGA TATTTTCTGT GAAACAGGTG TGTTCTCTGT CGAAGAATCA AAAGAGTTTT TATTAAAAGC GAAAGAGCTT GGCTTTGATG TGAAAATTCA TGCGGATGAA ATAGACCCTC TTGGTGGTGC GGAAGCAGCG GCAGAAATTG GTGCAGCATC AGCGGACCAT TTAGTTGGCG CTTCTGATAA AGGGATTGAA ATGCTTGCAA ACTCGAATAC AGTAGCAACA TTATTACCAG GAACAACTTT CTATTTAAAT AAAGAAAGCT TTGCTCGTGG TCGTAAAATG ATTGATGAAG GTGTTGCAGT TGCGTTAGCT ACAGACTTTA ACCCAGGTAG CTGTCCAACT GAAAATATTC AGCTTATTAT GAGCATTGCG ATGCTAAAAC TGAAAATGAC ACCAGAAGAA GTGTGGAATG CCGTAACGGT TAACTCATCA TATGCCATTA ACCGTGGTGA TGTAGCTGGG AAAATTAGAG TTGGTCGTAA GGCAGATTTA GTTTTATGGG ATGCTTATAA TTATGCTTAC GTACCGTATC ATTACGGCGT AAGTCATGTG AATACAGTAT GGAAGAATGG TAATATTGCA TATACAAGAG GTGAACAATC GTGGAGCACG GCCACTATTT AA
|
Protein sequence | MLDTLLINIG QLLTMDQEDG LLRREAMNTL PVIENGVVGI ENDVITFVGT AEEAKGLQAK EVIDCGGKMV SPGLVDPHTH LVFGGSRENE IALKLQGVPY LEILEQGGGI LSTVNATKQA SKEELVQKAK FHLDRMLSFG VTTVEAKSGY GLDDETEWKQ LEATAQLQKE HPIDLVSTFL GAHAVPKEYK GRSKEFLQWM LDLLPEMKEK QLAEFVDIFC ETGVFSVEES KEFLLKAKEL GFDVKIHADE IDPLGGAEAA AEIGAASADH LVGASDKGIE MLANSNTVAT LLPGTTFYLN KESFARGRKM IDEGVAVALA TDFNPGSCPT ENIQLIMSIA MLKLKMTPEE VWNAVTVNSS YAINRGDVAG KIRVGRKADL VLWDAYNYAY VPYHYGVSHV NTVWKNGNIA YTRGEQSWST ATI
|
| |