Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BCZK0424 |
Symbol | hutI |
ID | 3023798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus E33L |
Kingdom | Bacteria |
Replicon accession | NC_006274 |
Strand | - |
Start bp | 491827 |
End bp | 492957 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637544624 |
Product | imidazolonepropionase |
Protein accession | YP_082033 |
Protein GI | 52144796 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.224274 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAT TGCTCAAACA AGCCATGGTC TATCCTATTA CATCCCAAAA ATTTCAAGGG GATGTACTCG TTATAGGAGA AAAAATTGCT GAGGTCAAGC CTTTCATTCA ACCTACTCAA GATATGACAG TTATAGATGC ACGTGCTCTT CATCTCTTAC CTGGATTTAT TGATGTCCAT ACTCATCTTG GTCTCTACGA TGAAGGTACT GGTTGGGCTG GCAATGATGC AAATGAAACG TCTGAAGTTT CAACACCACA TATCCGTTCT TTAGACGGAA TCCACCCTTT GGATATTGCA TTTCAAGATG CTGTACAAAA TGGAATTACA ACTGTTCACG TTATGCCAGG AAGTCAAAAC ATTATTGGTG GTACGACTTG TGTAATAAAA ACAGCCGGAA CTTGTATTGA TCATATGATT ATTCAAGAAC CTGCTGGCTT AAAGATTGCC TTTGGCGAAA ATCCTAAAAA AGTCCATAGT AATGGAACAA AAGAGTCCAT TACGCGTATG GGAATTATGG GACTACTTCG GGAATCATTT TATGAAGCAC AGCACTACGG GCATGAAGCT GATTTTCGAA TGCTTCCTAT TTTAAAAGCA TTACGCCGCG AAATACCCGT ACGTATCCAC GCTCACCGAG CAGACGATAT TAGTTCTGCT CTACGTTTTG CAACAGAGTT CAATCTCGAT TTACGTATTG AACATTGTAC AGAAGGACAC TTTATTGTTG AAGAACTTTC GAAGCACAAT CTGAAAGTTT CAGTTGGCCC CACGCTTACA CGCCGTTCTA AAATCGAACT AAAAAACAAA ACATGGGATA CTTACCATAT ATTGTCGAAA AGTGGAGTGG AAGTTTCCAT CACAACAGAT CACCCCTATA CACCCATTCA ATATTTAAAT CTTTGTGCTG CTGTCGCAGT AAGGGAAGGA TTAGACGAAA AAACTGCACT AGAAGGAATC ACTATATTTC CAGCACGAAA TTTACGTTTA GAGGATAGAA TTGGAAGCAT TGAGGCCGGA AAAGACGCTG ATCTTGTGCT GTGGACCCAT CATCCTTTCC ATTATTTAGC CAAGCCTGTA CTAACTATGA TTGATGGAAA AATAATTTAC AAAAAAAATA AAAAAAACTA G
|
Protein sequence | MKILLKQAMV YPITSQKFQG DVLVIGEKIA EVKPFIQPTQ DMTVIDARAL HLLPGFIDVH THLGLYDEGT GWAGNDANET SEVSTPHIRS LDGIHPLDIA FQDAVQNGIT TVHVMPGSQN IIGGTTCVIK TAGTCIDHMI IQEPAGLKIA FGENPKKVHS NGTKESITRM GIMGLLRESF YEAQHYGHEA DFRMLPILKA LRREIPVRIH AHRADDISSA LRFATEFNLD LRIEHCTEGH FIVEELSKHN LKVSVGPTLT RRSKIELKNK TWDTYHILSK SGVEVSITTD HPYTPIQYLN LCAAVAVREG LDEKTALEGI TIFPARNLRL EDRIGSIEAG KDADLVLWTH HPFHYLAKPV LTMIDGKIIY KKNKKN
|
| |