Gene Information |
Locus tag | BT9727_0428 |
Symbol | hutI |
ID | 2854288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus thuringiensis serovar konkukian str. 97-27 |
Kingdom | Bacteria |
Replicon accession | NC_005957 |
Strand | - |
Start bp | 498630 |
End bp | 499760 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 637511832 |
Product | imidazolonepropionase |
Protein accession | YP_034778 |
Protein GI | 49480128 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
|
|
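The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length_bp // 3
    # Assumption: the final codon is a stop codon and is not counted.
    return codons - 1 if has_stop_codon else codons


# Values taken from the record: Start bp 498630, End bp 499760.
gene_len = feature_length(498630, 499760)
prot_len = protein_length(gene_len)
print(gene_len, prot_len)  # 1131 376
```

Under these assumptions the result agrees with the Gene Length (1131 bp) and Protein Length (376 aa) fields.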
Plasmid Coverage information |
Num covering plasmid clones | 44 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAT TGCTCAAACA AGCCATGGTC TATCCTATTA CATCCCAAAA ATTTCAAGGG GATGTACTCG TTATAGGAGA AAAAATTGCT GAGGTCAAGC CTTTCATTCA ACCTACTCAA GATATGACAG TTATAGATGC ACGTGCTCTT CATCTTTTAC CTGGATTTAT TGATGTCCAT ACTCATCTTG GTCTCTACGA TGAAGGTACT GGTTGGGCTG GCAATGATGC AAATGAAACA TCTGAAGTTT CAACACCACA TATCCGTTCT TTAGACGGAA TCCACCCTTT GGATATTGCA TTTCAAGATG CTGTACAAAA TGGAATTACA ACTGTTCACG TTATGCCAGG AAGTCAAAAC ATTATTGGTG GTACGACTTG TGTAATAAAA ACAGCCGGAA CTTGTATTGA TCATATGATT ATTCAAGAAC CTGCTGGCTT AAAGATTGCC TTTGGCGAAA ATCCTAAAAA AGTCCATAGT AATGGAACAA AAGAGTCCAT TACGCGTATG GGAATTATGG GATTACTTCG GGAATCATTT TATGAAGCAC AGCACTACGG GCATGAAGCT GATTTTCGAA TGCTTCCTAT TTTAAAAGCA TTACGCCGCG AAATACCCGT ACGTATCCAC GCTCACCGAG CAGACGATAT TAGTTCTGCT CTACGTTTTG CAACAGAGTT CAATCTCGAT TTACGTATTG AACATTGTAC AGAAGGACAC TTTATTGTTG AAGAACTTTC GAAGCACAAT TTGAAAGTTT CAGTTGGTCC CACGCTTACA CGCCGTTCTA AAATTGAACT AAAAAACAAA ACATGGGATA CTTACCATAT ATTGTCGAAA AGTGGAGTGG AAGTTTCCAT CACAACAGAT CACCCCTATA CACCCATTCA ATATTTAAAT CTTTGTGCTG CTGTCGCAGT AAGGGAAGGA TTAGACGAAA AAACTGCACT AGAAGGAATC ACTATATTTC CAGCACGAAA TTTACGTTTA GAGGATAGAA TTGGAAGCAT TGAGGTCGGA AAAGACGCTG ATCTTGTGCT GTGGACCCAT CATCCTTTCC ATTATTTAGC CAAGCCTGTA CTAACTATGA TTGATGGAAA AATAATTTAC AAAAAAAATA AAAAAAACTA G
|
Protein sequence | MKILLKQAMV YPITSQKFQG DVLVIGEKIA EVKPFIQPTQ DMTVIDARAL HLLPGFIDVH THLGLYDEGT GWAGNDANET SEVSTPHIRS LDGIHPLDIA FQDAVQNGIT TVHVMPGSQN IIGGTTCVIK TAGTCIDHMI IQEPAGLKIA FGENPKKVHS NGTKESITRM GIMGLLRESF YEAQHYGHEA DFRMLPILKA LRREIPVRIH AHRADDISSA LRFATEFNLD LRIEHCTEGH FIVEELSKHN LKVSVGPTLT RRSKIELKNK TWDTYHILSK SGVEVSITTD HPYTPIQYLN LCAAVAVREG LDEKTALEGI TIFPARNLRL EDRIGSIEVG KDADLVLWTH HPFHYLAKPV LTMIDGKIIY KKNKKN
|
| |
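As a consistency check between the two sequence fields, the gene sequence can be translated codon by codon. The sketch below is a minimal illustration rather than a reference implementation: it builds the codon table from the standard NCBI amino-acid string, on the understanding that translation table 11 shares these codon assignments with the standard code and differs only in permitted start codons. It also assumes the gene sequence is shown in coding orientation, which is suggested by its starting with ATG and ending with TAG despite the minus strand. The `translate` helper is a hypothetical name.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
assert len(AMINO_ACIDS) == 64

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three and last three 10-base groups of the gene sequence in the record.
first_block = translate("ATGAAAATAT TGCTCAAACA AGCCATGGTC")
last_block = translate("AAAAAAAATA AAAAAAACTA G")
print(first_block, last_block)  # MKILLKQAMV KKNKKN
```

Both fragments match the start (MKILLKQAMV) and end (KKNKKN) of the protein sequence in the record; passing the full gene sequence to `translate` would extend the same check to the whole CDS.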