Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A0851 |
Symbol | hutI |
ID | 6483118 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 856153 |
End bp | 857415 |
Gene Length | 1263 bp |
Protein Length | 420 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642736263 |
Product | imidazolonepropionase |
Protein accession | YP_002040023 |
Protein GI | 194445732 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 92 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTATAC AACCCATGAC AGGCAAGAGA GCGACAGGAA TGCGGCAACT TTTACGGGGC GATACTGTCT GGCGAAACAT CAGGCTGGCG ACAATGGACC CGCAGCGGCA AGCCCCGTAC GGGCTGGTGG ATAACCAGGC GCTGATTGTA CGCGAAGGGC ATATTTGCGA TATCGTGCCA GAGACGCAGC TTCCTGTCAG TGGGGACAAT ATCCATGATA TGCAGGGACG ACTGGTAACC CCGGGACTTA TCGATTGCCA CACGCATCTG GTGTTTGCCG GTAACCGCGC CGCAGAGTGG GAGCAGCGGC TTAACGGCGC GTCATACCAG CATATTAGCG CTCAGGGCGG CGGCATTAAC GCGACGGTAT CAGCAACCCG CGCCTGTGCG GAGGAGACGC TCTACCTGCT GGCGCGCGAA CGCATGATGC GCCTTGCCAG CGAAGGCGTT ACGCTGCTGG AGATTAAATC CGGCTATGGC CTGGAGCTGG CGACAGAAGA AAAGCTGTTG CGCGTTGCTG CAAAACTTGC CGCCGAAAAC GCTATCGACA TTAGCCCCAC GCTATTGGCC GCTCATGCTA CGCCAGCGGA GTATCGTGAC GACCCGGACG GCTACATCAC TCTGGTCTGC GAGACGATGA TTCCGCAGCT CTGGCAAAAA GGGTTATTTG ATGCGGTAGA CCTCTTTTGC GAGAGCGTCG GCTTTAATGT GGCCCAGAGT GAGCGCGTGT TGCAGACGGC GAAGGCGTTA GGTATTCCCG TTAAAGGCCA TGTTGAGCAG CTTTCGCTGT TGGGCGGCGC GCAGCTGGTG AGTCGTTATC AGGGTTTATC GGCGGATCAT ATCGAATATC TTGATGAAGC GGGCGTCGCG GCGATGCGTG ACGGCGGTAC TGTCGGCGTG TTGTTGCCCG GCGCGTTTTA TTTTCTGCGC GAGACGCAGC GCCCGCCGGT TGAACTGCTG CGCCGCTATC AGGTGCCTGT CGCCGTCGCC AGCGATTTCA ATCCCGGCAC CAGCCCGTTT TGCAGTTTGC ATCTGGCGAT GAATATGGCC TGCGTACAGT TTGGTCTGAC GCCGGAAGAG GCATGGGCGG GCGTTACGCG CCATGCCGCT CGCGCGCTGG GAAGACAGGC GACGCATGGG CAGATCAGGG CCGGCTACCG GGCGGATTTT GTGGTGTGGG ATGCTGAACA GCCGGTAGAG ATAGTGTATG AGCCGGGGCG TAACCCTTTA TATCAGCGGG TATACAGAGG AAAAATCTCA TGA
|
Protein sequence | MSIQPMTGKR ATGMRQLLRG DTVWRNIRLA TMDPQRQAPY GLVDNQALIV REGHICDIVP ETQLPVSGDN IHDMQGRLVT PGLIDCHTHL VFAGNRAAEW EQRLNGASYQ HISAQGGGIN ATVSATRACA EETLYLLARE RMMRLASEGV TLLEIKSGYG LELATEEKLL RVAAKLAAEN AIDISPTLLA AHATPAEYRD DPDGYITLVC ETMIPQLWQK GLFDAVDLFC ESVGFNVAQS ERVLQTAKAL GIPVKGHVEQ LSLLGGAQLV SRYQGLSADH IEYLDEAGVA AMRDGGTVGV LLPGAFYFLR ETQRPPVELL RRYQVPVAVA SDFNPGTSPF CSLHLAMNMA CVQFGLTPEE AWAGVTRHAA RALGRQATHG QIRAGYRADF VVWDAEQPVE IVYEPGRNPL YQRVYRGKIS
|
| |