Gene SNSL254_A0851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A0851 
SymbolhutI 
ID6483118 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp856153 
End bp857415 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content58% 
IMG OID642736263 
Productimidazolonepropionase 
Protein accessionYP_002040023 
Protein GI194445732 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones92 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTATAC AACCCATGAC AGGCAAGAGA GCGACAGGAA TGCGGCAACT TTTACGGGGC 
GATACTGTCT GGCGAAACAT CAGGCTGGCG ACAATGGACC CGCAGCGGCA AGCCCCGTAC
GGGCTGGTGG ATAACCAGGC GCTGATTGTA CGCGAAGGGC ATATTTGCGA TATCGTGCCA
GAGACGCAGC TTCCTGTCAG TGGGGACAAT ATCCATGATA TGCAGGGACG ACTGGTAACC
CCGGGACTTA TCGATTGCCA CACGCATCTG GTGTTTGCCG GTAACCGCGC CGCAGAGTGG
GAGCAGCGGC TTAACGGCGC GTCATACCAG CATATTAGCG CTCAGGGCGG CGGCATTAAC
GCGACGGTAT CAGCAACCCG CGCCTGTGCG GAGGAGACGC TCTACCTGCT GGCGCGCGAA
CGCATGATGC GCCTTGCCAG CGAAGGCGTT ACGCTGCTGG AGATTAAATC CGGCTATGGC
CTGGAGCTGG CGACAGAAGA AAAGCTGTTG CGCGTTGCTG CAAAACTTGC CGCCGAAAAC
GCTATCGACA TTAGCCCCAC GCTATTGGCC GCTCATGCTA CGCCAGCGGA GTATCGTGAC
GACCCGGACG GCTACATCAC TCTGGTCTGC GAGACGATGA TTCCGCAGCT CTGGCAAAAA
GGGTTATTTG ATGCGGTAGA CCTCTTTTGC GAGAGCGTCG GCTTTAATGT GGCCCAGAGT
GAGCGCGTGT TGCAGACGGC GAAGGCGTTA GGTATTCCCG TTAAAGGCCA TGTTGAGCAG
CTTTCGCTGT TGGGCGGCGC GCAGCTGGTG AGTCGTTATC AGGGTTTATC GGCGGATCAT
ATCGAATATC TTGATGAAGC GGGCGTCGCG GCGATGCGTG ACGGCGGTAC TGTCGGCGTG
TTGTTGCCCG GCGCGTTTTA TTTTCTGCGC GAGACGCAGC GCCCGCCGGT TGAACTGCTG
CGCCGCTATC AGGTGCCTGT CGCCGTCGCC AGCGATTTCA ATCCCGGCAC CAGCCCGTTT
TGCAGTTTGC ATCTGGCGAT GAATATGGCC TGCGTACAGT TTGGTCTGAC GCCGGAAGAG
GCATGGGCGG GCGTTACGCG CCATGCCGCT CGCGCGCTGG GAAGACAGGC GACGCATGGG
CAGATCAGGG CCGGCTACCG GGCGGATTTT GTGGTGTGGG ATGCTGAACA GCCGGTAGAG
ATAGTGTATG AGCCGGGGCG TAACCCTTTA TATCAGCGGG TATACAGAGG AAAAATCTCA
TGA
 
Protein sequence
MSIQPMTGKR ATGMRQLLRG DTVWRNIRLA TMDPQRQAPY GLVDNQALIV REGHICDIVP 
ETQLPVSGDN IHDMQGRLVT PGLIDCHTHL VFAGNRAAEW EQRLNGASYQ HISAQGGGIN
ATVSATRACA EETLYLLARE RMMRLASEGV TLLEIKSGYG LELATEEKLL RVAAKLAAEN
AIDISPTLLA AHATPAEYRD DPDGYITLVC ETMIPQLWQK GLFDAVDLFC ESVGFNVAQS
ERVLQTAKAL GIPVKGHVEQ LSLLGGAQLV SRYQGLSADH IEYLDEAGVA AMRDGGTVGV
LLPGAFYFLR ETQRPPVELL RRYQVPVAVA SDFNPGTSPF CSLHLAMNMA CVQFGLTPEE
AWAGVTRHAA RALGRQATHG QIRAGYRADF VVWDAEQPVE IVYEPGRNPL YQRVYRGKIS