Gene SeSA_A0937 details


Gene Information

Locus tag: SeSA_A0937
Symbol: hutI
ID: 6519614
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633
Kingdom: Bacteria
Replicon accession: NC_011094
Strand:
Start bp: 907971
End bp: 909194
Gene Length: 1224 bp
Protein Length: 407 aa
Translation table: 11
GC content: 58%
IMG OID: 642746069
Product: imidazolonepropionase
Protein accession: YP_002113880
Protein GI: 194737656
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID: [TIGR01224] imidazolonepropionase
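
The gene and protein lengths above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and that the gene length includes the terminal stop codon (the function names are illustrative, not part of the record):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acid count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(907971, 909194)
print(length, protein_length(length))  # 1224 407
```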


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.91714
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCGGCAAC TTTTACCGGG CGATACTGTC TGGCGAAACA TCAGGCTGGC GACAATGGAC 
CCGCAGCGGC AAGCCCCGTA CGGGCTGGTG GATAACCAGG CGCTGATTGT ACGCGGAGGG
CATATTTGCG ATATCGTGCC AGAGACGCAG CTTCCTGTCA GCGGGGACAA TATCCATGAT
ATGCAGGGAC GACTGGTAAC CCCGGGACTT ATCGATTGCC ACACGCATCT GGTGTTTGCC
GGTAACCGCG CCGCAGAGTG GGAACAGCGG CTTAACGGCG CGTCATACCA GCATATTAGC
GCTCAGGGTG GCGGCATTAA CGCGACGGTA TCAGCAACCC GCGCCTGTGC AGAAGAGACG
CTCTACCTGC TGGCGCGCGA ACGCATGATG CGCCTTGCCA GCGAAGGCGT TACGCTGCTG
GAGATTAAAT CCGGCTATGG TCTGGAGCTG GCGACAGAAG AAAAGCTGTT GCGCGTCGCT
GCAAAACTTG CCGCCGAAAA CGCTATCGAC ATTAGCCCCA CGCTACTTGC CGCTCATGCT
ACGCCAGCGG AGTATCGTGA CGACCCGGAC GGCTACATCA CTCTGGTCTG CGAGACGATG
ATTCCGCAGC TCTGGCAAAA AGGGTTATTT GATGCGGTAG ACCTCTTTTG CGAGAGCGTC
GGCTTTAATG TGGCGCAGAG TGAGCGCGTA TTGCAGACGG CGAAGGCGTT AGGTATTCCC
GTTAAAGGCC ATGTTGAGCA GCTTTCGCTG TTGGGCGGCG CGCAGCTGGT GAGTCGTTAT
CAGGGTTTAT CGGCGGATCA TATCGAATAT CTTGATGAAG CGGGCGTCGC GGCGATGCGT
GACGGCGGTA CTGTCGGCGT GTTGTTGCCC GGCGCGTTTT ATTTTCTGCG CGAGACGCAG
CGCCCGCCGG TCGAACTGCT GCGCCGCTAT CAGGTGCCTG TCGCCGTCGC CAGCGATTTC
AATCCCGGCA CCAGCCCGTT TTGCAGTTTG CATCTGGCGA TGAATATGGC CTGCGTACAG
TTTGGTCTGA CGCCGGAAGA GGCATGGGCG GGCGTTACTC GCCATGCCGC TCGCGCGCTG
GGAAGACAGG CGACGCATGG GCAGATCAGG GCCGGCTACC GGGCGGATTT TGTGGTATGG
GATGCTGAAC AGCCGGTAGA GATAGTGTAT GAGCCGGGGC GTAACCCTTT ATATCAGCGG
GTATACAGAG GAAAAATCTC ATGA
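
A GC content figure like the one listed under Gene Information is usually the share of G and C bases in the nucleotide sequence. A minimal sketch, assuming the sequence is pasted as text with the spaces and line breaks shown above (`gc_percent` is a hypothetical helper, and the rounding used by the source database is not stated in the record):

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq."""
    bases = [b for b in seq.upper() if b in "ACGT"]  # skips spaces and newlines
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence, as an example input.
print(gc_percent("ATGCGGCAAC"))  # 60.0
```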
 
Protein sequence
MRQLLPGDTV WRNIRLATMD PQRQAPYGLV DNQALIVRGG HICDIVPETQ LPVSGDNIHD 
MQGRLVTPGL IDCHTHLVFA GNRAAEWEQR LNGASYQHIS AQGGGINATV SATRACAEET
LYLLARERMM RLASEGVTLL EIKSGYGLEL ATEEKLLRVA AKLAAENAID ISPTLLAAHA
TPAEYRDDPD GYITLVCETM IPQLWQKGLF DAVDLFCESV GFNVAQSERV LQTAKALGIP
VKGHVEQLSL LGGAQLVSRY QGLSADHIEY LDEAGVAAMR DGGTVGVLLP GAFYFLRETQ
RPPVELLRRY QVPVAVASDF NPGTSPFCSL HLAMNMACVQ FGLTPEEAWA GVTRHAARAL
GRQATHGQIR AGYRADFVVW DAEQPVEIVY EPGRNPLYQR VYRGKIS
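
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. A minimal sketch of that translation follows; table 11 uses the same codon-to-amino-acid assignments as the standard code, and its alternative start codons are not handled here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative only.

```python
# Codon table in the usual NCBI layout: bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a forward-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence above.
print(translate("ATGCGGCAAC TTTTACCGGG CGATACTGTC"))  # MRQLLPGDTV
print(translate("GTATACAGAG GAAAAATCTC ATGA"))        # VYRGKIS
```

Passing the full gene sequence to `translate` would give a protein string that can be compared against the one listed above, ignoring the spacing.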