Gene SeHA_C0914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C0914 
SymbolhutI 
ID6489364 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp900109 
End bp901332 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content58% 
IMG OID642741162 
Productimidazolonepropionase 
Protein accessionYP_002044815 
Protein GI194451343 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.106012 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones93 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCAAC TTTTACCGGG CGATACTGTC TGGCGAAACA TCAGGCTGGC GACAATGGAC 
CCGCAGCAGC AAGCCCTGTA CGGGCTGGTG GATAATCAGG CGCTGATTGT GCGCGAAGGG
CATATTTGCG ATATCGTGCC AGAGACGCAG CTTCCTGTCA GCGGGGACAA TATCCATGAT
ATGCAGGGAC GACTGGTAAC CCCGGGACTT ATCGATTGCC ACACGCATCT GGTGTTTGCC
GGTAACCGCG CCGCAGAGTG GGAACAGCGG CTTAACGGCG CGTCATACCA GCATATTAGC
GCTCAGGGCG GCGGCATTAA CGCGACGGTA TCAGCAACCC GCGCCTGTGC GGAGGAGACG
CTCTACCTGC TGGCGCGCGA ACGCATGATG CGCCTTGCCA GCGAAGGCGT TACGCTGCTG
GAGATTAAAT CCGGCTATGG TCTGGAGCTG GCGACAGAAG AAAAGCTGTT GCGCGTCGCT
GCAAAACTTG CCGCCGAAAA CGCTATCGAC ATTAGCCCCA CGCTATTGGC CGCTCATGCT
ACGCCAGCGG AGTATCGTGA CGACCCGGAC GGCTACATCA CTCTGGTCTG CGAGACGATG
ATTCCGCAGC TCTGGCAAAA AGGGTTATTT GATGCGGTAG ACCTCTTTTG CGAGAGCGTC
GGCTTTAATG TGGCGCAGAG TGAGCGCGTA TTGCAGACGG CGAAGGCGTT AGGTATTCCC
GTTAAAGGCC ATGTTGAGCA GCTTTCGCTG TTGGGCGGCG CGCAGTTGGT GAGCCGTTAT
CAGGGCTTAT CGGCGGATCA TATCGAATAT CTTGATGAAG TGGGCGTCGC GGCGATGCGT
GACGGCGGTA CTGTCGGCGT ATTATTGCCC GGCGCGTTTT ATTTTCTGCG CGAGACGCAG
CGCCCGCCGG TAGAACTGCT GCGCCGCTAT CAGGTGCCTG TCGCCGTCGC CAGCGATTTC
AATCCCGGCA CCAGCCCGTT TTGCAGTTTG CATCTGGCGA TGAATATGGC CTGCGTACAG
TTTGGTCTGA CGCCGGAAGA GGCATGGGCG GGCGTTACGC GCCATGCCGC TCGCGCGCTG
GGAAGACAGG CGACGCATGG GCAGATCAGG GCCGGCTACC GGGCGGATTT TGTGGTGTGG
GATGCTGAAC AGCCGGTAGA GATAGTGTAT GAGCCGGGGC GTAACCCTTT ATATCAGCGG
GTATACAGAG GACAAATCTC ATGA
 
Protein sequence
MRQLLPGDTV WRNIRLATMD PQQQALYGLV DNQALIVREG HICDIVPETQ LPVSGDNIHD 
MQGRLVTPGL IDCHTHLVFA GNRAAEWEQR LNGASYQHIS AQGGGINATV SATRACAEET
LYLLARERMM RLASEGVTLL EIKSGYGLEL ATEEKLLRVA AKLAAENAID ISPTLLAAHA
TPAEYRDDPD GYITLVCETM IPQLWQKGLF DAVDLFCESV GFNVAQSERV LQTAKALGIP
VKGHVEQLSL LGGAQLVSRY QGLSADHIEY LDEVGVAAMR DGGTVGVLLP GAFYFLRETQ
RPPVELLRRY QVPVAVASDF NPGTSPFCSL HLAMNMACVQ FGLTPEEAWA GVTRHAARAL
GRQATHGQIR AGYRADFVVW DAEQPVEIVY EPGRNPLYQR VYRGQIS