Gene SeD_A0882 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A0882 
SymbolhutI 
ID6872352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp876022 
End bp877245 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content59% 
IMG OID642784077 
Productimidazolonepropionase 
Protein accessionYP_002214752 
Protein GI198244788 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID[TIGR01224] imidazolonepropionase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones80 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCAAC TTTTACCGGG CGATACTGTC TGGCGAAACA TCAGGCTGGC GACAATGGAC 
CCGCAGCGGC AAGCCCCGTA CGGGCTGGTG GATAACCAGG CGCTGATTGT ACGCGAAGGG
CATATTTGCG ATATCGTGCC AGAGACGCAG CTTCCTGTCA GTGGGGACAA TATCCATGAT
ATGCAGGGAC GACTGGTAAC CCCGGGACTT ATCGATTGCC ACACGCATCT GGTGTTTGCC
GGTAACCGCG CCGCAGAGTG GGAGCAGCGG CTTAACGGCG CGTCATACCA GCATATTAGC
GCTCAGGGCG GCGGCATTAA CGCGACGGTA TCAGCAACCC GCGCCTGTGC GGAGGAGACG
CTCTACCTGC TGGCGCGCGA ACGCATGATG CGCCTTGCCA GCGAAGGCGT TACGCTGCTG
GAGATTAAAT CCGGCTATGG CCTGGAGCTG GCGACAGAAG AAAAGCTGTT GCGCGTTGCT
GCAAAACTTG CCGCCGAAAA CGCTATCGAC ATTAGCCCCA CGCTATTGGC CGCTCATGCT
ACGCCAGCGG AGTATCGTGA CGACCCGGAC GGCTACATCA CTCTGGTCTG CGAGACGATG
ATTCCGCAGC TCTGGCAAAA AGGGTTATTT GATGCGGTAG ACCTCTTTTG CGAGAGCGTC
GGCTTTAATG TGGCGCAGAG TGAGCGCGTA TTGCAGACGG CGAAGGCGTT AGGTATTCCC
GTTAAAGGCC ATGTTGAGCA GCTTTCGCTG TTGGGCGGCG CGCAGCTGGT GAGTCGCTAT
CAGGGTTTAT CGGCGGATCA TATCGAATAT CTTGATGAAG CGGGCGTCGC GGCGATGCGT
GACGGCGGTA CTGTCGGCGT GTTGTTGCCT GGCGCGTTTT ATTTTCTGCG CGAGACGCAG
CGCCCGCCGG TGGAACTGCT GCGCCGCTAT CAGGTGCCTG TCGCCGTCGC CAGCGATTTC
AATCCCGGCA CCAGCCCGTT TTGCAGTTTG CATCTGGCGA TGAATATGGC CTGCGTACAG
TTTGGTCTGA CGTCGGAAGA GGCATGGGCG GGCGTTACGC GCCATGCCGC TCGTGCGCTG
GGAAGACAGG CGACGCATGG GCAGCTCAGG GCCGACTACC GGGCGGATTT TGTGGTGTGG
GATGCTGAAC AGCCGGTAGA GGTTGTGTAT GAGCCGGGGC GTAATCCTTT ATATCAGCGG
GTATACAGAG GACAAATCTC ATGA
 
Protein sequence
MRQLLPGDTV WRNIRLATMD PQRQAPYGLV DNQALIVREG HICDIVPETQ LPVSGDNIHD 
MQGRLVTPGL IDCHTHLVFA GNRAAEWEQR LNGASYQHIS AQGGGINATV SATRACAEET
LYLLARERMM RLASEGVTLL EIKSGYGLEL ATEEKLLRVA AKLAAENAID ISPTLLAAHA
TPAEYRDDPD GYITLVCETM IPQLWQKGLF DAVDLFCESV GFNVAQSERV LQTAKALGIP
VKGHVEQLSL LGGAQLVSRY QGLSADHIEY LDEAGVAAMR DGGTVGVLLP GAFYFLRETQ
RPPVELLRRY QVPVAVASDF NPGTSPFCSL HLAMNMACVQ FGLTSEEAWA GVTRHAARAL
GRQATHGQLR ADYRADFVVW DAEQPVEVVY EPGRNPLYQR VYRGQIS