Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A0882 |
Symbol | hutI |
ID | 6872352 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 876022 |
End bp | 877245 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642784077 |
Product | imidazolonepropionase |
Protein accession | YP_002214752 |
Protein GI | 198244788 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 80 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCAAC TTTTACCGGG CGATACTGTC TGGCGAAACA TCAGGCTGGC GACAATGGAC CCGCAGCGGC AAGCCCCGTA CGGGCTGGTG GATAACCAGG CGCTGATTGT ACGCGAAGGG CATATTTGCG ATATCGTGCC AGAGACGCAG CTTCCTGTCA GTGGGGACAA TATCCATGAT ATGCAGGGAC GACTGGTAAC CCCGGGACTT ATCGATTGCC ACACGCATCT GGTGTTTGCC GGTAACCGCG CCGCAGAGTG GGAGCAGCGG CTTAACGGCG CGTCATACCA GCATATTAGC GCTCAGGGCG GCGGCATTAA CGCGACGGTA TCAGCAACCC GCGCCTGTGC GGAGGAGACG CTCTACCTGC TGGCGCGCGA ACGCATGATG CGCCTTGCCA GCGAAGGCGT TACGCTGCTG GAGATTAAAT CCGGCTATGG CCTGGAGCTG GCGACAGAAG AAAAGCTGTT GCGCGTTGCT GCAAAACTTG CCGCCGAAAA CGCTATCGAC ATTAGCCCCA CGCTATTGGC CGCTCATGCT ACGCCAGCGG AGTATCGTGA CGACCCGGAC GGCTACATCA CTCTGGTCTG CGAGACGATG ATTCCGCAGC TCTGGCAAAA AGGGTTATTT GATGCGGTAG ACCTCTTTTG CGAGAGCGTC GGCTTTAATG TGGCGCAGAG TGAGCGCGTA TTGCAGACGG CGAAGGCGTT AGGTATTCCC GTTAAAGGCC ATGTTGAGCA GCTTTCGCTG TTGGGCGGCG CGCAGCTGGT GAGTCGCTAT CAGGGTTTAT CGGCGGATCA TATCGAATAT CTTGATGAAG CGGGCGTCGC GGCGATGCGT GACGGCGGTA CTGTCGGCGT GTTGTTGCCT GGCGCGTTTT ATTTTCTGCG CGAGACGCAG CGCCCGCCGG TGGAACTGCT GCGCCGCTAT CAGGTGCCTG TCGCCGTCGC CAGCGATTTC AATCCCGGCA CCAGCCCGTT TTGCAGTTTG CATCTGGCGA TGAATATGGC CTGCGTACAG TTTGGTCTGA CGTCGGAAGA GGCATGGGCG GGCGTTACGC GCCATGCCGC TCGTGCGCTG GGAAGACAGG CGACGCATGG GCAGCTCAGG GCCGACTACC GGGCGGATTT TGTGGTGTGG GATGCTGAAC AGCCGGTAGA GGTTGTGTAT GAGCCGGGGC GTAATCCTTT ATATCAGCGG GTATACAGAG GACAAATCTC ATGA
|
Protein sequence | MRQLLPGDTV WRNIRLATMD PQRQAPYGLV DNQALIVREG HICDIVPETQ LPVSGDNIHD MQGRLVTPGL IDCHTHLVFA GNRAAEWEQR LNGASYQHIS AQGGGINATV SATRACAEET LYLLARERMM RLASEGVTLL EIKSGYGLEL ATEEKLLRVA AKLAAENAID ISPTLLAAHA TPAEYRDDPD GYITLVCETM IPQLWQKGLF DAVDLFCESV GFNVAQSERV LQTAKALGIP VKGHVEQLSL LGGAQLVSRY QGLSADHIEY LDEAGVAAMR DGGTVGVLLP GAFYFLRETQ RPPVELLRRY QVPVAVASDF NPGTSPFCSL HLAMNMACVQ FGLTSEEAWA GVTRHAARAL GRQATHGQLR ADYRADFVVW DAEQPVEVVY EPGRNPLYQR VYRGQIS
|
| |