Gene Information |
Locus tag | SeHA_C0914 |
Symbol | hutI |
ID | 6489364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 900109 |
End bp | 901332 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642741162 |
Product | imidazolonepropionase |
Protein accession | YP_002044815 |
Protein GI | 194451343 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
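
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 900109
end_bp = 901332

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1224 407
```

Under those assumptions the computed values agree with the listed Gene Length (1224 bp) and Protein Length (407 aa).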
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.106012 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 93 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCAAC TTTTACCGGG CGATACTGTC TGGCGAAACA TCAGGCTGGC GACAATGGAC CCGCAGCAGC AAGCCCTGTA CGGGCTGGTG GATAATCAGG CGCTGATTGT GCGCGAAGGG CATATTTGCG ATATCGTGCC AGAGACGCAG CTTCCTGTCA GCGGGGACAA TATCCATGAT ATGCAGGGAC GACTGGTAAC CCCGGGACTT ATCGATTGCC ACACGCATCT GGTGTTTGCC GGTAACCGCG CCGCAGAGTG GGAACAGCGG CTTAACGGCG CGTCATACCA GCATATTAGC GCTCAGGGCG GCGGCATTAA CGCGACGGTA TCAGCAACCC GCGCCTGTGC GGAGGAGACG CTCTACCTGC TGGCGCGCGA ACGCATGATG CGCCTTGCCA GCGAAGGCGT TACGCTGCTG GAGATTAAAT CCGGCTATGG TCTGGAGCTG GCGACAGAAG AAAAGCTGTT GCGCGTCGCT GCAAAACTTG CCGCCGAAAA CGCTATCGAC ATTAGCCCCA CGCTATTGGC CGCTCATGCT ACGCCAGCGG AGTATCGTGA CGACCCGGAC GGCTACATCA CTCTGGTCTG CGAGACGATG ATTCCGCAGC TCTGGCAAAA AGGGTTATTT GATGCGGTAG ACCTCTTTTG CGAGAGCGTC GGCTTTAATG TGGCGCAGAG TGAGCGCGTA TTGCAGACGG CGAAGGCGTT AGGTATTCCC GTTAAAGGCC ATGTTGAGCA GCTTTCGCTG TTGGGCGGCG CGCAGTTGGT GAGCCGTTAT CAGGGCTTAT CGGCGGATCA TATCGAATAT CTTGATGAAG TGGGCGTCGC GGCGATGCGT GACGGCGGTA CTGTCGGCGT ATTATTGCCC GGCGCGTTTT ATTTTCTGCG CGAGACGCAG CGCCCGCCGG TAGAACTGCT GCGCCGCTAT CAGGTGCCTG TCGCCGTCGC CAGCGATTTC AATCCCGGCA CCAGCCCGTT TTGCAGTTTG CATCTGGCGA TGAATATGGC CTGCGTACAG TTTGGTCTGA CGCCGGAAGA GGCATGGGCG GGCGTTACGC GCCATGCCGC TCGCGCGCTG GGAAGACAGG CGACGCATGG GCAGATCAGG GCCGGCTACC GGGCGGATTT TGTGGTGTGG GATGCTGAAC AGCCGGTAGA GATAGTGTAT GAGCCGGGGC GTAACCCTTT ATATCAGCGG GTATACAGAG GACAAATCTC ATGA
Protein sequence | MRQLLPGDTV WRNIRLATMD PQQQALYGLV DNQALIVREG HICDIVPETQ LPVSGDNIHD MQGRLVTPGL IDCHTHLVFA GNRAAEWEQR LNGASYQHIS AQGGGINATV SATRACAEET LYLLARERMM RLASEGVTLL EIKSGYGLEL ATEEKLLRVA AKLAAENAID ISPTLLAAHA TPAEYRDDPD GYITLVCETM IPQLWQKGLF DAVDLFCESV GFNVAQSERV LQTAKALGIP VKGHVEQLSL LGGAQLVSRY QGLSADHIEY LDEVGVAAMR DGGTVGVLLP GAFYFLRETQ RPPVELLRRY QVPVAVASDF NPGTSPFCSL HLAMNMACVQ FGLTPEEAWA GVTRHAARAL GRQATHGQIR AGYRADFVVW DAEQPVEIVY EPGRNPLYQR VYRGQIS
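
The relationship between the gene sequence and the protein sequence can be illustrated with a short Python sketch. It builds a codon table from the standard genetic code (translation table 11 uses the same codon-to-amino-acid assignments and differs only in the permitted start codons), translates a nucleotide string up to the first stop codon, and computes GC content. The function names `clean`, `translate`, and `gc_content` are hypothetical helpers written for this example, not part of any database tooling, and the input is only the first 30 bases of the gene sequence above.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
prefix = "ATGCGGCAAC TTTTACCGGG CGATACTGTC"
print(translate(prefix))  # prints: MRQLLPGDTV
print(round(gc_content(prefix), 3))
```

The translated prefix matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same helpers would be the natural way to compare against the listed protein and the reported GC content.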