Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeSA_A0937 |
Symbol | hutI |
ID | 6519614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 |
Kingdom | Bacteria |
Replicon accession | NC_011094 |
Strand | + |
Start bp | 907971 |
End bp | 909194 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642746069 |
Product | imidazolonepropionase |
Protein accession | YP_002113880 |
Protein GI | 194737656 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.91714 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCAAC TTTTACCGGG CGATACTGTC TGGCGAAACA TCAGGCTGGC GACAATGGAC CCGCAGCGGC AAGCCCCGTA CGGGCTGGTG GATAACCAGG CGCTGATTGT ACGCGGAGGG CATATTTGCG ATATCGTGCC AGAGACGCAG CTTCCTGTCA GCGGGGACAA TATCCATGAT ATGCAGGGAC GACTGGTAAC CCCGGGACTT ATCGATTGCC ACACGCATCT GGTGTTTGCC GGTAACCGCG CCGCAGAGTG GGAACAGCGG CTTAACGGCG CGTCATACCA GCATATTAGC GCTCAGGGTG GCGGCATTAA CGCGACGGTA TCAGCAACCC GCGCCTGTGC AGAAGAGACG CTCTACCTGC TGGCGCGCGA ACGCATGATG CGCCTTGCCA GCGAAGGCGT TACGCTGCTG GAGATTAAAT CCGGCTATGG TCTGGAGCTG GCGACAGAAG AAAAGCTGTT GCGCGTCGCT GCAAAACTTG CCGCCGAAAA CGCTATCGAC ATTAGCCCCA CGCTACTTGC CGCTCATGCT ACGCCAGCGG AGTATCGTGA CGACCCGGAC GGCTACATCA CTCTGGTCTG CGAGACGATG ATTCCGCAGC TCTGGCAAAA AGGGTTATTT GATGCGGTAG ACCTCTTTTG CGAGAGCGTC GGCTTTAATG TGGCGCAGAG TGAGCGCGTA TTGCAGACGG CGAAGGCGTT AGGTATTCCC GTTAAAGGCC ATGTTGAGCA GCTTTCGCTG TTGGGCGGCG CGCAGCTGGT GAGTCGTTAT CAGGGTTTAT CGGCGGATCA TATCGAATAT CTTGATGAAG CGGGCGTCGC GGCGATGCGT GACGGCGGTA CTGTCGGCGT GTTGTTGCCC GGCGCGTTTT ATTTTCTGCG CGAGACGCAG CGCCCGCCGG TCGAACTGCT GCGCCGCTAT CAGGTGCCTG TCGCCGTCGC CAGCGATTTC AATCCCGGCA CCAGCCCGTT TTGCAGTTTG CATCTGGCGA TGAATATGGC CTGCGTACAG TTTGGTCTGA CGCCGGAAGA GGCATGGGCG GGCGTTACTC GCCATGCCGC TCGCGCGCTG GGAAGACAGG CGACGCATGG GCAGATCAGG GCCGGCTACC GGGCGGATTT TGTGGTATGG GATGCTGAAC AGCCGGTAGA GATAGTGTAT GAGCCGGGGC GTAACCCTTT ATATCAGCGG GTATACAGAG GAAAAATCTC ATGA
|
Protein sequence | MRQLLPGDTV WRNIRLATMD PQRQAPYGLV DNQALIVRGG HICDIVPETQ LPVSGDNIHD MQGRLVTPGL IDCHTHLVFA GNRAAEWEQR LNGASYQHIS AQGGGINATV SATRACAEET LYLLARERMM RLASEGVTLL EIKSGYGLEL ATEEKLLRVA AKLAAENAID ISPTLLAAHA TPAEYRDDPD GYITLVCETM IPQLWQKGLF DAVDLFCESV GFNVAQSERV LQTAKALGIP VKGHVEQLSL LGGAQLVSRY QGLSADHIEY LDEAGVAAMR DGGTVGVLLP GAFYFLRETQ RPPVELLRRY QVPVAVASDF NPGTSPFCSL HLAMNMACVQ FGLTPEEAWA GVTRHAARAL GRQATHGQIR AGYRADFVVW DAEQPVEIVY EPGRNPLYQR VYRGKIS
|
| |