Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B0823 |
Symbol | hutI |
ID | 6797275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 815498 |
End bp | 816721 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642775100 |
Product | imidazolonepropionase |
Protein accession | YP_002145743 |
Protein GI | 197250391 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | [TIGR01224] imidazolonepropionase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.641221 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGCAAC TTTTACCGGG CGATACTGTC TGGCGAAACA TCAGGCTGGC GACAATGGAC CCGCAGCGGC AAGCCCCGTA CGGGCTGGTG GATAACCAGG CGCTGATTGT ACGCGAAGGG CATATTTGCG ATATCGTGCC AGAGACGCAG CTTCCTGTCA GTGGGGACAA TATCCATGAT ATGCAGGGAC GACTGGTAAC CCCGGGACTT ATCGATTGCC ACACGCATCT GGTGTTTGCC GGTAACCGCG CCGCAGAGTG GGAGCAGCGG CTTAACGGCG CGTCATACCA GCATATTAGC GCTCAGGGCG GCGGCATTAA CGCGACGGTA TCAGCAACCC GCGCCTGTGC GGAGGAGCCG CTCTACCTGC TGGCGCGCGA ACGCATGATG CGCCTTGCCA GCGAAGGCGT TACGCTGCTG GAGATTAAAT CCGGCTATGG TCTGGAGCTG GCGACAGAAG AAAAGCTGTT GCGCGTTGCT GCAAAACTTG CCGCCGAAAA CGCTATCGAC ATTAGCCCCA CGCTATTGGC CGCTCATGCT ACACCAGCGG AGTATCGTGA CGACCCGGAC GGCTACATCA CTCTGGTCTG CGAGACGATG ATTCCGCAGC TCTGGCAAAA AGGGTTATTT GATGCGGTAG ACCTCTTTTG CGAGAGCGTC GGTTTTAATG TGGCGCAGAG TGAGCGCGTG TTGCAGACGG CGAAGGCGTT AGGTATTCCC GTTAAAGGCC ATGTTGAGCA GCTTTCGCTG TTGGGCGGCG CGCAGTTGTT GAGCCGTTAT CAGGGTTTAT CGGCGGATCA TATCGAATAT CTTGATGAAG CGGGCGTCGC GGCGATGCGT GACGGCGGTA CTGTCGGCGT ATTATTGCCC GGCGCGTTTT ATTTTCTGCG CGAGAGGCAG CGCCCGCCGG TAGAACTGCT GCGCCGCTAT CAGGTGCCTG TCGCCGTCGC CAGCGATTTC AATCCCGGCA CCAGCCCGTT TTGCAGTTTG CATCTGGCGA TGAATATGGC CTGCGTACAG TTTGGTCTGA CGCCGGAAGA GGTATGGGCG GGCGTTACGC GCCATGCCGC TCGCGCGCTG GGAAGACAGG CGACGCATGG GCAGATCAGG GCCGGCTACC GGGCGGATTT TGTGGTGTGG GATGCTGAAC AGCCGGTAGA GATTGTGTAT GAGCCGGGGC GTAACCCTTT ATATCAGCGG GTATACAGAG GACAAATCTC ATGA
|
Protein sequence | MRQLLPGDTV WRNIRLATMD PQRQAPYGLV DNQALIVREG HICDIVPETQ LPVSGDNIHD MQGRLVTPGL IDCHTHLVFA GNRAAEWEQR LNGASYQHIS AQGGGINATV SATRACAEEP LYLLARERMM RLASEGVTLL EIKSGYGLEL ATEEKLLRVA AKLAAENAID ISPTLLAAHA TPAEYRDDPD GYITLVCETM IPQLWQKGLF DAVDLFCESV GFNVAQSERV LQTAKALGIP VKGHVEQLSL LGGAQLLSRY QGLSADHIEY LDEAGVAAMR DGGTVGVLLP GAFYFLRERQ RPPVELLRRY QVPVAVASDF NPGTSPFCSL HLAMNMACVQ FGLTPEEVWA GVTRHAARAL GRQATHGQIR AGYRADFVVW DAEQPVEIVY EPGRNPLYQR VYRGQIS
|
| |