Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeAg_B0826 |
Symbol | hutU |
ID | 6794376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Agona str. SL483 |
Kingdom | Bacteria |
Replicon accession | NC_011149 |
Strand | + |
Start bp | 818626 |
End bp | 820311 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642775103 |
Product | urocanate hydratase |
Protein accession | YP_002145746 |
Protein GI | 197250926 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGAAA GCAAGTATCG TCAGCAGACT ATCCGCGCGC CCAGAGGCAC GGTATTAACG GCGAAAAGCT GGCTGACAGA AGCCCCGCTG CGGATGTTAA TGAATAATCT CGATCCTGAC GTGGCGGAAA ATCCGCATGA GCTGGTGGTC TACGGCGGGA TTGGTCGCGC CGCGCGCAAC TGGGAATGCT ATGACGCTAT TGTTGATGCG CTCACCAGGC TGGAGGCGGA CGAAACGTTG CTTATTCAGT CTGGCAAACC GGTCGGCGTA TTTAAAACGC ACGACAACGC GCCGCGGGTA TTAATCGCCA ACTCCAACCT GGTTCCCCAC TGGGCGACAT GGGAACACTT TAACGAACTG GATGCGAAAG GGCTGGCGAT GTACGGTCAA ATGACGGCCG GAAGCTGGAT CTATATCGGC AGCCAGGGAA TCGTGCAGGG AACATACGAA ACCTTTGTCG AGGCGGGGCG TCAGCACTAT AACGGCACGC TGGCGGGACG CTGGGTGCTG ACTGCCGGGC TGGGCGGCAT GGGCGGCGCG CAACCGCTAG CCGCGACGCT GGCCGGAGCG TGTTCGCTGA CGATTGAATG CCAGCAAAGC CGTATCGATT TTCGTCTGCG TACTCGTTAC GTGGATGAGC AGGCCGCCAC GCTGGATGAC GCGCTGGCCC GCATTACGCG CTACACCCGC GAGGGGAAAG CCGTGTCCGT CGCCCTGTGC GCGAACGCGG CGGATATCCT GCCGGAACTG GTTAATCGCG GCGTGCGCCC GGACCTGGTG ACCGATCAGA CCAGCGCCCA CGATCCGCTA CATGGCTATT TACCCTCCGG CTGGCGCTGG GAGGAGTATC AGAAAAACGC GCAATCCGAT CCCCACGGGA CGATGCAGGC AGCGAAACGT TCCATGGCGG CACATGTTCG GGCGATGCTG GCGTTCAGTC AAATGGGCGT GCCGACCTTT GACTATGGCA ACAATATTCG CCAGATGGCG AAAGAGATGG GGGTGGAAAA CGCCTTTGAT TTTCCGGGAT TTGTGCCAGC CTATATTCGT CCGCTGTTCT GTCGTGGTAT CGGGCCGTTT CGCTGGGTGG CGCTGTCCGG CGATCCGCAG GATATCTATA AAACCGATGC CAAAGTCAAA GAGATAGTGG CTGAGGATAA ACATCTGCAT CACTGGCTGG ATATGGCGCG CGAGCGCATT CATTTTCAGG GGCTACCGGC GCGTATCTGC TGGGTAGGCC TGGAGTGGCG GCAAAAACTG GGGCTGGCGT TCAACGAAAT GGTGCGTTGC GGCGAGGTAT CCGCGCCCAT TGTGATTGGC CGCGATCACC TGGATTCCGG TTCTGTCGCC AGCCCTAACC GTGAAACCGA AGCGATGCGC GACGGCTCCG ACGCGGTTTC CGACTGGCCG CTGTTAAATG CGTTGCTGAA TACCGCCAGC GGGGCGACAT GGGTATCGCT CCATCATGGC GGCGGAGTGG GGATGGGTTT TTCGCAACAC GCCGGTATGG TGATTGTCTG TGATGGCACT GACGAGGCCG CCGCGCGTAT TCGCCGCGTG TTACACAACG ATCCGGCGAC GGGCGTCATG CGCCATGCCG ATGCCGGATA TGATCTCGCG GTGGAATGCG CTGTTGAGCA AGGTCTGAAT TTACCGATGG TTGCGGCGAC GCAGGGGAAA GGCTGA
|
Protein sequence | MPESKYRQQT IRAPRGTVLT AKSWLTEAPL RMLMNNLDPD VAENPHELVV YGGIGRAARN WECYDAIVDA LTRLEADETL LIQSGKPVGV FKTHDNAPRV LIANSNLVPH WATWEHFNEL DAKGLAMYGQ MTAGSWIYIG SQGIVQGTYE TFVEAGRQHY NGTLAGRWVL TAGLGGMGGA QPLAATLAGA CSLTIECQQS RIDFRLRTRY VDEQAATLDD ALARITRYTR EGKAVSVALC ANAADILPEL VNRGVRPDLV TDQTSAHDPL HGYLPSGWRW EEYQKNAQSD PHGTMQAAKR SMAAHVRAML AFSQMGVPTF DYGNNIRQMA KEMGVENAFD FPGFVPAYIR PLFCRGIGPF RWVALSGDPQ DIYKTDAKVK EIVAEDKHLH HWLDMARERI HFQGLPARIC WVGLEWRQKL GLAFNEMVRC GEVSAPIVIG RDHLDSGSVA SPNRETEAMR DGSDAVSDWP LLNALLNTAS GATWVSLHHG GGVGMGFSQH AGMVIVCDGT DEAAARIRRV LHNDPATGVM RHADAGYDLA VECAVEQGLN LPMVAATQGK G
|
| |