Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A0854 |
Symbol | hutU |
ID | 6485173 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 859320 |
End bp | 861005 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642736266 |
Product | urocanate hydratase |
Protein accession | YP_002040026 |
Protein GI | 194445492 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.668496 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 84 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAAA GCAAGTATCG TCAGCAGACT ATCCGCGCGC CCAGAGGCAC GGTATTAACG GCGAAAAGCT GGCTGACAGA AGCCCCGCTG CGGATGTTAA TGAATAATCT CGATCCTGAC GTGGCGGAAA ATCCGCATGA GCTGGTGGTC TACGGCGGGA TTGGTCGCGC CGCGCGCAAC TGGGAATGCT ATGACGCTAT TGTTGATGCG CTCACCCGGC TGGAGGCGGA CGAAACGTTG CTTATTCAGT CTGGCAAACC GGTCGGCGTA TTTAAAACGC ACGACAACGC GCCGCGGGTA TTAATCGCCA ACTCCAACCT GGTTCCCCAC TGGGCGACAT GGGAACACTT TAACGAACTG GATGCGAAAG GGCTGGCGAT GTACGGTCAA ATGACGGCCG GAAGCTGGAT CTATATCGGC AGTCAGGGAA TCGTGCAGGG AACATACGAA ACCTTTGTCG AGGCGGGGCG TCAGCACTAT AACGGTACGC TGGCGGGACG CTGGGTGCTG ACCGCCGGAC TGGGCGGCAT GGGCGGCGCG CAACCGCTAG CCGCGACGCT GGCTGGAGCG TGTTCGCTGA CGATTGAATG CCAGCAAAGC CGTATCGATT TTCGTCTGCG TACTCGCTAC GTGGATGAGC AGGCCGCCAC GCTGGATGAC GCGCTGGCCC GCATTACGCG CTACACCCGC GAGGGGAAAG CCGTGTCCGT CGCCCTGTGC GCGAACGCGG CGGATATCCT GCCGGAACTG GTTAATCGCG GCGTGCGCCC GGACCTGGTG ACCGATCAGA CCAGCGCCCA CGATCCGCTA CATGGCTATT TACCCTCCGG CTGGCGCTGG GAGGAGTATC AGAAAAACGC GCAATCCGAT CCCCACGGGA CGATGCAGGC AGCGAAACGT TCCATGGCGG CGCATGTTCG GGCGATGCTG GCGTTCAGTA AAATGGGCGT GCCGACCTTT GACTATGGCA ACAATATTCG CCAGATGGCG AAAGAGATGG GGGTGGAAAA CGCCTTTGAT TTTCCGGGAT TTGTGCCAGC CTATATTCGT CCGCTGTTCT GTCGTGGCAT CGGGCCGTTT CGCTGGGTGG CGCTGTCCGG CGATCCGCAG GATATCTATA AAACCGATGC CAAAGTCAAA GAGATAGTGG CTGAGGATAA ACATCTGCAT CACTGGCTGG ATATGGCGCG CGAGCGCATT CATTTTCAGG GGTTACCGGC GCGTATCTGC TGGGTAGGCC TGGAGTGGCG GCAAAAACTG GGGCTGGCGT TCAACGAAAT GGTGCGTTGC GGCGAGGTAT CCGCGCCCAT TGTGATTGGC CGCGATCACC TGGATTCCGG CTCTGTCGCC AGCCCTAACC GTGAAACCGA AGCGATGCGC GACGGTTCCG ACGCGGTTTC CGACTGGCCG CTGTTAAATG CGTTGCTGAA TACCGCCAGC GGGGCGACAT GGGTATCGCT CCATCATGGC GGCGGGGTGG GAATGGGGTT TTCGCAACAC GCCGGTATGG TGATTGTCTG TGATGGCACT GACGAGGCCG CCGCGCGTAT TCGCCGCGTG TTACACAACG ATCCGGCGAC GGGCGTCATG CGCCATGCCG ATGCCGGATA TGATCTCGCG GTGGAATGCT CTGTTGAGCA AGGTCTGAAT TTACCGATGG TTGCGGCGAC GCAGGGGAAA GGCTGA
|
Protein sequence | MPESKYRQQT IRAPRGTVLT AKSWLTEAPL RMLMNNLDPD VAENPHELVV YGGIGRAARN WECYDAIVDA LTRLEADETL LIQSGKPVGV FKTHDNAPRV LIANSNLVPH WATWEHFNEL DAKGLAMYGQ MTAGSWIYIG SQGIVQGTYE TFVEAGRQHY NGTLAGRWVL TAGLGGMGGA QPLAATLAGA CSLTIECQQS RIDFRLRTRY VDEQAATLDD ALARITRYTR EGKAVSVALC ANAADILPEL VNRGVRPDLV TDQTSAHDPL HGYLPSGWRW EEYQKNAQSD PHGTMQAAKR SMAAHVRAML AFSKMGVPTF DYGNNIRQMA KEMGVENAFD FPGFVPAYIR PLFCRGIGPF RWVALSGDPQ DIYKTDAKVK EIVAEDKHLH HWLDMARERI HFQGLPARIC WVGLEWRQKL GLAFNEMVRC GEVSAPIVIG RDHLDSGSVA SPNRETEAMR DGSDAVSDWP LLNALLNTAS GATWVSLHHG GGVGMGFSQH AGMVIVCDGT DEAAARIRRV LHNDPATGVM RHADAGYDLA VECSVEQGLN LPMVAATQGK G
|
| |