Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C0917 |
Symbol | hutU |
ID | 6491999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 903237 |
End bp | 904922 |
Gene Length | 1686 bp |
Protein Length | 561 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 642741165 |
Product | urocanate hydratase |
Protein accession | YP_002044818 |
Protein GI | 194447536 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0937043 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 95 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTGAAA GCAAGTATCG TCAGCAGACT ATCCGCGCGC CCCGAGGCAC GGTATTAACG GCGAAAAGCT GGCTGACAGA AGCCCCGCTG CGGATGTTAA TGAATAATCT CGATCCTGAC GTGGCGGAAA ATCCGCATGA GCTGGTGGTC TACGGCGGGA TTGGTCGCGC CGCGCGCAAC TGGGAATGCT ATGACGCTAT TGTTGATGCG CTCACCCGGC TGGAGGCGGA CGAAACGTTG CTTATTCAGT CTGGCAAACC GGTCGGCGTA TTTAAAACGC ACGACAACGC GCCGCGGGTA TTAATCGCCA ACTCCAACCT GGTTCCCCAC TGGGCGACAT GGGAACACTT TAACGAACTG GATGCGAAAG GGCTGGCGAT GTACGGTCAA ATGACGGCCG GAAGCTGGAT CTATATCGGC AGCCAGGGAA TCGTGCAGGG AACATACGAA ACCTTTGTCG AGGCGGGGCG TCAGCACTAT AACGGCACGC TGGCGGGACG CTGGGTGCTG ACTGCCGGGC TGGGCGGCAT GGGCGGCGCG CAACCGCTAG CCGCGACGCT GGCCGGAGCG TGTTCGCTGA CGATTGAATG CCAGCAAAGC CGTATCGATT TTCGTCTGCG TACTCGCTAC GTGGATGAGC AGGCCGCCAC GCTGGATGAC GCGCTGGCCC GCATTACTCG CTACACCCGC GAGGGGAAAG CCGTGTCCGT CGCCCTGTGC GCGAATGCGG CGGATATCCT GCCGGAACTG GTTAATCGCG GCGTGCGCCC GGACCTGGTG ACCGATCAGA CCAGCGCCCA CGATCCGCTA CATGGCTATT TACCCTCCGG CTGGCGCTGG GAGGAGTATC AGAAAAACGC GCAATCCGAT CCCCACGGGA CGATGCAGGC AGCGAAACGT TCCATGGCGG CACATGTTCG GGCGATGCTG GCGTTCAGTC AAATGGGCGT GCCGACCTTT GACTATGGCA ACAATATTCG TCAGATGGCG AAAGAGATGG GGGTGGAAAA CGCCTTTGAT TTTCCGGGAT TTGTGCCAGC CTATATTCGT CCGCTGTTCT GCCGTGGCAT CGGGCCGTTT CGCTGGGTGG CGCTGTCCGG CGATCCGCAG GATATCTATA AAACCGATGC CAAAGTCAAA GAGATAGTGG CTGAGGATAA ACATCTGCAT CACTGGCTGG ATATGGCGCG CGAGCGCATT CATTTTCAGG GGTTACCGGC GCGTATCTGC TGGGTAGGCC TGGAGTGGCG GCAAAAACTG GGGCTGGCGT TCAACGAAAT GGTGCGTTGC GGCGAGGTAT CCGCGCCCAT TGTGATTGGC CGCGATCACC TGGATTCCGG CTCTGTCGCC AGCCCTAACC GTGAAACCGA AGCGATGCGC GACGGTTCCG ACGCGGTTTC CGACTGGCCG CTGTTAAATG CGTTGCTGAA TACCGCCAGC GGGGCGACAT GGGTATCGCT CCATCATGGC GGCGGGGTGG GAATGGGGTT TTCGCAACAC GCCGGTATGG TGATTGTCTG TGATGGCACT GACGAGGCCG CCGCGCGTAT TCGCCGCGTG TTACACAACG ATCCGGCGAC GGGCGTCATG CGCCATGCCG ATGCCGGATA TGATCTCGCG GTGGAATGCG CTGTTGAGCA AGGTCTGAAT TTACCGATGG TTGCGGCGAC GCAGGGGAAA GGCTGA
|
Protein sequence | MPESKYRQQT IRAPRGTVLT AKSWLTEAPL RMLMNNLDPD VAENPHELVV YGGIGRAARN WECYDAIVDA LTRLEADETL LIQSGKPVGV FKTHDNAPRV LIANSNLVPH WATWEHFNEL DAKGLAMYGQ MTAGSWIYIG SQGIVQGTYE TFVEAGRQHY NGTLAGRWVL TAGLGGMGGA QPLAATLAGA CSLTIECQQS RIDFRLRTRY VDEQAATLDD ALARITRYTR EGKAVSVALC ANAADILPEL VNRGVRPDLV TDQTSAHDPL HGYLPSGWRW EEYQKNAQSD PHGTMQAAKR SMAAHVRAML AFSQMGVPTF DYGNNIRQMA KEMGVENAFD FPGFVPAYIR PLFCRGIGPF RWVALSGDPQ DIYKTDAKVK EIVAEDKHLH HWLDMARERI HFQGLPARIC WVGLEWRQKL GLAFNEMVRC GEVSAPIVIG RDHLDSGSVA SPNRETEAMR DGSDAVSDWP LLNALLNTAS GATWVSLHHG GGVGMGFSQH AGMVIVCDGT DEAAARIRRV LHNDPATGVM RHADAGYDLA VECAVEQGLN LPMVAATQGK G
|
| |