Gene SNSL254_A0854 details


Gene Information

Locus tag: SNSL254_A0854
Symbol: hutU
ID: 6485173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Newport str. SL254
Kingdom: Bacteria
Replicon accession: NC_011080
Strand:
Start bp: 859320
End bp: 861005
Gene Length: 1686 bp
Protein Length: 561 aa
Translation table: 11
GC content: 59%
IMG OID: 642736266
Product: urocanate hydratase
Protein accession: YP_002040026
Protein GI: 194445492
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2987] Urocanate hydratase
TIGRFAM ID: [TIGR01228] urocanate hydratase
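
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function names gene_length and protein_length are hypothetical helpers introduced for illustration, not part of any database API.

```python
# Values copied from the "Start bp" and "End bp" fields above.
START_BP = 859320
END_BP = 861005


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints "1686 561"
```

Under these assumptions the result matches the listed Gene Length (1686 bp) and Protein Length (561 aa).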


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.668496
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 84
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTGAAA GCAAGTATCG TCAGCAGACT ATCCGCGCGC CCAGAGGCAC GGTATTAACG 
GCGAAAAGCT GGCTGACAGA AGCCCCGCTG CGGATGTTAA TGAATAATCT CGATCCTGAC
GTGGCGGAAA ATCCGCATGA GCTGGTGGTC TACGGCGGGA TTGGTCGCGC CGCGCGCAAC
TGGGAATGCT ATGACGCTAT TGTTGATGCG CTCACCCGGC TGGAGGCGGA CGAAACGTTG
CTTATTCAGT CTGGCAAACC GGTCGGCGTA TTTAAAACGC ACGACAACGC GCCGCGGGTA
TTAATCGCCA ACTCCAACCT GGTTCCCCAC TGGGCGACAT GGGAACACTT TAACGAACTG
GATGCGAAAG GGCTGGCGAT GTACGGTCAA ATGACGGCCG GAAGCTGGAT CTATATCGGC
AGTCAGGGAA TCGTGCAGGG AACATACGAA ACCTTTGTCG AGGCGGGGCG TCAGCACTAT
AACGGTACGC TGGCGGGACG CTGGGTGCTG ACCGCCGGAC TGGGCGGCAT GGGCGGCGCG
CAACCGCTAG CCGCGACGCT GGCTGGAGCG TGTTCGCTGA CGATTGAATG CCAGCAAAGC
CGTATCGATT TTCGTCTGCG TACTCGCTAC GTGGATGAGC AGGCCGCCAC GCTGGATGAC
GCGCTGGCCC GCATTACGCG CTACACCCGC GAGGGGAAAG CCGTGTCCGT CGCCCTGTGC
GCGAACGCGG CGGATATCCT GCCGGAACTG GTTAATCGCG GCGTGCGCCC GGACCTGGTG
ACCGATCAGA CCAGCGCCCA CGATCCGCTA CATGGCTATT TACCCTCCGG CTGGCGCTGG
GAGGAGTATC AGAAAAACGC GCAATCCGAT CCCCACGGGA CGATGCAGGC AGCGAAACGT
TCCATGGCGG CGCATGTTCG GGCGATGCTG GCGTTCAGTA AAATGGGCGT GCCGACCTTT
GACTATGGCA ACAATATTCG CCAGATGGCG AAAGAGATGG GGGTGGAAAA CGCCTTTGAT
TTTCCGGGAT TTGTGCCAGC CTATATTCGT CCGCTGTTCT GTCGTGGCAT CGGGCCGTTT
CGCTGGGTGG CGCTGTCCGG CGATCCGCAG GATATCTATA AAACCGATGC CAAAGTCAAA
GAGATAGTGG CTGAGGATAA ACATCTGCAT CACTGGCTGG ATATGGCGCG CGAGCGCATT
CATTTTCAGG GGTTACCGGC GCGTATCTGC TGGGTAGGCC TGGAGTGGCG GCAAAAACTG
GGGCTGGCGT TCAACGAAAT GGTGCGTTGC GGCGAGGTAT CCGCGCCCAT TGTGATTGGC
CGCGATCACC TGGATTCCGG CTCTGTCGCC AGCCCTAACC GTGAAACCGA AGCGATGCGC
GACGGTTCCG ACGCGGTTTC CGACTGGCCG CTGTTAAATG CGTTGCTGAA TACCGCCAGC
GGGGCGACAT GGGTATCGCT CCATCATGGC GGCGGGGTGG GAATGGGGTT TTCGCAACAC
GCCGGTATGG TGATTGTCTG TGATGGCACT GACGAGGCCG CCGCGCGTAT TCGCCGCGTG
TTACACAACG ATCCGGCGAC GGGCGTCATG CGCCATGCCG ATGCCGGATA TGATCTCGCG
GTGGAATGCT CTGTTGAGCA AGGTCTGAAT TTACCGATGG TTGCGGCGAC GCAGGGGAAA
GGCTGA
 
Protein sequence
MPESKYRQQT IRAPRGTVLT AKSWLTEAPL RMLMNNLDPD VAENPHELVV YGGIGRAARN 
WECYDAIVDA LTRLEADETL LIQSGKPVGV FKTHDNAPRV LIANSNLVPH WATWEHFNEL
DAKGLAMYGQ MTAGSWIYIG SQGIVQGTYE TFVEAGRQHY NGTLAGRWVL TAGLGGMGGA
QPLAATLAGA CSLTIECQQS RIDFRLRTRY VDEQAATLDD ALARITRYTR EGKAVSVALC
ANAADILPEL VNRGVRPDLV TDQTSAHDPL HGYLPSGWRW EEYQKNAQSD PHGTMQAAKR
SMAAHVRAML AFSKMGVPTF DYGNNIRQMA KEMGVENAFD FPGFVPAYIR PLFCRGIGPF
RWVALSGDPQ DIYKTDAKVK EIVAEDKHLH HWLDMARERI HFQGLPARIC WVGLEWRQKL
GLAFNEMVRC GEVSAPIVIG RDHLDSGSVA SPNRETEAMR DGSDAVSDWP LLNALLNTAS
GATWVSLHHG GGVGMGFSQH AGMVIVCDGT DEAAARIRRV LHNDPATGVM RHADAGYDLA
VECSVEQGLN LPMVAATQGK G
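
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched as follows. This is a minimal illustration, not the tool that produced this record: the names clean, gc_content, and translate are hypothetical helpers. It assumes the sequence contains only A, C, G, and T, and it uses the standard codon assignments, which translation table 11 shares for elongation codons (table 11 differs in which start codons it permits; this gene starts with ATG).

```python
# Standard codon assignments in NCBI base order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop.

    Raises KeyError on any codon containing a character other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGCCTGAAA GCAAGTATCG TCAGCAGACT"))  # prints "MPESKYRQQT"
```

The first ten residues produced this way agree with the start of the listed protein sequence. Passing the full gene sequence to gc_content would give a value to compare with the GC content field.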