Gene SeSA_A0940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeSA_A0940 
SymbolhutU 
ID6516233 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Schwarzengrund str. CVM19633 
KingdomBacteria 
Replicon accessionNC_011094 
Strand
Start bp911099 
End bp912784 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content60% 
IMG OID642746072 
Producturocanate hydratase 
Protein accessionYP_002113883 
Protein GI194735125 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00715979 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.983555 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAAA GCAAGTATCG TCAGCAGACT ATCCGCGCGC CCAGAGGCAC GGTATTAACG 
GCGAAAAGCT GGCTGACAGA AGCCCCGCTG CGGATGTTAA TGAATAATCT CGATCCTGAC
GTGGCGGAAA ATCCGCATGA GCTGGTGGTC TACGGCGGGA TTGGTCGCGC CGCGCGCAAC
TGGGAATGCT ATGACGCTAT TGTTGATGCG CTCACCCGGC TGGAGGCGGA CGAAACGTTG
CTTATTCAGT CTGGCAAACC GGTCGGCGTA TTTAAAACGC ACGACAACGC GCCGCGGGTA
TTAATCGCCA ACTCCAACCT GGTTCCCCAC TGGGCGACAT GGGAACATTT TAACGAACTG
GATGCGAAAG GGCTGGCGAT GTACGGTCAG ATGACGGCCG GAAGCTGGAT CTATATCGGC
AGCCAGGGAA TCGTGCAGGG AACATACGAA ACCTTTGTCG AGGCGGGGCG TCAGCACTAT
AACGGCACGC TGGCGGGACG CTGGGTGCTG ACCGCCGGGC TGGGCGGCAT GGGCGGCGCG
CAACCGCTAG CCGCGACGCT GGCCGGAGCG TGTTCGCTGA CGATTGAATG CCAGCAAAGC
CGTATCGATT TTCGTCTGCG TACTCGCTAC GTGGATGAGC AGGCCGCCAC GCTGGATGAC
GCGTTGGCCC GCATTACGCG CTACACCCGC GAGGGGAAAG CCGTGTCCGT CGCCCTGTGC
GCGAACGCGG CGGATATCCT GCCGGAACTG GTTAATCGCG GCGTGCGCCC GGACCTGGTG
ACCGATCAGA CCAGCGCCCA CGATCCGCTA CATGGCTATT TACCCTCCGG CTGGCGCTGG
GAGGAGTATC AGAAAAACGC GCAATCCGAT CCCCACGGGA CGATGCAGGC AGCGAAACGT
TCCATGGCGG CGCATGTTCG GGCGATGCTG GCGTTCAGTC AAATGGGCGT GCCGACCTTT
GACTATGGCA ACAATATTCG CCAGATGGCG AAAGAGATGG GGGTGGAAAA CGCCTTTGAT
TTTCCGGGAT TTGTGCCAGC CTATATTCGT CCGCTGTTCT GTCGTGGCAT CGGGCCGTTT
CGCTGGGTGG CGCTGTCCGG CGATCCGCAG GATATCTATA AAACCGATGC CAAAGTCAAA
GAGATAGTGG CTGAGGATAA ACATCTGCAT CACTGGCTGG ATATGGCGCG CGAGCGCATT
CATTTTCAGG GGCTACCGGC GCGTATCTGC TGGGTAGGTC TGGAGTGGCG GCAAAAACTG
GGGCTGGCGT TCAACGAAAT GGTGCGTTGC GGTGAGGTAT CCGCGCCTAT TGTGATTGGC
CGCGATCACC TGGATTCCGG TTCTGTCGCC AGCCCTAACC GTGAAACCGA AGCGATGCGC
GACGGTTCCG ACGCGGTTTC CGACTGGCCG CTGTTAAATG CGTTGCTGAA TACCGCCAGC
GGGGCGACAT GGGTATCGCT CCATCATGGC GGCGGGGTGG GGATGGGGTT TTCGCAACAC
GCCGGTATGG TGATTGTCTG TGATGGCACT GACGAGGCCG CCGCGCGTAT TCGCCGCGTG
TTACACAACG ATCCGGCGAC GGGCGTCATG CGCCATGCCG ATGCCGGATA TGATCTCGCG
GTGGAATGCG CTGTTGAGCA AGGTCTGAAT TTACCGATGG TTGCGGCGAC GCAGGGGAAA
GGCTGA
 
Protein sequence
MPESKYRQQT IRAPRGTVLT AKSWLTEAPL RMLMNNLDPD VAENPHELVV YGGIGRAARN 
WECYDAIVDA LTRLEADETL LIQSGKPVGV FKTHDNAPRV LIANSNLVPH WATWEHFNEL
DAKGLAMYGQ MTAGSWIYIG SQGIVQGTYE TFVEAGRQHY NGTLAGRWVL TAGLGGMGGA
QPLAATLAGA CSLTIECQQS RIDFRLRTRY VDEQAATLDD ALARITRYTR EGKAVSVALC
ANAADILPEL VNRGVRPDLV TDQTSAHDPL HGYLPSGWRW EEYQKNAQSD PHGTMQAAKR
SMAAHVRAML AFSQMGVPTF DYGNNIRQMA KEMGVENAFD FPGFVPAYIR PLFCRGIGPF
RWVALSGDPQ DIYKTDAKVK EIVAEDKHLH HWLDMARERI HFQGLPARIC WVGLEWRQKL
GLAFNEMVRC GEVSAPIVIG RDHLDSGSVA SPNRETEAMR DGSDAVSDWP LLNALLNTAS
GATWVSLHHG GGVGMGFSQH AGMVIVCDGT DEAAARIRRV LHNDPATGVM RHADAGYDLA
VECAVEQGLN LPMVAATQGK G