Gene SeHA_C0917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C0917 
SymbolhutU 
ID6491999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp903237 
End bp904922 
Gene Length1686 bp 
Protein Length561 aa 
Translation table11 
GC content60% 
IMG OID642741165 
Producturocanate hydratase 
Protein accessionYP_002044818 
Protein GI194447536 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0937043 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones95 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAAA GCAAGTATCG TCAGCAGACT ATCCGCGCGC CCCGAGGCAC GGTATTAACG 
GCGAAAAGCT GGCTGACAGA AGCCCCGCTG CGGATGTTAA TGAATAATCT CGATCCTGAC
GTGGCGGAAA ATCCGCATGA GCTGGTGGTC TACGGCGGGA TTGGTCGCGC CGCGCGCAAC
TGGGAATGCT ATGACGCTAT TGTTGATGCG CTCACCCGGC TGGAGGCGGA CGAAACGTTG
CTTATTCAGT CTGGCAAACC GGTCGGCGTA TTTAAAACGC ACGACAACGC GCCGCGGGTA
TTAATCGCCA ACTCCAACCT GGTTCCCCAC TGGGCGACAT GGGAACACTT TAACGAACTG
GATGCGAAAG GGCTGGCGAT GTACGGTCAA ATGACGGCCG GAAGCTGGAT CTATATCGGC
AGCCAGGGAA TCGTGCAGGG AACATACGAA ACCTTTGTCG AGGCGGGGCG TCAGCACTAT
AACGGCACGC TGGCGGGACG CTGGGTGCTG ACTGCCGGGC TGGGCGGCAT GGGCGGCGCG
CAACCGCTAG CCGCGACGCT GGCCGGAGCG TGTTCGCTGA CGATTGAATG CCAGCAAAGC
CGTATCGATT TTCGTCTGCG TACTCGCTAC GTGGATGAGC AGGCCGCCAC GCTGGATGAC
GCGCTGGCCC GCATTACTCG CTACACCCGC GAGGGGAAAG CCGTGTCCGT CGCCCTGTGC
GCGAATGCGG CGGATATCCT GCCGGAACTG GTTAATCGCG GCGTGCGCCC GGACCTGGTG
ACCGATCAGA CCAGCGCCCA CGATCCGCTA CATGGCTATT TACCCTCCGG CTGGCGCTGG
GAGGAGTATC AGAAAAACGC GCAATCCGAT CCCCACGGGA CGATGCAGGC AGCGAAACGT
TCCATGGCGG CACATGTTCG GGCGATGCTG GCGTTCAGTC AAATGGGCGT GCCGACCTTT
GACTATGGCA ACAATATTCG TCAGATGGCG AAAGAGATGG GGGTGGAAAA CGCCTTTGAT
TTTCCGGGAT TTGTGCCAGC CTATATTCGT CCGCTGTTCT GCCGTGGCAT CGGGCCGTTT
CGCTGGGTGG CGCTGTCCGG CGATCCGCAG GATATCTATA AAACCGATGC CAAAGTCAAA
GAGATAGTGG CTGAGGATAA ACATCTGCAT CACTGGCTGG ATATGGCGCG CGAGCGCATT
CATTTTCAGG GGTTACCGGC GCGTATCTGC TGGGTAGGCC TGGAGTGGCG GCAAAAACTG
GGGCTGGCGT TCAACGAAAT GGTGCGTTGC GGCGAGGTAT CCGCGCCCAT TGTGATTGGC
CGCGATCACC TGGATTCCGG CTCTGTCGCC AGCCCTAACC GTGAAACCGA AGCGATGCGC
GACGGTTCCG ACGCGGTTTC CGACTGGCCG CTGTTAAATG CGTTGCTGAA TACCGCCAGC
GGGGCGACAT GGGTATCGCT CCATCATGGC GGCGGGGTGG GAATGGGGTT TTCGCAACAC
GCCGGTATGG TGATTGTCTG TGATGGCACT GACGAGGCCG CCGCGCGTAT TCGCCGCGTG
TTACACAACG ATCCGGCGAC GGGCGTCATG CGCCATGCCG ATGCCGGATA TGATCTCGCG
GTGGAATGCG CTGTTGAGCA AGGTCTGAAT TTACCGATGG TTGCGGCGAC GCAGGGGAAA
GGCTGA
 
Protein sequence
MPESKYRQQT IRAPRGTVLT AKSWLTEAPL RMLMNNLDPD VAENPHELVV YGGIGRAARN 
WECYDAIVDA LTRLEADETL LIQSGKPVGV FKTHDNAPRV LIANSNLVPH WATWEHFNEL
DAKGLAMYGQ MTAGSWIYIG SQGIVQGTYE TFVEAGRQHY NGTLAGRWVL TAGLGGMGGA
QPLAATLAGA CSLTIECQQS RIDFRLRTRY VDEQAATLDD ALARITRYTR EGKAVSVALC
ANAADILPEL VNRGVRPDLV TDQTSAHDPL HGYLPSGWRW EEYQKNAQSD PHGTMQAAKR
SMAAHVRAML AFSQMGVPTF DYGNNIRQMA KEMGVENAFD FPGFVPAYIR PLFCRGIGPF
RWVALSGDPQ DIYKTDAKVK EIVAEDKHLH HWLDMARERI HFQGLPARIC WVGLEWRQKL
GLAFNEMVRC GEVSAPIVIG RDHLDSGSVA SPNRETEAMR DGSDAVSDWP LLNALLNTAS
GATWVSLHHG GGVGMGFSQH AGMVIVCDGT DEAAARIRRV LHNDPATGVM RHADAGYDLA
VECAVEQGLN LPMVAATQGK G