Gene PP_5033 details


Gene Information

Locus tag: PP_5033
Symbol: hutU
ID: 1042837
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pseudomonas putida KT2440
Kingdom: Bacteria
Replicon accession: NC_002947
Strand:
Start bp: 5735763
End bp: 5737436
Gene Length: 1674 bp
Protein Length: 557 aa
Translation table: 11
GC content: 65%
IMG OID: 637148432
Product: urocanate hydratase
Protein accession: NP_747134
Protein GI: 26991709
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2987] Urocanate hydratase
TIGRFAM ID: [TIGR01228] urocanate hydratase
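
The Start bp, End bp, Gene Length, and Protein Length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 5735763
end_bp = 5737436

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bp; the last codon is assumed to be the stop.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, gene_length % 3, protein_length)  # 1674 0 557
```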


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGACA ACAACAAATA CCGTGACGTT GAAATCCGTG CCCCACGTGG CAACAAGCTG 
ACCGCCAAAA GCTGGCTGAC CGAAGCGCCA CTGCGCATGC TGATGAACAA CCTCGATCCA
CAGGTCGCGG AAAACCCGAA AGAGCTGGTG GTATACGGCG GTATCGGCCG CGCCGCGCGC
AACTGGGCCT GCTACGACAA GATCGTCGAA ACCCTGACCC GCCTGGAAGA CGACGAAACC
CTGCTGGTGC AGTCGGGCAA GCCGGTCGGT GTGTTCAAGA CCCACAGCAA CGCACCGCGC
GTGCTGATTG CCAACTCCAA CCTGGTGCCA CACTGGGCCA ACTGGGAACA CTTCAACGAA
CTGGACGCCA AGGGCCTGGC GATGTATGGC CAGATGACCG CCGGCAGCTG GATCTACATC
GGCAGCCAGG GCATCGTCCA GGGCACCTAT GAAACCTTCG TCGAAGCCGG TCGCCAGCAC
TACGGCGGCA GCCTGAAAGG CAAGTGGGTA CTCACCGCTG GCCTGGGCGG CATGGGCGGC
GCCCAGCCAC TGGCCGCGAC CCTGGCTGGG GCTTGCTCGC TGAACATCGA ATGCCAGCAG
AGCCGTATCG ACTTCCGCCT GGAAACCCGC TACGTCGACG AGCAGGCCAC TGACCTCGAC
GACGCCCTCG CACGCATCGC CAAGTACACC GCCGAAGGCA AGGCCATCTC CATCGCCCTG
CACGGCAACG CCGCCGAAAT CCTCCCAGAG CTGGTCAAAC GTGGCGTCCG CCCGGACATG
GTCACCGACC AGACCAGCGC CCACGACCCA CTGAACGGCT ACCTGCCAGC CGGCTGGACC
TGGGAACAGT ACCGCGATCG TGCGCAGACC GAACCGGCTG CAGTGGTCAA GGCCGCCAAG
CAGTCGATGG CCGTGCACGT GCAGGCCATG CTGGACTTCC AGAAGCAGGG CATCCCGACC
TTCGATTACG GCAACAACAT CCGCCAGATG GCCAAGGAGG AGGGCGTGGC CAATGCCTTC
GACTTCCCAG GCTTCGTCCC GGCCTACATC CGCCCACTGT TCTGCCGCGG CGTCGGCCCG
TTCCGCTGGG CGGCGCTGTC CGGTGAAGCC GAGGACATCT ACAAGACCGA CGCCAAGGTC
AAGGAACTGA TCCCCGACGA CGCCCACCTG CACCGCTGGC TGGACATGGC CCGCGAGCGC
ATCAGCTTCC AGGGCCTGCC GGCACGTATC TGCTGGGTGG GGCTGGGCCT TCGCGCCAAG
CTGGGCCTGG CTTTCAACGA AATGGTCCGC AGCGGCGAGC TGTCGGCACC GGTGGTGATC
GGCCGTGACC ACCTCGACTC CGGCTCGGTA TCCAGCCCGA ACCGCGAAAC CGAGGCCATG
CGTGATGGTT CGGACGCTGT TTCCGACTGG CCGCTGCTCA ACGCCCTGCT GAACACCGCA
GGCGGCGCCA CCTGGGTATC GCTGCACCAT GGCGGTGGCG TGGGCATGGG CTTCTCGCAG
CACTCGGGCA TGGTCATCGT CTGCGACGGT ACCGATGAGG CCGCCGAGCG CATCGCCCGG
GTACTGACCA ACGACCCAGG GACTGGCGTC ATGCGTCACG CCGATGCCGG TTATGACATC
GCCATCGACT GCGCCAAGGA GCAGGGCCTG GACCTGCCGA TGATCACCGG CTGA
 
Protein sequence
MTDNNKYRDV EIRAPRGNKL TAKSWLTEAP LRMLMNNLDP QVAENPKELV VYGGIGRAAR 
NWACYDKIVE TLTRLEDDET LLVQSGKPVG VFKTHSNAPR VLIANSNLVP HWANWEHFNE
LDAKGLAMYG QMTAGSWIYI GSQGIVQGTY ETFVEAGRQH YGGSLKGKWV LTAGLGGMGG
AQPLAATLAG ACSLNIECQQ SRIDFRLETR YVDEQATDLD DALARIAKYT AEGKAISIAL
HGNAAEILPE LVKRGVRPDM VTDQTSAHDP LNGYLPAGWT WEQYRDRAQT EPAAVVKAAK
QSMAVHVQAM LDFQKQGIPT FDYGNNIRQM AKEEGVANAF DFPGFVPAYI RPLFCRGVGP
FRWAALSGEA EDIYKTDAKV KELIPDDAHL HRWLDMARER ISFQGLPARI CWVGLGLRAK
LGLAFNEMVR SGELSAPVVI GRDHLDSGSV SSPNRETEAM RDGSDAVSDW PLLNALLNTA
GGATWVSLHH GGGVGMGFSQ HSGMVIVCDG TDEAAERIAR VLTNDPGTGV MRHADAGYDI
AIDCAKEQGL DLPMITG
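
The gene sequence begins with GTG, yet the protein sequence begins with M. This is consistent with translation table 11, where GTG is one of the permitted alternative start codons and is read as methionine at the initiation position. The following is a minimal sketch of that translation, not the tool used to generate this record. The function name `translate` is hypothetical, and it assumes the input starts at the first base of the start codon.

```python
from itertools import product

# Amino acid assignments in TCAG order; for internal codons, table 11
# uses the same assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the sequence starts at its start codon, so the first
    codon is always emitted as M (covers alternative starts such as GTG).
    """
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append("M" if i == 0 else aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGACCGACA ACAACAAATA CCGTGACGTT"))  # MTDNNKYRDV
```

The output matches the first ten residues of the protein sequence listed above.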