Gene Smed_4656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4656 
Symbol 
ID5318819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1165915 
End bp1167594 
Gene Length1680 bp 
Protein Length559 aa 
Translation table11 
GC content64% 
IMG OID640776454 
Producturocanate hydratase 
Protein accessionYP_001313386 
Protein GI150376790 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.312199 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0628347 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATGA ACAACCCGCG CCACAATATT CGCGAAGTAC GCAGCCCGCG AGGCACCGAG 
ATCAGCGCCA GGAGCTGGCT GACCGAAGCG CCGCTCAGAA TGCTGATGAA CAATCTCGAT
CCTGATGTCG CCGAGAACCC GCACGAACTC GTCGTCTATG GCGGCATCGG CCGCGCTGCA
CGCACCTGGG CCGATTTCGA CCGCATCGTC GCCTCGCTCA GGTATCTGAA CGAAGACGAG
ACCCTGCTCG TCCAGTCTGG CAAGCCGGTC GGAGTCTTTC GCACGCACCG GGATGCGCCG
CGGGTGTTGA TCGCCAACTC CAATCTCGTG CCGCACTGGG CGAACTGGGA TCATTTCAAT
GAGCTGGATA AGAAGGGTCT CGCCATGTAT GGCCAGATGA CCGCCGGCTC GTGGATCTAT
ATCGGCACTC AAGGGATCGT GCAGGGCACG TATGAGACCT TCGTCGAGGC GGGCCGCCAG
CACTATGGCG GCAGCCTGAA AGGCAAGTGG ATCCTGACGG GCGGCCTCGG CGGTATGGGC
GGAGCACAGC CCCTTGCGGC GGTCATGGCC GGGGCCTGCT GCCTCGCCGT CGAATGCAAC
CCGGATTCGA TCGATTTCCG CCTCCGTACC CGCTACCTCG ACGAAAAGGC CGAGACGCTG
GAGGAAGCCA TGGAAATGAT CGAGCGCTGG ACGAAAGCCG GCGAAGCCAA GTCCGTCGGT
CTTCTCGGCA ATGCAGCTGA AATTCTGCCC CAAATGGTCC GCCGCGGCAT CCGCCCCGAT
ATCGTCACCG ACCAGACCTC CGCCCACGAT CCGGTGAATG GCTACCTGCC GAAGGGCTGG
ACGATCGCCG AATGGAAGGC CAAGCGCGAG AGCGACCCGA AGACGGTCGA GAAGGCCGCC
CGCGCGTCGA TGCGCGACCA TGTCGAAGCA ATGCTTGCCT TCTGGGATTC CGGCATACCG
ACGCTCGACT ACGGGAACAA CATCCGCCAG GTCGCCAAGG ACGAAGGTCT CGAGCGCGCC
TTTGATTTCC CCGGTTTCGT GCCGGCCTAT ATCCGTCCGC TTTTCTGCCG GGGCATCGGT
CCGTTCCGCT GGGCGGCTCT CTCGGGCGAT CCGGAAGATA TCTACAAGAC CGACCAGAAG
GTGAAGGAGC TGCTGCCGGA TAACAAGCAC CTGCACAACT GGCTCGACAT GGCGCGCGAG
CGCATCGCCT TTCAGGGCCT GCCCGCACGC ATCTGCTGGG TTGGCCTTGG CGATCGCCAC
CGCCTCGGCC TCGCCTTCAA CGAGATGGTG CGCAGCGGCG AGCTGAAGGC GCCGATCGTC
ATCGGGCGCG ACCATCTCGA CTCCGGCTCC GTCGCCTCGC CAAACCGCGA GACCGAAGCG
ATGAAGGACG GGTCCGATGC CGTCTCCGAT TGGCCGCTCC TGAACGCTCT CCTCAACACC
GCCTCCGGTG CCACCTGGGT ATCACTGCAC CACGGCGGCG GGGTCGGCAT GGGCTTCTCG
CAGCATTCCG GAATGGTGAT CTGCTGCGAC GGTACGGACG ATGCGGCACG CCGCGTTGAG
CGGGTGCTGT GGAACGACCC GGCAACCGGC GTCATGCGCC ACGCCGATGC CGGATATGAC
ATCGCATTGG ACTGCGCCCG CGAAAAAGGC CTCCGCCTGC CGGGCATATT GGGCGAGTGA
 
Protein sequence
MNMNNPRHNI REVRSPRGTE ISARSWLTEA PLRMLMNNLD PDVAENPHEL VVYGGIGRAA 
RTWADFDRIV ASLRYLNEDE TLLVQSGKPV GVFRTHRDAP RVLIANSNLV PHWANWDHFN
ELDKKGLAMY GQMTAGSWIY IGTQGIVQGT YETFVEAGRQ HYGGSLKGKW ILTGGLGGMG
GAQPLAAVMA GACCLAVECN PDSIDFRLRT RYLDEKAETL EEAMEMIERW TKAGEAKSVG
LLGNAAEILP QMVRRGIRPD IVTDQTSAHD PVNGYLPKGW TIAEWKAKRE SDPKTVEKAA
RASMRDHVEA MLAFWDSGIP TLDYGNNIRQ VAKDEGLERA FDFPGFVPAY IRPLFCRGIG
PFRWAALSGD PEDIYKTDQK VKELLPDNKH LHNWLDMARE RIAFQGLPAR ICWVGLGDRH
RLGLAFNEMV RSGELKAPIV IGRDHLDSGS VASPNRETEA MKDGSDAVSD WPLLNALLNT
ASGATWVSLH HGGGVGMGFS QHSGMVICCD GTDDAARRVE RVLWNDPATG VMRHADAGYD
IALDCAREKG LRLPGILGE