Gene Rleg2_5930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5930 
Symbol 
ID6977317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011366 
Strand
Start bp347855 
End bp349528 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content65% 
IMG OID643393383 
Producturocanate hydratase 
Protein accessionYP_002278201 
Protein GI209546311 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.138114 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACAATC CACGTCATAA CATCCGCGAA ATCCGCGCGC CCCGCGGCAA CGAGCTCAAT 
TCCAAGAGCT GGATGACCGA AGCGCCGCTG CGCATGCTGA TGAACAACCT CGATCCCGAC
GTCGCCGAAA ATCCGAACGA GCTCGTCGTC TACGGCGGCA TCGGCCGCGC CGCCCGCACC
TGGGAGGATT TCGACCGCAT CGTCGCGACG CTGAAGACGC TGACGGAAGA AGAAACGCTG
CTGGTGCAGT CCGGCAAGCC GGTTGGCGTC TTCCGCACCC ACAAGGATGC GCCACGCGTG
CTGATCGCCA ATTCCAACCT CGTCCCGCAT TGGGCGACAT GGGATCATTT CAATGAGCTG
GATAAGAAGG GCCTTGCCAT GTACGGCCAG ATGACGGCCG GCTCGTGGAT CTATATCGGC
ACGCAAGGGA TCGTGCAGGG CACCTACGAA ACCTTCGTCG AGGCCGGCCG CCAGCATTAC
GGCGGCAATC TCAAGGGCAA ATGGATCCTG ACCGGCGGCC TCGGCGGCAT GGGCGGCGCC
CAGCCTCTCG CCGCCGTCAT GGCCGGCGCC TGCTGCCTCG CCGTCGAATG CAATCCAGAC
TCGATCGATT TCCGCCTGCG CACCCGCTAT GTCGACGCCA AGGCCGAGAC GCTCGACGAA
GCGCTCGAAA TGATCGACCG CTGGACCAAA GCCGGTGAAG CCAAATCCGT CGGCCTGCTC
GGCAATGCCG CCGAAATCCT GCCGGAAATG GTCCGCCGCG GCATCCGCCC CGACATGGTC
ACCGACCAGA CCTCGGCGCA CGACCCGATC AACGGCTACC TGCCGAAGGG CTGGACGATG
GCCGAGTGGA AGGCCAAGCG CGAAAGCGAT CCGAAGGCCG TGGAAAAGGC CGCACGCGCC
TCGATGCGCG AGCATGTCGA AGCGATGATC GCCTTCTGGG ACGCCGGCAT TCCGACGCTC
GACTACGGCA ACAATATCCG CCAGGTCGCC AAGGAAGAAG GCTTGGAAAA CGCCTTCGCC
TTCCCGGGCT TCGTGCCGGC TTATATCCGT CCGCTGTTCT GCCGCGGCAT TGGCCCCTTC
CGCTGGGCCG CCTTGTCGGG CGACCCGGAG GATATCTACA AGACCGATGC CAAGGTGAGG
GAGCTGACCC CCGGCAATAC CCATCTGCAC AACTGGCTCG ATATGGCCAG GGAGCGCATC
GCCTTCCAGG GCCTGCCGGC GCGCATCTGC TGGGTCGGCC TCGGCGACCG CCACCGCCTA
GCTCTGGCCT TCAATGAAAT GGTCAGGAAC GGCGAGCTTT CCGCGCCGAT CGTCATCGGC
CGCGACCATC TCGACTCCGG CTCCGTCGCC TCGCCGAACC GCGAAACCGA GGCGATGAAG
GACGGCTCCG ACGCCGTCTC CGACTGGCCG CTGCTGAACG CCCTGCTCAA CACGGCGTCG
GGCGCCACCT GGGTATCGCT GCATCATGGC GGCGGCGTCG GCATGGGCTT CTCCCAGCAT
TCGGGTGTCG TTATTTGCGC CGACGGCAGC GACGATGCGG CCAAGCGTCT CGAGCGGGTG
CTCTGGAACG ACCCGGCGAC CGGCGTCATG CGCCACGCCG ATGCCGGCTA CGACATCGCC
CTCGACTGCG CCCGGGACAA GGGCCTGCGC CTGCCCGGCA TCCTGGGGAA CTGA
 
Protein sequence
MNNPRHNIRE IRAPRGNELN SKSWMTEAPL RMLMNNLDPD VAENPNELVV YGGIGRAART 
WEDFDRIVAT LKTLTEEETL LVQSGKPVGV FRTHKDAPRV LIANSNLVPH WATWDHFNEL
DKKGLAMYGQ MTAGSWIYIG TQGIVQGTYE TFVEAGRQHY GGNLKGKWIL TGGLGGMGGA
QPLAAVMAGA CCLAVECNPD SIDFRLRTRY VDAKAETLDE ALEMIDRWTK AGEAKSVGLL
GNAAEILPEM VRRGIRPDMV TDQTSAHDPI NGYLPKGWTM AEWKAKRESD PKAVEKAARA
SMREHVEAMI AFWDAGIPTL DYGNNIRQVA KEEGLENAFA FPGFVPAYIR PLFCRGIGPF
RWAALSGDPE DIYKTDAKVR ELTPGNTHLH NWLDMARERI AFQGLPARIC WVGLGDRHRL
ALAFNEMVRN GELSAPIVIG RDHLDSGSVA SPNRETEAMK DGSDAVSDWP LLNALLNTAS
GATWVSLHHG GGVGMGFSQH SGVVICADGS DDAAKRLERV LWNDPATGVM RHADAGYDIA
LDCARDKGLR LPGILGN