Gene Rleg2_0637 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_0637 
Symbol 
ID6979353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp663179 
End bp663868 
Gene Length690 bp 
Protein Length229 aa 
Translation table11 
GC content63% 
IMG OID643395349 
ProductHAD-superfamily hydrolase, subfamily IA, variant 3 
Protein accessionYP_002280160 
Protein GI209548243 
COG category[R] General function prediction only 
COG ID[COG0637] Predicted phosphatase/phosphohexomutase 
TIGRFAM ID[TIGR01493] Haloacid dehalogenase superfamily, subfamily IA, variant 2 with 3rd motif like haloacid dehalogenase
[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.872179 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.213307 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGGCT TCGATCTCAT TATCTTCGAT TGCGACGGCG TGCTCGTCGA TTCCGAAATC 
ATCGCCGCAG AAGTCGAATC CGCGCTTCTG ACGGAGGCGG GATATCCGAT CGGCGTCGAG
GAAATGGGCG AACGTTTCGC CGGCATGACA TGGCGCAACA TCCTGCTGCA GATCGAGCGC
GAAGCGAGCA TTCCGTTTTC GGCCTCGCTG CTTGAGAAGT CCGAGCAACT GCTCGACACC
AGGCTGGCAA ATGACGTCAA GGCCATTCCG GGCGTCGAAT TCGCCGTCTC AAGGCTCTCG
ATGAAGCGCT GCATCTGCTC GAATTCGAGC AGCAAGCGGC TCGACATGAT GCTCGGCAAG
GTGGGGCTGA AGCCGCTGTT TGCCCCCAAT ATCTTTTCCG CCAAGGATCT CGGCCCCGAC
CGGGCCAAGC CGAAGCCCGA CATCTTCCTG CACGGCGCAA GCCAGATGGG TGTCTCGCCC
GACAAGGTGG TCGTCGTCGA GGATTCCGTG CACGGCGTGC ATGCGGCGCG CGCCGCCGGC
ATGCGCGTCA TCGGCTTCAC CGGCGCCTCG CACAGCTATC CCGCCCATGC CGACAAGCTG
ACCGATGCCG GCGCCGAAAC GGCGATCTCC CGCATGAACG ACCTGCCTGG TGTCGTCGCC
GCGCTTGCGG CCTGGGAAGG CGTTCTCTAG
 
Protein sequence
MNGFDLIIFD CDGVLVDSEI IAAEVESALL TEAGYPIGVE EMGERFAGMT WRNILLQIER 
EASIPFSASL LEKSEQLLDT RLANDVKAIP GVEFAVSRLS MKRCICSNSS SKRLDMMLGK
VGLKPLFAPN IFSAKDLGPD RAKPKPDIFL HGASQMGVSP DKVVVVEDSV HGVHAARAAG
MRVIGFTGAS HSYPAHADKL TDAGAETAIS RMNDLPGVVA ALAAWEGVL