Gene Rleg_6064 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6064 
Symbol 
ID8016326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012852 
Strand
Start bp100343 
End bp100990 
Gene Length648 bp 
Protein Length215 aa 
Translation table11 
GC content56% 
IMG OID644827372 
ProductHAD-superfamily hydrolase, subfamily IA, variant 3 
Protein accessionYP_002978572 
Protein GI241258688 
COG category[R] General function prediction only 
COG ID[COG0637] Predicted phosphatase/phosphohexomutase 
TIGRFAM ID[TIGR01493] Haloacid dehalogenase superfamily, subfamily IA, variant 2 with 3rd motif like haloacid dehalogenase
[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR01549] haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.219512 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGATA TTCTATCGCT GGAAAATGAT TTCGAGGCCG TGATTTTTGA TTGCGACGGC 
ACACTTGTTC ACACGCCCCC TGTCTATGCG GACGCATGGG CGGCCGGCTT TGCCCTTTCC
GGCAAACCGA TGAGTGAGGC TTGGTATTTG ATGCGGGCCG GAATGTCTGA AGTGGTTCTT
ATGGACGCCT TTGAGCGAGA ATTCGACGTT GTGCTCGATC GCGCATCCGT CATCGCCACC
ATGCGATCGC ACTTTCTCCA GAATGTCCAC AATGTGCGCG AAGTTCGCGC GGTTGCTGCA
ACCGTTCGCC GGTTGGCCGG CTTGCGTCCC ATGGCTGTTG CATCCGGCGG TTCGCGAGAA
ATTGTCACAG CGACCCTACA AGGAACGGGC CTGCGGGAGT ATTTCGACCA GGTGGTCACG
ATCGATGACG TGCCAAATCC AAAGCCGGCG CCGGACCTGT TTCTGCAGGC GGCCGCTTTG
TTGGGAATAG AACCAGCCCG GTGTGTCGTG TTTGAAGATA GCGAACAAGG TCTTGAAGCA
GCTCGACGGG CGGGTATGAG CGCCATTGAT GTGACCCGAC TTGATCTCCA TGAGCAGCAA
CAGCTTGCAG ATGAATGCCT GAGCATTCGG TCGGTGCAGG CGCATTAA
 
Protein sequence
MIDILSLEND FEAVIFDCDG TLVHTPPVYA DAWAAGFALS GKPMSEAWYL MRAGMSEVVL 
MDAFEREFDV VLDRASVIAT MRSHFLQNVH NVREVRAVAA TVRRLAGLRP MAVASGGSRE
IVTATLQGTG LREYFDQVVT IDDVPNPKPA PDLFLQAAAL LGIEPARCVV FEDSEQGLEA
ARRAGMSAID VTRLDLHEQQ QLADECLSIR SVQAH