Gene Rleg2_2052 details


Gene Information

Locus tag: Rleg2_2052
Symbol:
ID: 6980791
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 2114923
End bp: 2115957
Gene Length: 1035 bp
Protein Length: 344 aa
Translation table: 11
GC content: 61%
IMG OID: 643396774
Product: alpha/beta hydrolase fold
Protein accession: YP_002281562
Protein GI: 209549645
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID:
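
The coordinates and lengths listed above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive 1-based coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2114923, 2115957)
print(length)                  # 1035
print(protein_length(length))  # 344
```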


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.111851
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.974564
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGAGC AAATCAACCA TCACCGCCGC CGCTTCTTCG GCATGACGGC AATCGCCCTT 
GCGGCCGTCG AATTCGGCGT GGCCGGGACA GCCGTCGCCC AGTCGGCGCT GCCCGCCGTA
AAGGCCGGAA CCAATACGTC TTTCGAAGCG CTGAAGCAGG TGAAGGCGGG CGTGCTCGAT
ATTGGTTATG CCGAGGCCGG CAGCGCGGAT GGTCCCGTTG TTCTGCTGCT GCATGGCTGG
CCCTACGATA TTTATAGTTT CGTCGATGTT GCGCCGCTGC TGGCCTCGGC GGGTTACAGG
GTGATCGTCC CCTATCTGCG TGGTTACGGC ACGACCCGCT TCCTGGACGA CCAGACACCG
CGCAACGGTC AGCCGTCGGC GCTGGCTGCC GATATGATCG CGCTGCTCGA TGCGCTCGAT
ATCGAGAAGG CAGTGATTGC CGGCTATGAC TGGGGCGGGA GGACGGCCAA CATTATGGCG
GCGCTATGGC CGGAGCGCTG CAAGGCGATG GTCTCGGTGA GCGGCTACCT GATCGGCAGC
CAGGAGGCTA ATTTGAAGCC GCTGCCGCCG AAGGCGGAAC TGGCCTGGTG GTATCAGTTC
TATTTTGCAA CCGAACGTGG GCGGCTGGGT TACGAGAGCA ATACGCATGA TTTCGCAAAG
CTCATCTGGC AGACGGCTTC GCCGAAGTGG AATTTCGACG ATGCGACTTT CGACAGGTCG
GCGGCTGCCT TCGACAATCC CGACCATGTC GCGATCGTCA TTCACAATTA TCGCTGGCGC
CTGGGGCTTG TCGAAGGCGA GGCCAAGTAC GATGCCTATG AGAAGACGCT TGCCGCATTG
CCGATGATCT CCGTGCCGAC GATCACCATG GAGGGGGATG CAAACGGTGC GCCGCATCCG
GAGCCATCCG CCTATGCCGG AAAATTCTCC GGCAAATACG AGCATCGCAC GATTAATGGC
GGCATAGGCC ACAACCTGCC GCAGGAAGCG CCGCAGGCCT TTGCGCAGGC GGTCATCGAC
GTCGACCGCT TCTGA
 
Protein sequence
MSEQINHHRR RFFGMTAIAL AAVEFGVAGT AVAQSALPAV KAGTNTSFEA LKQVKAGVLD 
IGYAEAGSAD GPVVLLLHGW PYDIYSFVDV APLLASAGYR VIVPYLRGYG TTRFLDDQTP
RNGQPSALAA DMIALLDALD IEKAVIAGYD WGGRTANIMA ALWPERCKAM VSVSGYLIGS
QEANLKPLPP KAELAWWYQF YFATERGRLG YESNTHDFAK LIWQTASPKW NFDDATFDRS
AAAFDNPDHV AIVIHNYRWR LGLVEGEAKY DAYEKTLAAL PMISVPTITM EGDANGAPHP
EPSAYAGKFS GKYEHRTING GIGHNLPQEA PQAFAQAVID VDRF
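
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal sketch. It builds the codon table by hand rather than relying on a bioinformatics library; translation table 11 uses the same amino acid assignments as the standard genetic code and differs only in which codons may act as start codons, so the standard assignments are used here. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and only the first 30 bases of the gene are used as input.

```python
from itertools import product

# Standard amino acid assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
fragment = "ATGTCAGAGC AAATCAACCA TCACCGCCGC"
print(translate(fragment))  # MSEQINHHRR
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way would allow the listed protein sequence and GC content to be checked against the record.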