Gene Rleg2_0140 details

Gene Information

Locus tag: Rleg2_0140
Symbol:
ID: 6978850
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 134362
End bp: 136818
Gene Length: 2457 bp
Protein Length: 818 aa
Translation table: 11
GC content: 63%
IMG OID: 643394851
Product: glycoside hydrolase family 2 sugar binding
Protein accession: YP_002279668
Protein GI: 209547751
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
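
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the database.

```python
# Values copied from the Gene Information fields.
start_bp = 134362
end_bp = 136818
gene_length_bp = 2457
protein_length_aa = 818

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, codons, expected_protein_aa)  # 2457 819 818
```

Under these assumptions the span matches the listed gene length, and the codon count minus the stop codon matches the listed protein length.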


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.143076
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGAAA AGACCGAGCT CAATTCCGGC TGGACGCTTC ACTGCAACGA TACAGGAAGG 
CCTGGCCTGC CGGAAACAAT CCCGGCGACG GTGCCGGGCT GCGTGCATCT CGATCTTCTC
GCCAACCGGC TGATCCCCGA TCCCTATATC GACGTCAACG AGATCACCAA TGACTGGATC
GGCAAGACCG ACTGGACCTA TCGCTGCACA TTCGAGGCCG CGCCTGACGA CGACACGGTG
CAGGAACTCG TCTTCGACGG GCTCGATACG ATCGCGGTGA TCGCGCTGAA CGGCGAGGAG
ATCGGCCGCA GCTTCAACAT GCACCGTACC TATCGCTTCG ATATTTCCGG GCTTCTGAAG
GTGGGGGCCA ACGACCTCGC GGTCAGCTTC CGCTCCGCCT ATGCCTATGG CGCCGAGATG
GAGAAGCACT ACGGCTACCG GCCTAACAAC TATCCGGGGC CGGGCAATCT GATGCGCAAG
ATGGCCTGCA ATTTCGGCTG GGACTGGGGT CCGACGCTGG TGACGGCGGG ACTCTGGAAG
AGGGTCAGGC TGGAAAGCTG GGATCAGGCG CGGCTTGCCG AAACACGGGT CTCGGCCACG
CTTGCCGGCG GCGACGGGCT GGTGAAGGTG CATGCGAGGC TGGCGCGCCA TGGGGACGCG
AAGCCGTGCC GGCTGGTCGC GACGATCGGC GGTGTGACGA CGACGGTTGC GATCGGCGCC
GGAGAAGACA CCATCGCCTT CGAGCTTCGC CTGCCCTCGC CAAAACTCTG GTGGCCGCAT
CATCTCGGCG CCCAGCCGCT CTATCCCCTG ACGCTCGAAC TGATCGACGA TGCCGGCGGC
GACCTGCTCG ACAGCTATCA GCGGGCGCTT GGTTTCCGCT CGTTGAGGCT TGACACCTCG
GCCGATGCGC ACGGCTCGGC CTTCACCTTC GTCATCAACG ACGTGCCGCT GTTCATCGCC
GGCGCGAACT GGATTCCCGA CGATTGTTTC CCTTCGCGGG TGACGGCGGG GCGCTACGCC
GCACGGATCG ACGAGGCGAA GGCCGCCAAT ATCCACATGC TGCGCGTCTG GGGCGGCGGC
ATCTTCGAGC GCGACGAATT CTACGAGGCC TGCGACCGCA TGGGCATGCT GGTCTGGCAG
GATTTCCTCT TTGCCTGCGC CGCCTATCCG GAGGAGGAGC CGCTGAAGAG CGAGGTCGAG
GCCGAAGTGC GCGATAATGT CGTGCGGCTG ATGCCGCATG CCAGCCTGAT CCTCTGGAAC
GGCAACAATG AGAATATCTG GGGCTTCGAC GAATGGGGCT GGCGGCCGGT CATCAAGGCC
GATGAAAGCT GGGGGCTCGG TTATTATCTC GACCTGCTGC CGAGGCTCTC AGCCGAGCTC
GATCCCGACC GGCCTTATTA TCCCGGCAGC CCCTATTCCG GTTCGATGGA GATCGCGCCG
AATGCCGATG CGCATGGCTG CAAACATATC TGGGACGTCT GGAACGATGT CGGCTACGAG
GTCTACCGCG ACTATGTCCC GCGCTTCTGC TCCGAATTCG GCTGGCAGGC GCCGGCCGCC
TGGGCGACGA TCGAAGAAAG CGTGCACGAC CAGCCGCTGA CGCCGCAATC GAACGGCGTC
TTCCACCATC AGAAGGCCAC CCAAGGCAAT GACAAGCTGA TCCGCGGCCT CTCCGGCCAC
CTGCCGGAAC CGCAAACGAT GGACGACTGG CACTTCGCCA CCCAGCTCAA CCAGGCCCGC
GCCATCCGCT TCGGCATCGA GCACATGCGC TCGCACCGCG ATATCTGCAA GGGCGCGGTG
GTCTGGCAGT TCAACGATTG TTGGCCGGTG ACCTCCTGGG CCGCACTCGA CTCGGCCGGG
CGCCGCAAGC CGCTCTGGTA TGCGCTGAGG GCCGCCTATG ATCCGCGCCT GCTGACCATT
CAGCCGCGCG GCGACGGGCT TTCGGCGGTG GCGGTTAATG AGAGAACGCT GTTCTGGCGG
GCGAAGATCA GCGGCAGGCG TTTGCGGCTC GACGGCAGCG TGCTGGCAGA GTTCGAATTC
TGGCGGCTGC TCTGCGACCG CTTCGAGGCA AAAGAGTTTC CGCTGCCCGA AGATATCGTC
AGTCCTGGCT TACCGAAAGA GGAGGTCGTT GTCGTCGAAA TGCTCGACAG GCGGGCCTTT
CATTATTTCG TCGAGGATAT CGAGCTTGCC CTGCCGGCGC CGCGGCTGAG CGTCGATGTT
GCTGCGAGCG ATGGCGGGTT TGCGGTTACG GTGACGGCTG AGAGTTTCCT CAAGGATCTC
TGCCTGATGG CGGATCGGCT GGATCCGGAC GCCGTGGTCG ATACGATGCT GGTGACGCTG
CTGCCGGGCG AGAGTCATGT GTTTGCGGTG AAGACGGCGA AGGGGATTTC GGCCAATGAC
ATTGTCGTCG GTACAGTGCT GCGGTCGGCC AATGATCTTG TCGCAGGGCG GCAATAA
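
The gene sequence is displayed in blocks of 10 bases, 60 per line, so the spacing has to be removed before the sequence is measured. The sketch below, with hypothetical helper names `clean_sequence` and `gc_content`, shows one way to strip the display formatting and compute a GC fraction. It is applied here only to the first displayed line. Running it on the full sequence is how one would compare against the listed gene length and GC content.

```python
def clean_sequence(text: str) -> str:
    """Remove display spaces and line breaks, and upper-case the result."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence, copied with its block spacing.
first_line = "ATGATCGAAA AGACCGAGCT CAATTCCGGC TGGACGCTTC ACTGCAACGA TACAGGAAGG"
seq = clean_sequence(first_line)

# Print the cleaned length and the GC content as a whole-number percentage.
print(len(seq), round(gc_content(seq) * 100))
```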
 
Protein sequence
MIEKTELNSG WTLHCNDTGR PGLPETIPAT VPGCVHLDLL ANRLIPDPYI DVNEITNDWI 
GKTDWTYRCT FEAAPDDDTV QELVFDGLDT IAVIALNGEE IGRSFNMHRT YRFDISGLLK
VGANDLAVSF RSAYAYGAEM EKHYGYRPNN YPGPGNLMRK MACNFGWDWG PTLVTAGLWK
RVRLESWDQA RLAETRVSAT LAGGDGLVKV HARLARHGDA KPCRLVATIG GVTTTVAIGA
GEDTIAFELR LPSPKLWWPH HLGAQPLYPL TLELIDDAGG DLLDSYQRAL GFRSLRLDTS
ADAHGSAFTF VINDVPLFIA GANWIPDDCF PSRVTAGRYA ARIDEAKAAN IHMLRVWGGG
IFERDEFYEA CDRMGMLVWQ DFLFACAAYP EEEPLKSEVE AEVRDNVVRL MPHASLILWN
GNNENIWGFD EWGWRPVIKA DESWGLGYYL DLLPRLSAEL DPDRPYYPGS PYSGSMEIAP
NADAHGCKHI WDVWNDVGYE VYRDYVPRFC SEFGWQAPAA WATIEESVHD QPLTPQSNGV
FHHQKATQGN DKLIRGLSGH LPEPQTMDDW HFATQLNQAR AIRFGIEHMR SHRDICKGAV
VWQFNDCWPV TSWAALDSAG RRKPLWYALR AAYDPRLLTI QPRGDGLSAV AVNERTLFWR
AKISGRRLRL DGSVLAEFEF WRLLCDRFEA KEFPLPEDIV SPGLPKEEVV VVEMLDRRAF
HYFVEDIELA LPAPRLSVDV AASDGGFAVT VTAESFLKDL CLMADRLDPD AVVDTMLVTL
LPGESHVFAV KTAKGISAND IVVGTVLRSA NDLVAGRQ
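
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal sketch of that translation in plain Python. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Since this gene begins with ATG, no special handling of alternative starts is included. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid mapping in TCAG order (same assignments in tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop display spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGATCGAAA AGACCGAGCT CAATTCCGGC"))  # MIEKTELNSG
```

The output matches the first ten residues of the protein sequence, and translating the final 18 bases (`GTCGCAGGGCGGCAATAA`) gives `VAGRQ`, matching its last five residues before the stop codon.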