Gene Rleg2_3792 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3792 
Symbol 
ID6982555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3922342 
End bp3924822 
Gene Length2481 bp 
Protein Length826 aa 
Translation table11 
GC content64% 
IMG OID643398514 
Productglycoside hydrolase family 2 sugar binding 
Protein accessionYP_002283280 
Protein GI209551363 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.111976 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.296586 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGGGGC GTTTGGCAAG CATCGGTGCC GAGGAAAGCC TGCTTTCCGA AGGCTGGAAC 
CTGATCCTGA CGGAGCCGGG CGCCTGCGCC GTGCCGCATG ACATTCCGCT TTCCGCGCAA
TTCATTCCAG CACCCGTTCC CGGCACCGTT GCCGCAGCGC TCGAAAAGGC CGGGCTGTTC
GACCGGGAAA ATCCAGAGCC GTTGAACACC AGGGATGCCT GGTATCTCTG CCGGCTGTTC
GATGCCGAGC CGGGTGAAGC GATCCTGCGT TTCGCGGGAC TTGCCACACT TTGCCATGTC
TTCCTCAACG GCCAGGAAAT CCTGTTTTCC GAGAGCATGT TCACGGCCCA TGAAATCCCG
GTAACGCTTG TGGGTGGCGA CGAGCTGGCG CTGTGCTTCC GGGCGCTCGG CCCCCGCCTG
TCGGAGCCTG GCCCGCGTGC ACGCTGGCGG CCGCAGATGA TCACGCCGGC GGGTCTCAAA
AATTTCCGCA CGACGCTGCT CGGCCATATG CCGGGCTGGT GCCCCGATAT CCATGCGGTC
GGGCCATGGC GGCCGATCTC GCTGGTGCGG CGGAACCCCG TCTCGATCGA CAATGTCTCC
GTCCGCGCCG TGCTGGACGA GAATGGCGTC GGCCGTCTCA GCGTTTCCCT GTACAGCAAT
GCCGAGAATC CGGCAATGCT GCTGCGCTGC GGCGGCATGG AGCAGCCTTT CGAGAAGATC
GGCGACAGTC ATTACTCGGC TATCCTCAAG CTTGCCGACA TCGAGCCCTG GTGGCCGCAC
ACACATGGCC TCCCGCGTCT TTACGATTTA GCGCTGGTTT CCGACGGCGT GGAATATTCC
CTCGGCACGA CCGGCTTCCG GCGGATCGAC GTCGAGCGTG GCGTCGATGG CGACGACTTC
GCGCTCTTTA TTAACGGGGA ACGCATCTTT TGCCGCGGCG CGGTGTGGAC GACAGCCGAT
ATCGCGCGTC TGCCGGGCAC GCGGGCGGAT TATGAGCCGT TCCTGCGGCT GGCGGCGCAA
GCCGGCATGA ACATGATCCG AATCGGCGGC ACCATGGCCT ATGAGACACC GAATTTCTTC
GCGCTTTGCG ACGAACTCGG CCTGCTGGTG TGGCAGGATT TCATGTTCGC CAATTTCGAC
TATCCACGCA ACGACAAGGC GTTTCTCGGC CACGTGCATG CGGAGGTCGA GGAATTCCTG
CACGGCGTCC AGGCGTCGCC ATCGCTGGCG GTGCTCTGCG GCGGCAGCGA AATCCATCAG
CAGGCGGCGA TGCTCGGCCT GCCCGTGGAA TTCTGGAGCG GGCCGGTCAC CGATGAAATC
ATCCCGGCCA TCGTCACGCG CATGCGTCCC GACGTGCCCT ATGTTCCGAA TTCGCCCTAT
GGCGGGGCGA TGCCGTTTTC GCCGAATGCC GGTATCGCCC ATTATTACGG CGTCGGCGCC
TATATGCGGC CGATTGCCGA TGCGCGCCGT GCCGATGTGC GTTTTGCCTC CGAAAGCCTT
GCCTTTGCGC ATGTGCCGCA GCAAAGGACG CTGCATCGTC ATCTCGACGT GCCCTCAGTC
CACAGCCCGC TCTGGAAGGC TCGCGTGCCG CGCGACCGCA GCGCATCCTG GGATTTCGAG
GACGTTCGCG ATTTCTATCT GGAGCTTCTC TATGGTTTCG ATCCGGCCCG GCTGCGGCGC
GAAGACCAGC AACTTTATCT CGATTTCTCC CGCGCGGTCA CCGGCGAGGT GATCGAGGAG
ACCTTTGCCG AATGGCGGCG CAAGGGTTCG GCTTGCAACG GCGCGCTCGT CTGGACGCTG
CAGGACCTGC TGCCCGGTTC CGGCTGGGGG GTGATCGATT CCACCGGCGA GCCGAAGCCG
GTCTGGTATG CGATGCGCCG CGCCTTCCGG CCGGTGCAGA CGGTGTTCAC CGACGAGGGA
ACCAACGGCC TCGACGTGCA TGTCGTCAAC GAGACGGACG CGGACCTCGA CGTCGAGCTC
GAGGTCGTCT GTCTGCGCGA CGGAAAGCAG CAGGTCGTCA GCGGCAGCAG AGCCTTCAAG
CTGGCGGCAA GGGACACGGA GCGTCTTGCC ACGACCGCGC TGTTCGGCGC TTTCTTCGAT
ACGACCTATG CCTTCCGTTT CGGACCGCCG TCGCATGATG TCAGTGTGGC GCGCCTGCGT
TCGCTGGCGG ATGGCGCCAT TTTGGCGGAA AGCTTCCACT TTCCCTGCGG ACGGGACAAG
GCGCTGCATG AAGCAGGCAT CGAGGCATCG CTCGGAAGAG ACGGCGACGA CTGGTTCGTC
GATCTCAGGA CCGACCGGCT GGCGCAATCG GTGCATATCG ACGTCGACGG CTACCGGCCC
GATGACGACT GGTTCCATCT CGCCCCCGGC GCGATGCGGC GCATGAAACT CCACGCGCTG
CCGGGCACCG AGAGCGATAT TGCGCCTGCA GGCGAGATCA GAAGTCTAGG CAGTTCGCAT
CGTGTCGTGC TCTCGGGCTG A
 
Protein sequence
MRGRLASIGA EESLLSEGWN LILTEPGACA VPHDIPLSAQ FIPAPVPGTV AAALEKAGLF 
DRENPEPLNT RDAWYLCRLF DAEPGEAILR FAGLATLCHV FLNGQEILFS ESMFTAHEIP
VTLVGGDELA LCFRALGPRL SEPGPRARWR PQMITPAGLK NFRTTLLGHM PGWCPDIHAV
GPWRPISLVR RNPVSIDNVS VRAVLDENGV GRLSVSLYSN AENPAMLLRC GGMEQPFEKI
GDSHYSAILK LADIEPWWPH THGLPRLYDL ALVSDGVEYS LGTTGFRRID VERGVDGDDF
ALFINGERIF CRGAVWTTAD IARLPGTRAD YEPFLRLAAQ AGMNMIRIGG TMAYETPNFF
ALCDELGLLV WQDFMFANFD YPRNDKAFLG HVHAEVEEFL HGVQASPSLA VLCGGSEIHQ
QAAMLGLPVE FWSGPVTDEI IPAIVTRMRP DVPYVPNSPY GGAMPFSPNA GIAHYYGVGA
YMRPIADARR ADVRFASESL AFAHVPQQRT LHRHLDVPSV HSPLWKARVP RDRSASWDFE
DVRDFYLELL YGFDPARLRR EDQQLYLDFS RAVTGEVIEE TFAEWRRKGS ACNGALVWTL
QDLLPGSGWG VIDSTGEPKP VWYAMRRAFR PVQTVFTDEG TNGLDVHVVN ETDADLDVEL
EVVCLRDGKQ QVVSGSRAFK LAARDTERLA TTALFGAFFD TTYAFRFGPP SHDVSVARLR
SLADGAILAE SFHFPCGRDK ALHEAGIEAS LGRDGDDWFV DLRTDRLAQS VHIDVDGYRP
DDDWFHLAPG AMRRMKLHAL PGTESDIAPA GEIRSLGSSH RVVLSG