Gene Rleg_3782 details


Gene Information

Locus tag: Rleg_3782
Symbol:
ID: 8014610
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3837634
End bp: 3839877
Gene Length: 2244 bp
Protein Length: 747 aa
Translation table: 11
GC content: 62%
IMG OID: 644826345
Product: glycoside hydrolase family 2 TIM barrel
Protein accession: YP_002977564
Protein GI: 241206468
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
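
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3837634
end_bp = 3839877

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 2244 747
```

Both results agree with the Gene Length and Protein Length fields listed in the record.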


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.148395
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTTCCG TCGTTTCCTT CAACGAAGGC TGGAGTTTCC ATGAGGGCTT CGGCCAGCGT 
TTGCTCGAAG CTTTCGATGC CGCCAAGTCG GTCAGCCTGC CGCATACCGC CGTCGAACTG
CCCTTCAGCT ATTTCGACGA GACCAGCTAT CAGCGCGCCT TCACCTATCA GAAGGTGCTG
CGCTGGCTGC CGGAATTCGA GGGCCGCGAG GTTTCGCTCG TCTTCGATGC CGCCATGGCC
GACAGCGTCG TCTATCTGAA CGGCGAAGAA ATCATTGCCT ATAAGGACGG ATACACGCCC
TTCGAGGCCC GCCTCACCGG CAAGCTCGTC AAGGGCGAGA ACCTCGTCAC CGTCAAGATC
GACGGCAGCG AGAACCCTGA CATTCCGCCT TTCGGCGGCC GCATCGATTA TCTGACCTAT
GCCGGCATCT ACCGCGATGT CTGGCTGAAG GTCACCGACC CGGTCTCGAT CCGCAATCTC
AAGATCGAAA CCACTGACGT TCTTCGCCCG GAGAAATCGG CGACCATCCG CGTCGACATC
GCCAATCCCG AGGGCCGCAG CTTCTCGGCG ACCGTCACTG CCACGCTGAA ACAGGCCGAC
GGCACGGTGG TCGCCACTGC CGCGACCGAA ACGATCGGCA GCCGCACCAC GCTCTCCTTT
GGCGGCCTCA CCGGCATCGC CCTCTGGGAC ATCACCGATC CCACGCTCTA TGACGTCACC
GTCGAGCTCA GGACCGAACA CGGCTCCGAC CGTATTTCCA CCCGGTTCGG CTTCCGCACC
GCCGAATTCA CGCCGGAAGG TTTCCTCCTC AACGGCAAGC CATTGAAGCT GCGCGGCCTC
AACCGCCACC AGGCCTTCCC TTATGTCGGT TATGCCGCCG GCCGCTCTGC CCAGGAGCGT
GACGCCGACA TCATGAAGAC GGTGCTGAAG TGCAATATTG TCCGCACTTC GCATTATCCG
CAGTCGAAAT GGTTCCTCGA TCGATGCGAC GCAATCGGCC TGCTGGTTTT CGAGGAGATC
CCCGGCTGGC AGCATATCGG CGATGCCGAC TGGCAGCAGG AATCGATCGA GAACGTCCGC
CGCATGATTG AGCGCGACTG GAACCACCCC TCGATCATCA TCTGGGGCGT GCGCATCAAC
GAATCGCAGG ATAATCACGA TTTCTACGCC AAGACCAATC GCCTCGCCCG TGAACTCGAC
AGTACCCGCC AGACCGGCGG CGTGCGTTAT CTCACCGAGA GCGAGCTGCT CGAAGACGTC
TACACGATGA ACGACTTCAT CCTCGGCAAT GAAGAGCTGC CGGGCGCCAA CCGACCGCGC
ACCGCCCTGC GCGCCCAGCA GGAAAATACC GGACTATCGC ACAAGGTGCC GTACCTGATC
ACCGAGTTCA ACGGCCACAT GCACCCGACG AAGATCTATG ACCAGGAGCA GCGCCAGGCC
GAGCATGTGC GCCGGCACCT GGAAGTGCTG AATGCCGCCC ATGGCGATCC TGATATCTCC
GGCGCCATCG GCTGGTGCAT GTTCGATTAC AACACCCACA AGGATTTCGG CTCCGGCGAC
CGCATCTGCT ATCACGGCGT GATGGACATG TTCCGCGAGC CGAAATTCGC GGCCTATGCC
TATATCAGCC AGTGCGACCC TTCCGAGGAG ATCGTCATGA AGCCGGTGAC CTTCTGGGCG
CGCGGCGAAC GCAATATCGG CGGCGTGCTG CCGCTGATCA TCCTGACCAA TTGCGACGAG
GTGGAACTGC AATATGGCGG GCTTTCCAAG CGCATCGGTC CGGATCGCGA GAACTACCCG
CATCTGCCGC ACCCGCCCGT CGTGCTCGAC CATCGGCACT TTACCGCCGA TGAGCTCGGC
ACCTGGGGTC TCGAATGGAT CGACGGCACC TTCACCGGCT ATATCGGCGG CGAGCCGGTG
GCCAGCCTGA CGCTGGTGGC CGATCCGTTG CCGACGACTC TGGAAGTCGT TGCCGACAGC
TCGACGCTGA AGGCCCGTGA ACGCGACAGC ACGCGGGTCA TCATCCGCGC CCTCGACCAG
CGCGGCCAGC GCCTGCCTTT CTTGAACGAC AGCATTTCGC TGAAGGTTCA CGGCCCGGCT
AGGATCGTCG GCCCGACCAA TGTCCCGCTG CAGGGCGGCA CCGCCGGTTT CTGGCTGGAG
GCGACCGGGT TCACCGGCGA GATCACTATC GAAGCGGTTT CCACGCGTTT TGCGTCGGTG
ACGCTCGGCG TGACGGCTGC CTAG
 
Protein sequence
MRSVVSFNEG WSFHEGFGQR LLEAFDAAKS VSLPHTAVEL PFSYFDETSY QRAFTYQKVL 
RWLPEFEGRE VSLVFDAAMA DSVVYLNGEE IIAYKDGYTP FEARLTGKLV KGENLVTVKI
DGSENPDIPP FGGRIDYLTY AGIYRDVWLK VTDPVSIRNL KIETTDVLRP EKSATIRVDI
ANPEGRSFSA TVTATLKQAD GTVVATAATE TIGSRTTLSF GGLTGIALWD ITDPTLYDVT
VELRTEHGSD RISTRFGFRT AEFTPEGFLL NGKPLKLRGL NRHQAFPYVG YAAGRSAQER
DADIMKTVLK CNIVRTSHYP QSKWFLDRCD AIGLLVFEEI PGWQHIGDAD WQQESIENVR
RMIERDWNHP SIIIWGVRIN ESQDNHDFYA KTNRLARELD STRQTGGVRY LTESELLEDV
YTMNDFILGN EELPGANRPR TALRAQQENT GLSHKVPYLI TEFNGHMHPT KIYDQEQRQA
EHVRRHLEVL NAAHGDPDIS GAIGWCMFDY NTHKDFGSGD RICYHGVMDM FREPKFAAYA
YISQCDPSEE IVMKPVTFWA RGERNIGGVL PLIILTNCDE VELQYGGLSK RIGPDRENYP
HLPHPPVVLD HRHFTADELG TWGLEWIDGT FTGYIGGEPV ASLTLVADPL PTTLEVVADS
STLKARERDS TRVIIRALDQ RGQRLPFLND SISLKVHGPA RIVGPTNVPL QGGTAGFWLE
ATGFTGEITI EAVSTRFASV TLGVTAA
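
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such text could be joined into a contiguous sequence, checked for GC content, and translated. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical and not part of the record. The codon table is the standard genetic code, which assigns the same amino acids as translation table 11; the two tables differ only in which codons may act as start codons.

```python
from itertools import product

# Standard genetic code in TCAG order; '*' marks stop codons.
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text):
    """Remove the spaces and line breaks used for display and uppercase the result."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq):
    """Return the percentage of G and C bases in a cleaned nucleotide sequence."""
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate complete codons in frame 1 and drop any trailing stop codon."""
    usable = len(seq) - len(seq) % 3
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))
    return protein.rstrip("*")


# First three blocks of the gene sequence, as printed in the record.
first_blocks = clean("ATGCGTTCCG TCGTTTCCTT CAACGAAGGC")
print(translate(first_blocks))  # MRSVVSFNEG
```

The translated fragment matches the first block of the protein sequence. Passing the full gene sequence text through `clean` and then `gc_percent` or `translate` would allow the same comparison against the GC content and Protein Length fields.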