Gene Rleg_4121 details

Gene Information

Locus tag: Rleg_4121
Symbol:
ID: 8014918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4202839
End bp: 4205319
Gene Length: 2481 bp
Protein Length: 826 aa
Translation table: 11
GC content: 65%
IMG OID: 644826691
Product: glycoside hydrolase family 2 sugar binding
Protein accession: YP_002977901
Protein GI: 241206805
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
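
The start, end, gene length and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the function names are hypothetical and not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4202839
END_BP = 4205319
GENE_LENGTH_BP = 2481
PROTEIN_LENGTH_AA = 826


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_length_aa(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold for this record: the span is 2481 bp, which is 827 codons, or 826 residues plus one stop codon.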


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGGGGC GTTTGGCGAG CATCGGTGCC GAGGAAAGTC TACTTTCCGA AGGCTGGAAC 
CTGATCCTGA CGGAGCCGGG CGCCTGCGCC GTGCCGCACG ACATCCCGCT CTCCGCACAA
TTCATTCCAG CACCCGTTCC CGGCACCGTC GCCGCCGCGC TCGAAAAAGC CGGGCTCTTC
GACCGGGAAA ATCCCGAGCC GCTGAACACG CGGGATGCCT GGTATCTCTG CCGGCTTTTC
GATGCCGAGC CCGGCGACGC GATCCTGCGT TTCGCAGGGC TGGCAACGCT CTGCCATGTC
TTCCTGAACG GCCAGGAAAT CCTGTTTTCC GAGAGCATGT TCACGGCGCA CGAGATTCCG
GTGACGCTTT CGGGCGGCGA CGAACTGGCG CTGTGCTTCC GGGCGCTTGG GCCCCGCCTG
TCGGAGCCAG GCCCGCGCGC GCGCTGGCGG CCGCAGATGA TCACGCCGGC GGGCTTGAAG
AATTTCCGCA CGACGCTGCT CGGCCATATG CCGGGCTGGT GCCCCGATAT CCATGCCGTC
GGGCCATGGC GGCCGATTTC GATGGTGCGG CGTCATCCCG TCTCGATCGA CAATGTCTCC
ATCCGCGCCG TATTGGAGGA GAGTGGCGTC GGCCGTCTCA GCGTGTCCCT GCATAGCAAT
GCCGAGGATC CGGCGATGCT GCTGCGCTGC GGCGGGATGG AGCAGCCCTT CGAGAAGGTC
GGCGACAGTC ATTACTCGGC TATCCTCAAG CTTTCCGACA TCGAGGCCTG GTGGCCGCAT
ACGCATGGTG CTCCGCGTCT CTACGCGCTG ACACTGGTTT CGGACGGGGT GGAATATCCG
CTCGGCAGGA CCGGCTTTCG ACGTATCGAC GTCGACCGTG GTGCCGATGG CGACGACTTC
GCGCTTCTCG TCAACGGCGA ACGCATCTTC TGCCGTGGTG CCGTGTGGAC GACGGCCGAT
ATCGTGCGAT TGCCGGGCGG GCGGGCGGAT TATGAGCCGC TCCTGCGGCT TGCCGCTGCA
GGCGGCATGA ACATGATCCG CATCGGCGGC ACCATGGCCT ACGAAACGCC TGATTTCTTC
GCACTCTGCG ACGAGCTCGG TCTGCTCGTC TGGCAGGACT TCATGTTCGC CAATTTTGAT
TATCCACGCA ACGACAAGGC TTTTCTTGGT CACGTGCATG CGGAGGTCGA GGAATTCCTC
CACGGCGTGC AGGCGTCGCC TTCGTTGGCC GCGCTCTGCG GCGGCAGCGA AGTCCACCAG
CAGGCGGCAA TGCTCGGCCT GCCGGCGGAA TTCTGGAGCG GGCCGGTCAC CGATGAAATC
ATCCCGGCGG TCGTCGCGCG CATGCGCCCC GATGTGCCCT ATGTGCCGAA CTCGCCCTAT
GGCGGAGCGA TGCCCTTTTC GCCGAATGCC GGTATTGCCC ATTATTACGG CGTCGGCGCC
TATATGCGGC CAATTGCCGA TGCGCGCCGC GCCGATGTGC GTTTTGCCTC CGAAAGCCTC
GCCTTCGCGC ATGTGCCGCA GCAAAGGACG CTGCAGCGTC ATCTCGATGT GCCCGCCGTC
CACAGTCCGC TGTGGAAGGC CCGCGTGCCC CGCGACCGCA GCGCATCGTG GGATTTCGAG
GATGTTCGCG ATTTCTACCT GCAGCTTCTC TACGGTTTCG ATCCGGCTGA GCTGCGCCGC
GAAGATCCGG AACGCTATCT CGATCTCTCC CGCGCCGTTA CCGGCGAGGT GATCGAGGAG
ACTTTTGCCG AATGGCGGCG CAAGGGCTCC GCCTGCAACG GCGCACTCGT CTGGACGCTG
CAGGACCTGT TGCCCGGTCC CGGCTGGGGA GTGATCGATT CCACCGGAGA GCCGAAGCCT
GTCTGGTATG CGATGCGCCG TGCATTCCGG CCGGTGCAAG TGGTCTTCAC CGACGAGGGA
ACGAACGGTC TCGACGTGCA TGTCGTCAAC GAGACGGATG CCGCGCTTGA CGTGGAGCTC
GAGGTCGTCT GCCTGCGCGG CGGAAAACAG CAGGTCGTCA GCGGCAGCAG GGCCTTCAAG
CTGGCGGCAA GAGACACGGA GCGTCTTGCC TGCACCGCGC TGTTCGGCGC CTTCTTCGAT
ACGACCTATG CCTTCCGTTT CGGGCCGCCG GCGCATGATG CCAGTGTGGC GCGCCTGCGC
TCGCTGGCAG ACGGCGCCAT TCTCGCAGAG AGCTTCCACT TCCCGTGCGG ACGGGGGAAG
GCGCTGCATG ACGCAGGGAT CGAAGCATCA TTCACCAGAG ACGGCGACGA CTGGTTCGTC
GATCTCAGGA CCGACCGGCT GGCGCAATCG GTGCATATCG ACGTCGAAGG CTATCGGGCC
GACGACGACT GGTTCCACCT TGCCCCCGGC GCGATGCGGC GTGTGCAGCT CCACGCGCTG
TCCGGCGTGG AAAGCGATAC TCCGCCTGCG GGCGAGATCA GAAGTCTAGG CAGTTCGCAT
CGTGTCGCAA TCGAGGGCTG A
 
Protein sequence
MRGRLASIGA EESLLSEGWN LILTEPGACA VPHDIPLSAQ FIPAPVPGTV AAALEKAGLF 
DRENPEPLNT RDAWYLCRLF DAEPGDAILR FAGLATLCHV FLNGQEILFS ESMFTAHEIP
VTLSGGDELA LCFRALGPRL SEPGPRARWR PQMITPAGLK NFRTTLLGHM PGWCPDIHAV
GPWRPISMVR RHPVSIDNVS IRAVLEESGV GRLSVSLHSN AEDPAMLLRC GGMEQPFEKV
GDSHYSAILK LSDIEAWWPH THGAPRLYAL TLVSDGVEYP LGRTGFRRID VDRGADGDDF
ALLVNGERIF CRGAVWTTAD IVRLPGGRAD YEPLLRLAAA GGMNMIRIGG TMAYETPDFF
ALCDELGLLV WQDFMFANFD YPRNDKAFLG HVHAEVEEFL HGVQASPSLA ALCGGSEVHQ
QAAMLGLPAE FWSGPVTDEI IPAVVARMRP DVPYVPNSPY GGAMPFSPNA GIAHYYGVGA
YMRPIADARR ADVRFASESL AFAHVPQQRT LQRHLDVPAV HSPLWKARVP RDRSASWDFE
DVRDFYLQLL YGFDPAELRR EDPERYLDLS RAVTGEVIEE TFAEWRRKGS ACNGALVWTL
QDLLPGPGWG VIDSTGEPKP VWYAMRRAFR PVQVVFTDEG TNGLDVHVVN ETDAALDVEL
EVVCLRGGKQ QVVSGSRAFK LAARDTERLA CTALFGAFFD TTYAFRFGPP AHDASVARLR
SLADGAILAE SFHFPCGRGK ALHDAGIEAS FTRDGDDWFV DLRTDRLAQS VHIDVEGYRA
DDDWFHLAPG AMRRVQLHAL SGVESDTPPA GEIRSLGSSH RVAIEG
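
The protein sequence can be checked against the gene sequence by translating it with translation table 11. The sketch below is illustrative only: it builds the codon table from the NCBI amino-acid string (table 11 assigns the same amino acids as the standard code and differs in its permitted start codons), and it translates just the first and last lines of the gene sequence rather than the full record. The `translate` helper is a hypothetical name, not a database or library function.

```python
# Codons are ordered T, C, A, G at each position, as in the NCBI tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in frame 1, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Trailing bases that do not fill a codon are skipped.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from this record.
FIRST_LINE = "ATGCGGGGGC GTTTGGCGAG CATCGGTGCC GAGGAAAGTC TACTTTCCGA AGGCTGGAAC"
LAST_LINE = "CGTGTCGCAA TCGAGGGCTG A"

print(translate(FIRST_LINE))  # MRGRLASIGAEESLLSEGWN
print(translate(LAST_LINE))   # RVAIEG
```

The two outputs match the first 20 and last 6 residues of the protein sequence above. The last line can be translated on its own because the preceding 41 lines of 60 bases each keep it in frame. Any character other than A, C, G or T raises a `KeyError` in this sketch, so ambiguity codes would need separate handling.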