Gene Rleg2_4787 details


Gene Information

Locus tag: Rleg2_4787
Symbol:
ID: 6977881
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 423775
End bp: 426237
Gene Length: 2463 bp
Protein Length: 820 aa
Translation table: 11
GC content: 64%
IMG OID: 643393950
Product: glycoside hydrolase family 3 domain protein
Protein accession: YP_002278768
Protein GI: 209546850
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1472] Beta-glucosidase-related glycosidases
TIGRFAM ID:
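
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, not part of the original record. It assumes the coordinates are 1-based and inclusive and that the last codon is a single untranslated stop codon. The variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 423775
end_bp = 426237

# Assumption: coordinates are 1-based and inclusive,
# so both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the coding sequence ends with one stop codon
# that does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 2463 820
```

Under these assumptions the computed values agree with the listed 2463 bp and 820 aa.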


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.294133
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCGATT CCATTCTCGA CCAGATGACA CTCGAGGAGC AGGTCTCGCT GCTATCCGGC 
GCCGATTTCT GGACGACCGT CCCCGTCGAG CGCCTCGGCG TGCCGAAGAT CAAGGTGACG
GACGGCCCGA ATGGCGCGCG CGGCGCCGGT TCCCTGGTCG CCGGCGTCAA GGCCACCTGC
TTTCCCGTCG GCATCGCGCT TGGCGCCACC TGGAATCCCG AGCTCGTCTC GCAGATGGGC
AAAGCCCTTG CGCGTCAGGC AAAGAGCAAG GGTGCCGCGG TGCTGCTTGC GCCCACGGTG
AATATCCATC GTTCCGGGCT CAATGGCCGC AATTTCGAAT GTTATTCCGA AGATCCGATG
CTGACTGCCG AGCTGGCCGT TGCCTATATC GGCGGCGTGC AGGGCGAGGG GATCGCCGCG
ACGATCAAGC ACTTCGCCGG CAACGAGTCC GAGATCGAGC GGCAGACCAT GTCGTCCGAT
ATCGACGAGC GGTCGCTGCG CGAAATTTAC CTGCCGCCCT TCGAACAGGC CGTGCGCCGC
GCCGGTGTGA TGGCGGTCAT GTCCTCCTAT AACCGCCTCA ACGGTACCTA TACGAGCGAG
CATCACTGGC TGCTGACCAA GGTGCTGCGC GAGGAATGGG GTTTCCAAGG TATCGTCATG
TCCGACTGGT TCGGATCACA TTCGACTGAA GAGACGATCA ATGCCGGTCT CGATCTCGAA
ATGCCGGGTC CGGCGCGCGA TCGCGGCGAG AAACTGGTTG CCGCCGTGCA CGAAGGCAAG
GTCGAGGCGG CAACGGTACG GGCCGCGGCG CGGCGGATGC TGCTTCTGCT CGAGCGGGTC
GGCGCCTTCG AGAGCAAACC CGATCTGACC GAACGGGCGG TCGACCTGCC GGAAGACCGG
GCGCTGATCC GGCGCTTGGG CGCAGAGGGC GCGGTGCTCC TTAGGAATGA CGGCATCCTG
CCGCTGGCCA AGACCTCGCT CGACCGGATT GCCGTCATCG GCCCCAATGC GGCAAGCGCG
CGCCTCATGG GCGGCGGCAG CGCACAGATT GCGGCGCATT ATACGGTGAG CCCGCTCGAA
GGCATTCGCG CCGCCCTTGC CAATGCCAAC AGCATCAGCC ATGCGGCCGG CTGCCGCCAT
AACCGTCTGA TCGACGTATT CAAGGGAAAG ATCACGGTCG AATATTTCAA GGGCCGCGGC
TGCAAGGGCA ATCCGCTGCA TGTGGAGACC GTCGACAAGG GCGAATTCTT CTGGTTCGAG
CTGCCGTCGG GCGAACTCGA CCCTGCCGAT TTTTCGGCCA GGATGACCAT GCAATTCGTG
CCGGAGGAGA GCGGCGATCA TGTTTTCGGC ATGACCAATG CCGGCCTGGC GCGGCTCTTT
GTCGATAGCG CATTGACTGT CGACGGCCAT GAGGGCTGGA CGCGCGGCGA AAACTATTTC
GGCACGGCCA ATGACGAGCA GCGCGGCACC GTGGCGCTCG AAGCCGGCCG GGCCTATGCG
GTCACCGTCG AATATGAGCC GTCCACCGCG AGCGGGGAGG GCATCAACCT GATCGCCGTC
CGTTTCGGCG TGGAAAAGCC GCTCGGCGAG GCCGATATCG AGGCAGCCGT CGAGACGGCC
CGCAATGCCG ATCTGGCGCT TCTCTTCGTC GGCCGTGACG GCGAATGGGA TACGGAAGGT
CTTGATCTGC CCGACATGCG GCTTCCGGGC CGGCAGGAGG AACTGATCGA GCGGGTGGCC
GCCGCCAATG CCAACACCGT CGTCGTGCTG CAGACCGGCG GCCCGGTGGA AATGCCCTGG
CTCGGCAAGG TCCGCGCCGT GCTGCAGATC TGGTATCCGG GGCAGGAGCT GGGTAATGCC
GTCGCCGACG TGCTGTTCGG TGATGTCGAG CCGGGCGGCC GCCTGCCGCA GACCTTCCCG
AAGGCGCTCA CCGACAATTC CGCCATCACA GGCGATCCCG CCGTCTATCC CGGCAAGGAC
GGGCATGTGC GCTACGCCGA AGGCGTCTTC GTCGGCTACC GTCATCACGA TACTCACGCC
GTCGAGCCGC TCTTTCCCTT TGGCTTCGGT CTTGGTTATA CGCGCTTCAG CTGGGCTGAG
CCGCGGCTGT CGACCGGCGA AATGGGATCT GAAGGGATCA CCGTGAGCGT CGACCTGACC
AATATCGGCG ACCGGGCGGG CTCGGAACTG GTGCAGCTCT ATGTGCGGTC GCCGGAGACC
AAGGTGGAGC GGCCCGACAA GGAACTGCGC GCCTTCGCAA AGCTTTCGCT GCAGCCCGGC
GAGACCGGCA CGGCGGTGCT GAAGATCCTG CCGCGGGATC TCGCTTATTT CGATGTCGAG
ACCGGCGCCT TCCGCGCCGA ACCCGGCAAC TATCAGCTGG TCGTGGCGGC GAATGCGGCG
GATATCAGGT TTGTCATTGA TCTGTCCTCA CCACTCACCC ACGTGCTGCC GCCTTCGCAT
TAG
 
Protein sequence
MIDSILDQMT LEEQVSLLSG ADFWTTVPVE RLGVPKIKVT DGPNGARGAG SLVAGVKATC 
FPVGIALGAT WNPELVSQMG KALARQAKSK GAAVLLAPTV NIHRSGLNGR NFECYSEDPM
LTAELAVAYI GGVQGEGIAA TIKHFAGNES EIERQTMSSD IDERSLREIY LPPFEQAVRR
AGVMAVMSSY NRLNGTYTSE HHWLLTKVLR EEWGFQGIVM SDWFGSHSTE ETINAGLDLE
MPGPARDRGE KLVAAVHEGK VEAATVRAAA RRMLLLLERV GAFESKPDLT ERAVDLPEDR
ALIRRLGAEG AVLLRNDGIL PLAKTSLDRI AVIGPNAASA RLMGGGSAQI AAHYTVSPLE
GIRAALANAN SISHAAGCRH NRLIDVFKGK ITVEYFKGRG CKGNPLHVET VDKGEFFWFE
LPSGELDPAD FSARMTMQFV PEESGDHVFG MTNAGLARLF VDSALTVDGH EGWTRGENYF
GTANDEQRGT VALEAGRAYA VTVEYEPSTA SGEGINLIAV RFGVEKPLGE ADIEAAVETA
RNADLALLFV GRDGEWDTEG LDLPDMRLPG RQEELIERVA AANANTVVVL QTGGPVEMPW
LGKVRAVLQI WYPGQELGNA VADVLFGDVE PGGRLPQTFP KALTDNSAIT GDPAVYPGKD
GHVRYAEGVF VGYRHHDTHA VEPLFPFGFG LGYTRFSWAE PRLSTGEMGS EGITVSVDLT
NIGDRAGSEL VQLYVRSPET KVERPDKELR AFAKLSLQPG ETGTAVLKIL PRDLAYFDVE
TGAFRAEPGN YQLVVAANAA DIRFVIDLSS PLTHVLPPSH