Gene Rleg_6261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6261 
Symbol 
ID8016132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012852 
Strand
Start bp325691 
End bp326977 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content58% 
IMG OID644827564 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002978764 
Protein GI241258880 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.689417 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCTCG GATTGGCAAA TGCAGGCGAC GCCAGTGCAG CTGATCGCGT CAAGATCGAG 
TGGTGGAACG CAGCAAACGG CCGCCTGGCC GAGATCACCA AACAGCTGAT TTCGGACTTC
AATGCCTCGC AGGACAAATA TGAGCTCGTT GGCATCAGCA AAGGCAATTA CGAGGAAACC
ATGGCGGCGA TGGTGGCTGC CTATCGCGTC GGTCAGCAGC CCGTGCTTAT CCAAGCCGCC
GAGCGAGGCT TTCTGACCAT GTATAATTCC GGCGCCATCA TCCCGGTGCC GGAGCTTATG
GAGAAGGAAG GCTACAAGAT CGACTGGGGC AATTTCATCG CTCCGGTCGC GGGCTTTTAT
CTCGTTGACG GCAAGCCGGC GGCAATGCCC TTCAACAGCT CGACACCGAT CTTCTGGTAC
AATGCCGATC ACTTCAAGGC AGCCGGCTTC GACAAGCCGG CCGAGACCTG GCAGGAACTC
GACAAGCAGT TGCACGCCAT CAAGGAGAAG GGAATTTCAA AGTGCCAGAT GGCGCTTGCG
AACGACTTCT ATTGGAGCCT GATCGAGAAC TACGCCGCGA TCCAGGACCA GCCTTACGGT
ACCAAGGCAA ACGGCTTCGG TGGTCTCGAT ACCGAATTCA TCTTCAACAA GAGCCCGCTG
ATCGTCGGCC AGGTGACACG CCTCAAGACG TGGCTCGACG ATGGGGTCCT GCAGATCGCA
GGGCAAGGCC TCTCACCCGA CCAGCTGTTT ACCTCTGGCA GTTGCTCGAC CTATGTGGCC
TCGACCGCGG CGCATGCCGC TGTTGAAAGC GGTGCGAAAT TTCAATGGAG CGCGACGTTC
CTGCCGCATG AGGAGGGCAT CGAGCCTAAG AACAGCACCA TTGGCGGCGG AGCGCTTTGG
GTGTTGAAAG GCAAGTCGGA CGAAGAATAC GCAGGCACTG CGGCCTTCTT GAATTTTGTC
GCCTTGCCGA AGACACAAGT CTGGTGGAGC AAGCAAACCG GCTATGTCCC GGTGACCAAT
GCCGCCTACG AAGAGGCCAA ATCCGAGGGT TATTTCAAGG AGCATCCGAC CCGCGAGGTC
GCCATTCTCC AACTCACGCG CGGCACGCCA ACCGACAATT CGCGCGGCTT CCGCTTTGGC
AATCACAACC AGTCGATGGC GCTTCTGGTT GAGGAGATCC AAGGCGTGTG GACCGGACAA
AAGACGCCGC AGCAGGCACT GGATGCTGCG GCGGCCCGCG GAAACCAGAT CCTTCGGCAG
TATGAGCAGC TTCATGCAGC AAAGTAA
 
Protein sequence
MTLGLANAGD ASAADRVKIE WWNAANGRLA EITKQLISDF NASQDKYELV GISKGNYEET 
MAAMVAAYRV GQQPVLIQAA ERGFLTMYNS GAIIPVPELM EKEGYKIDWG NFIAPVAGFY
LVDGKPAAMP FNSSTPIFWY NADHFKAAGF DKPAETWQEL DKQLHAIKEK GISKCQMALA
NDFYWSLIEN YAAIQDQPYG TKANGFGGLD TEFIFNKSPL IVGQVTRLKT WLDDGVLQIA
GQGLSPDQLF TSGSCSTYVA STAAHAAVES GAKFQWSATF LPHEEGIEPK NSTIGGGALW
VLKGKSDEEY AGTAAFLNFV ALPKTQVWWS KQTGYVPVTN AAYEEAKSEG YFKEHPTREV
AILQLTRGTP TDNSRGFRFG NHNQSMALLV EEIQGVWTGQ KTPQQALDAA AARGNQILRQ
YEQLHAAK