Gene Rleg_5198 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_5198 
Symbol 
ID8007093 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012848 
Strand
Start bp609165 
End bp610493 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content61% 
IMG OID644822107 
Productextracellular solute-binding protein family 1 
Protein accessionYP_002973367 
Protein GI241113532 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.06332 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATCGT TTTTGAACCC GACGAGGCGC GGCTTTCTGG CGGGTACGGC CGCTTTTGGC 
GCCACCAGCA TGCTCGGCGT GCGGCTGGCA TCGGCTGCGG TCGATTGGAA GCGCTTTGCC
GGCACGACGC TCGAAGTCAA CCTGGTCAAG AGCCCGCGCA GCGAAATACT TCTGAAGTAC
CTGTCCGAAT TCGAGGAGCT CACCGGCATC AAGGTCAATG CCGAAGCGAC GCCCGAACAA
CAGCAACGTC AGAAGACGAC TATCGAGTTG AGCTCCGGCA AGCCGAGCTT CGATGTCGTG
CACATGAGCT ATCATGTCCA GAAGCGGCAA TTCGAAAAGG GCGGCTGGCT TGCCGATATC
AGTGGTTTTC TCAAGGACCC CTCTCTGACT GACCCGTCTC TGGTTGAAAG CGACTTCGCC
GAAGCCGGCC TGACTTTTGC CAAAGATCCG GGCGGCGTTC TGCGTTCGCT TCCATTCTCG
GTCGACTACT GGATCATCTA TTGGAACAAG GCGCTGTTCG AGAAGAAGGG GCTGGCCTAC
CCGACGACAT TCGAAGAACT CGCCAGTGCC GCGGAGGCGC TCACCGATCC TTCCACGAAT
ACCTACGGCT TCGTCGCCCG CGGCCTGAAG AACGCCAATA CGCCGGTCTG GACGTCGCTG
CTGCTTGGCT ATGGTTCGAG CCCGCTCGGC CCGGATGGCA AGCTGCGCAC GACATCGCAA
GAAGCGATCG ATGCGGCCAA GCTTTACCAA AGGCTAATGA CCAAGACCGC CCCTCCCGGC
GTCTCCGGCT TCAACTGGGC TGAGGCACAA TCTGCCTTCC TGCAGGGCAA GATCGGCATG
TGGCTGGATG GCGTCGGTTT TGCGCCGCCG ATCGAGAATC CGGAAAAGTC GCGCGTCGTC
GGCCAGGTCG GTTACGGCAT CATGCCGAAA GGTCCGAAGG CACAGGCCGC AGGCACCTTC
GGCGACGGGC TTGGCGTCGT CGCGGCAAGC CAGAAGAAGG AAGCCGGGTA CCTCTTCTGC
CAATGGGCGA TTTCGCATGA AATGGGCGCA CGTCTGCTGC AGGCCGGCGC CGGCGTTCCT
TTCCGCCAGT CCGTCCTCGA GGATGCGAAG GTCCGCGAAG GCGTCAAGAT GCCGGGCGCC
TGGCTGGATG CCGTCGTCGG TTCCGGCAAG ATTTCGCAGC TCGCGCTGCC GGTCATCATT
CCGGTCACCG AGTTCCGCGA CGTTTACGGT GTCGGTCTCA CCAACATGAT CGGCGGCGCC
GATCCCGAAA CCGAGCTGAA GGCAGCGACG GCACAGTTCG AACCCGTCCT GGCGAAAAGC
GAGGGATAA
 
Protein sequence
MSSFLNPTRR GFLAGTAAFG ATSMLGVRLA SAAVDWKRFA GTTLEVNLVK SPRSEILLKY 
LSEFEELTGI KVNAEATPEQ QQRQKTTIEL SSGKPSFDVV HMSYHVQKRQ FEKGGWLADI
SGFLKDPSLT DPSLVESDFA EAGLTFAKDP GGVLRSLPFS VDYWIIYWNK ALFEKKGLAY
PTTFEELASA AEALTDPSTN TYGFVARGLK NANTPVWTSL LLGYGSSPLG PDGKLRTTSQ
EAIDAAKLYQ RLMTKTAPPG VSGFNWAEAQ SAFLQGKIGM WLDGVGFAPP IENPEKSRVV
GQVGYGIMPK GPKAQAAGTF GDGLGVVAAS QKKEAGYLFC QWAISHEMGA RLLQAGAGVP
FRQSVLEDAK VREGVKMPGA WLDAVVGSGK ISQLALPVII PVTEFRDVYG VGLTNMIGGA
DPETELKAAT AQFEPVLAKS EG