Gene Information |
Locus tag | Rleg_5198 |
Symbol | |
ID | 8007093 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 609165 |
End bp | 610493 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644822107 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002973367 |
Protein GI | 241113532 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| |
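The coordinate and length fields in the record above agree with one another, and the arithmetic can be checked with a short sketch. It assumes 1-based, inclusive coordinates (the convention under which the listed values match) and that the last codon of the CDS is a stop codon not counted in the protein length. The function and variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
START_BP = 609165
END_BP = 610493
GENE_LENGTH_BP = 1329
PROTEIN_LENGTH_AA = 442


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


computed_gene_len = gene_length(START_BP, END_BP)
computed_protein_len = protein_length(computed_gene_len)
print(computed_gene_len, computed_protein_len)  # 1329 442
```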
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.06332 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCATCGT TTTTGAACCC GACGAGGCGC GGCTTTCTGG CGGGTACGGC CGCTTTTGGC GCCACCAGCA TGCTCGGCGT GCGGCTGGCA TCGGCTGCGG TCGATTGGAA GCGCTTTGCC GGCACGACGC TCGAAGTCAA CCTGGTCAAG AGCCCGCGCA GCGAAATACT TCTGAAGTAC CTGTCCGAAT TCGAGGAGCT CACCGGCATC AAGGTCAATG CCGAAGCGAC GCCCGAACAA CAGCAACGTC AGAAGACGAC TATCGAGTTG AGCTCCGGCA AGCCGAGCTT CGATGTCGTG CACATGAGCT ATCATGTCCA GAAGCGGCAA TTCGAAAAGG GCGGCTGGCT TGCCGATATC AGTGGTTTTC TCAAGGACCC CTCTCTGACT GACCCGTCTC TGGTTGAAAG CGACTTCGCC GAAGCCGGCC TGACTTTTGC CAAAGATCCG GGCGGCGTTC TGCGTTCGCT TCCATTCTCG GTCGACTACT GGATCATCTA TTGGAACAAG GCGCTGTTCG AGAAGAAGGG GCTGGCCTAC CCGACGACAT TCGAAGAACT CGCCAGTGCC GCGGAGGCGC TCACCGATCC TTCCACGAAT ACCTACGGCT TCGTCGCCCG CGGCCTGAAG AACGCCAATA CGCCGGTCTG GACGTCGCTG CTGCTTGGCT ATGGTTCGAG CCCGCTCGGC CCGGATGGCA AGCTGCGCAC GACATCGCAA GAAGCGATCG ATGCGGCCAA GCTTTACCAA AGGCTAATGA CCAAGACCGC CCCTCCCGGC GTCTCCGGCT TCAACTGGGC TGAGGCACAA TCTGCCTTCC TGCAGGGCAA GATCGGCATG TGGCTGGATG GCGTCGGTTT TGCGCCGCCG ATCGAGAATC CGGAAAAGTC GCGCGTCGTC GGCCAGGTCG GTTACGGCAT CATGCCGAAA GGTCCGAAGG CACAGGCCGC AGGCACCTTC GGCGACGGGC TTGGCGTCGT CGCGGCAAGC CAGAAGAAGG AAGCCGGGTA CCTCTTCTGC CAATGGGCGA TTTCGCATGA AATGGGCGCA CGTCTGCTGC AGGCCGGCGC CGGCGTTCCT TTCCGCCAGT CCGTCCTCGA GGATGCGAAG GTCCGCGAAG GCGTCAAGAT GCCGGGCGCC TGGCTGGATG CCGTCGTCGG TTCCGGCAAG ATTTCGCAGC TCGCGCTGCC GGTCATCATT CCGGTCACCG AGTTCCGCGA CGTTTACGGT GTCGGTCTCA CCAACATGAT CGGCGGCGCC GATCCCGAAA CCGAGCTGAA GGCAGCGACG GCACAGTTCG AACCCGTCCT GGCGAAAAGC GAGGGATAA
|
Protein sequence | MSSFLNPTRR GFLAGTAAFG ATSMLGVRLA SAAVDWKRFA GTTLEVNLVK SPRSEILLKY LSEFEELTGI KVNAEATPEQ QQRQKTTIEL SSGKPSFDVV HMSYHVQKRQ FEKGGWLADI SGFLKDPSLT DPSLVESDFA EAGLTFAKDP GGVLRSLPFS VDYWIIYWNK ALFEKKGLAY PTTFEELASA AEALTDPSTN TYGFVARGLK NANTPVWTSL LLGYGSSPLG PDGKLRTTSQ EAIDAAKLYQ RLMTKTAPPG VSGFNWAEAQ SAFLQGKIGM WLDGVGFAPP IENPEKSRVV GQVGYGIMPK GPKAQAAGTF GDGLGVVAAS QKKEAGYLFC QWAISHEMGA RLLQAGAGVP FRQSVLEDAK VREGVKMPGA WLDAVVGSGK ISQLALPVII PVTEFRDVYG VGLTNMIGGA DPETELKAAT AQFEPVLAKS EG
|
| |
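The gene and protein sequences above can be related to each other with a minimal sketch that translates a CDS and computes its GC content. It relies on the fact that translation table 11 assigns the same amino acid to every codon as the standard code (the tables differ only in which codons may initiate translation). Alternative start codons are not handled, which is harmless here because this gene begins with ATG. All names below are hypothetical helpers, and only a short fragment of the record is used as input.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in
# T, C, A, G order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
fragment = "ATGTCATCGT TTTTGAACCC GACGAGGCGC"
print(translate(fragment))  # MSSFLNPTRR
```

Passing the full gene sequence to `translate` and `gc_content` would allow the listed protein sequence and GC content to be compared against values computed directly from the nucleotides.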