Gene Information |
Locus tag | Rleg2_5468 |
Symbol | |
ID | 6978562 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 1115487 |
End bp | 1116815 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643394568 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002279386 |
Protein GI | 209547468 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
| |
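The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions on the replicon, and that the last codon of the CDS is a stop codon that is not counted in the protein length. Neither convention is stated on this page. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1115487
end_bp = 1116815

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1329
print(protein_length)  # 442
```

Under these assumptions the results agree with the Gene Length (1329 bp) and Protein Length (442 aa) rows.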
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.303073 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCATCAT TTTTGAATCC GACGCGGAGA GGTTTTCTGG CGGGCACGGC CGCGCTTGGC GCCAGCAGCA TGCTCGGCAT GCGCTCGGCA TCTGCTGCCG TCGACTGGAA GCGCTTCACC GGCACCACGC TCGAGGTCAA TCTGGTCAAG AGCCCGCGCA GCGAAATACT CATGAAGTAC CTGCCCGAAT TCGAGGAGCT CACCGGCATC AAGGTCAATG CCGAGGCGAC GCCCGAGCAG CAGCAGCGCC AGAAAACGAC GATTGAGCTC AGCTCCGGCA AGCCGAGCTT CGACGTCGTG CATATGAGCT ATCACGTCCA GAAGCGGCAG TTCGAGAAGG GCGGCTGGCT TGCCGATATC GGCGGCTTCC TCAAGGATCC CGCGCTGACC GACCCGTCGC TGACGGAGGG TGATTTCGCC GAAGCCGGCC TCGCCTTCGC CAAGGACGCC GGTGGCGCTC TGCGTTCTCT TCCCTTCTCG GTCGATTACT GGATCATCTA CTGGAACAAG GCGCTGTTCG AAAAGAAGGG TCTTTCCTAC CCGACGACGT TCGAAGAGCT GGCGAGTGCG GCCGAAGCCC TCACCGATCC GTCGACCAAC ACCTACGGTT TCGTCGCTCG CGGACTGAAG AACGCCAACA CGCCGGTCTG GACGTCGCTG CTGCTTGGTT ATGGTTCGAG CCCGCTCGGC CCTGACGGCA AGCTGCGCAC GACATCGCCG GAAGCGATCG ATGCCGCCAA GCTTTATCAG AAGCTGATGA CCAAGACCGC CCCTCCCGGC GTTTCCGGCT TCAACTGGGC CGAGGCCCAG TCGGCCTTTC TGCAGGGCAA GATCGGGATG TGGCTCGATG GCGTCGGCTT TGCGCCGCCG ATCGAGAATC CGGAAAAGTC ACGCGTCGTC GGCCAGGTCG GTTACGGCAT CATGCCGAAA GGCCCGAAGG CGCAGGCCGC GGGAACCTTC GGCGACGGGC TTGGCGTCGT CGCGGCCAGC CAGAAAAAGG AAGCCGCCTA CCTCTTCTGC CAATGGGCGA TTTCGCATGA CATGGGCGCC CGCCTGCTGC AGGCCGGTGC CGGCGTTCCG TTCCGCCAGT CTGTCCTGGA GGATGCGAAA GTCCGCGAAG GCGTCAAGAT GCCTGGCGCG TGGCTCGATG CCGTCGTCGG TTCTGGCAAG ATCTCGCAGC TCGCGCTGCC GGTCATCATT CCGGTCACCG AGTTCCGCGA CATCTATGGC GTCGGTCTCA CCAATATGAT TGGCGGCGCC GATCCGGAAG CCGAACTCAA GGCGGCGACG GCGCAGTTCG AACCCGTCCT GGCGAAAAGC GAGGGATAA
|
Protein sequence | MPSFLNPTRR GFLAGTAALG ASSMLGMRSA SAAVDWKRFT GTTLEVNLVK SPRSEILMKY LPEFEELTGI KVNAEATPEQ QQRQKTTIEL SSGKPSFDVV HMSYHVQKRQ FEKGGWLADI GGFLKDPALT DPSLTEGDFA EAGLAFAKDA GGALRSLPFS VDYWIIYWNK ALFEKKGLSY PTTFEELASA AEALTDPSTN TYGFVARGLK NANTPVWTSL LLGYGSSPLG PDGKLRTTSP EAIDAAKLYQ KLMTKTAPPG VSGFNWAEAQ SAFLQGKIGM WLDGVGFAPP IENPEKSRVV GQVGYGIMPK GPKAQAAGTF GDGLGVVAAS QKKEAAYLFC QWAISHDMGA RLLQAGAGVP FRQSVLEDAK VREGVKMPGA WLDAVVGSGK ISQLALPVII PVTEFRDIYG VGLTNMIGGA DPEAELKAAT AQFEPVLAKS EG
|
| |
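The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is a minimal illustration, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons are permitted as starts). It does not handle alternative start codons, which is not an issue here because the gene begins with ATG. The function names `clean`, `translate`, and `gc_content` are hypothetical helpers introduced for illustration.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-letter blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq):
    """Return the fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks and last nine bases of the gene sequence shown above.
first_blocks = "ATGCCATCAT TTTTGAATCC GACGCGGAGA"
last_codons = "GAGGGATAA"

print(translate(first_blocks))  # MPSFLNPTRR
print(translate(last_codons))   # EG
```

The two outputs match the first block (MPSFLNPTRR) and the final residues (EG) of the protein sequence. Passing the full gene sequence to `gc_content` would give a value to compare against the GC content row; that result is not reproduced here.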