Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_1866 |
Symbol | |
ID | 8012918 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 1851115 |
End bp | 1852419 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 644824456 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002975688 |
Protein GI | 241204592 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTTTA GGTTCAAGAG CACCGCTTTT GGCGGCAGGC GTATCGCATC GATGGCCGCA GCTGCCGGCA TGTTGCTGGC CGGAGCAGGA GCTGCCTCGG CAACGACGGT GGTCAAGTGG CTGCATCTCG AGCTCGACCC GAAATACGTC GCGGCCTGGG AAGATATCGT CAAGAAATAC GAAGCCCAGC ATCCAGACGT CGATATCCAG ATGCAGTTTC TTGAAAACGA GGCCTTCAAG GCCAAGCTTC CGACATTGCT GCAATCTGAC GACGTGCCGG ATTTCTTTTT CAGCTGGGGC GGCGGCGTCT TGAAGCAGCA GTCCGAGACC GGCGCGCTCC AGGATGTGAC GCCGGCGCTG GATGCCGATG GCGGCAAACT GCGCGGCGCC TATAGCCCGG CTTCGGTCAG CGGCCTGACA TTTGAAGGCA AGACCTGGGC TATTCCCTAT AAGGTCGGTC TGGTCAGCTT TTTCTACAAC AAGGACCTGT TTGCCAAGGC CGGCGTCAAG GCCGAGGACA TCAAGAACTG GGCTGATTTT CTGGGTACCG TGAAGAAGAT CAAAGAGGCA GGCATCGTGC CGATCGCCGG CGGCGGCGGT GAGAAATGGC CGATCCACTT CTACTGGAGC TATCTCGTCA TGCGCGAAGG CGGACAGAAG GTCTTCGAAG CGGCAAAGAC TGGCCAGGGC GAAGGTTTCC TCGATCCTTC GATCATCAAG GCCGGCGACG ATCTCGCTGA ACTCGGAAAG CTCGAACCGT TCCAGCCCGG CTATCTCGGC TCCACCTGGC CGCAGGCGCT CGGCGTTTTC GGCGACGGCA AGGCGGCGAT CATCCTCGGC TTTGAAAATA CCGAGGCCAA CCAGCGCAAG AATGCCGGCG ACGGCAAGGG TCTCGCGCCC GAAAATATCG GCCGCTTCGC CTTCCCGGCC GTCGATGGCG GCGCCGGCAA GCCGACCGAT ACGCTTGGCG GTCTGAACGG CTGGGCTGTC ACCAAGAAGG CATCCAAGGA AGCGCTCGAT TTCCTCGCCT TCCTGACCAA TGCGGACAAT GAGCGGGCGA TGGCCAAGGC CGGCATGCTT CTGCCCGTTG CCGTCGGCGC CGGCGATGGC GTCACCAATC CGCTGCTTGC CGAATCGGCA AAACAGCTGG CCGGTTCGAC CTGGCATCAG AACTATTTCG ACCAGGATCT TGGCGCTGCG GTCGGCCGTG TCGTCAACGA CGTGTCGGTG GAAATCGTCT CCGGGCAGAT GAATTCCAAG GACGGCGCCC AGATGATCCA GGACGCTTTC GAACTGGAAC AATAA
|
Protein sequence | MTFRFKSTAF GGRRIASMAA AAGMLLAGAG AASATTVVKW LHLELDPKYV AAWEDIVKKY EAQHPDVDIQ MQFLENEAFK AKLPTLLQSD DVPDFFFSWG GGVLKQQSET GALQDVTPAL DADGGKLRGA YSPASVSGLT FEGKTWAIPY KVGLVSFFYN KDLFAKAGVK AEDIKNWADF LGTVKKIKEA GIVPIAGGGG EKWPIHFYWS YLVMREGGQK VFEAAKTGQG EGFLDPSIIK AGDDLAELGK LEPFQPGYLG STWPQALGVF GDGKAAIILG FENTEANQRK NAGDGKGLAP ENIGRFAFPA VDGGAGKPTD TLGGLNGWAV TKKASKEALD FLAFLTNADN ERAMAKAGML LPVAVGAGDG VTNPLLAESA KQLAGSTWHQ NYFDQDLGAA VGRVVNDVSV EIVSGQMNSK DGAQMIQDAF ELEQ
|
| |