Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1685 |
Symbol | |
ID | 6980422 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1715010 |
End bp | 1716314 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643396409 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002281199 |
Protein GI | 209549282 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATTTTA TGCTGAAGAA CGCCGCTTTC GGCGGCAGGC GCATTGTATC CATGGCCGCG GCAGCCGGCC TGTTGCTGGC GGGAGCGGGT GCCGCCTCGG CGACGACGGT GGTGAAATGG CTGCATCTCG AGCTCGACCC GAAAAACGTT GCGGCCTGGG AAGATATCGT CAAGAAATAC GAAGCCCAGC ATCCCGACGT CGACATCCAG ATGCAGTTCC TCGAAAACGA GGCCTTCAAG GCCAAGCTTC CGACATTGCT GCAGTCCGAC GACGTGCCGG ATTTCTTTTT CAGCTGGGGC GGCGGCGTGT TGAAGCAGCA GTCCGAGACC GGCGCGCTCC AGGATGTGAC GGCAGCGCTT GATGCCGATG GCGGCAAGCT GCGCAGCGCC TATACCCCGG CTTCGGTCGA TGGCCTGACT TTCGAGGGCA AGACCTGGGC CATTCCCTAC AAGGTCGGTC TCGTCAGCTT CTTCTACAAC AAGGCGCTGT TTGCCAAGGC CGGCGTCAAG GCCGAAGACA TCAAGACCTG GAGCGATTTT CTCGCCACGG TGAAGAAGAT CAAGGCGGCC GGCATCGTGC CGATCGCCGG CGGCGGCGGT GAGAAATGGC CGATCCATTT CTACTGGAGC TATCTCGTCA TGCGCGAGGG CGGCCAGAAG GTCTTCGAAG CGGCCAAGAA CGGCGAGGGC GAAGGCTTCC TCGATCCCAC TATCATCAAG GCCGGCGACG ACCTCGCCGA ACTCGGCAAG CTCGAACCGT TCCAGCCCGG CTATCTCGGT GCGACCTGGC CGCAGACGCT CGGCGTTTTC GGCGACGGCA AGGCGGCGAT GATCCTCGGC TTTGAAGCGA CAGAGGCCAA CCAGCGCAAG AATGCCGGCG ACGGCAAGGG GCTTTCCTCA GACAATATCG GCCGTTTCGT CTTCCCGACG GTCGAAGGCG GCGCCGGCAA GCCGACCGAT ACGCTCGGCG GCTTGAACGG CTGGGCCGTC ACCAAGAAGG CCTCCAAGGA AGCGCTCAAT TTCCTCGCTT TCCTGACGAG CGCGGAGAAT GAACGGGCGA TGGCCAAATC AGGCATGTTG CTTCCCGTTG CCGTCGGCGC CGATGACGGC GTCGTCAATC CGTTGCTGGC CGAATCGGCC AAACAGCTTG CCGGTTCGAC CTGGCATCAG AACTTCTTCG ACCAGGATCT CGGCGCTGCC GTCGGCCGCG TCGTCAACGA CGTCTCCGTG GAAATCGTCT CCGGCCAGAT GAATTCCAAG GACGGCGCCC AGATGATCCA GGACGCTTTC GAGCTGGAAC AATAA
|
Protein sequence | MNFMLKNAAF GGRRIVSMAA AAGLLLAGAG AASATTVVKW LHLELDPKNV AAWEDIVKKY EAQHPDVDIQ MQFLENEAFK AKLPTLLQSD DVPDFFFSWG GGVLKQQSET GALQDVTAAL DADGGKLRSA YTPASVDGLT FEGKTWAIPY KVGLVSFFYN KALFAKAGVK AEDIKTWSDF LATVKKIKAA GIVPIAGGGG EKWPIHFYWS YLVMREGGQK VFEAAKNGEG EGFLDPTIIK AGDDLAELGK LEPFQPGYLG ATWPQTLGVF GDGKAAMILG FEATEANQRK NAGDGKGLSS DNIGRFVFPT VEGGAGKPTD TLGGLNGWAV TKKASKEALN FLAFLTSAEN ERAMAKSGML LPVAVGADDG VVNPLLAESA KQLAGSTWHQ NFFDQDLGAA VGRVVNDVSV EIVSGQMNSK DGAQMIQDAF ELEQ
|
| |