Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5349 |
Symbol | |
ID | 6978443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | + |
Start bp | 974497 |
End bp | 975717 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643394451 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002279269 |
Protein GI | 209547351 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.535832 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGTTC TGAAGAAAGC GCTTTTGCTC GCCACAATCG GCGGCAGTCT TTTTGCCACA GCCGCATCGG CCGAACAGGT CAATCTGACC TGGCAGATGT GGACCGGTTC CGATGCCGAC ACCAAGGGCT GGCAGCACCT GGCCGACATG GTCACCGCCA AGTACCCCGA TATCAAGGTG GCGCTGACGA CGACCGGCTG GGTCGACTAC TGGACGAGGC TGCCGGTGCT GGCGGCGTCG GGCCAGCTTG CCGACATCGT TTCCATGCAG TCGCTGCGCA TGCCGAATTT CTATTCGCTG CTCGAGCCCC TGAATGACCG GATCGCCGCC GACAAATTCG ATGTCGGCGC TTTCACCCCC TCGATCATCG GCGGCATGTC CGTTGACAAG CAGCTCTATG GCCTGCCTTA CGACGTCGGT CCGTGGGTCA TCTATTATAA CCAGGACGCG CTCGAAGCCG CCGGCGTTCC GCTGCCGAAG CCGGGCTGGA CGCTTGCCGA ATTCACCGAT GCTGCCAAGA AGCTGACCAA GGACGGCAAA TACGGCTTCG GCATCACCCC GCAGAACTAT TCGGTCCTGG CGGCTGCCTG GGGCGATAAA TATGTCAATG ATGCAGGAGA GCTCGACCTC ACCAATCCGA GCGCCATAGC TGCGGCAGAG AGGGTGATCG GCTTTGCCGC CAAGGATAAG GTAGCGCCGC TGGTGCCGTC GAGCGCCGAT GCCGGCACGG TCATCCAAGG CCGGTTCTAT TCGGGCAATG TCGCCATGTA TGTCGACGGT CCATGGTCGA TCATCGGCAT GAAGGACAAG GTCAAGTTCA AGATTGGTTC CACTTCCCTG CCGCGCGGGG ACAGTGAGCT TACCGCCGTC ACGGCGGGCT CGGGTTTCGG CATCGCCACG ACGAGCAAGA ACAAGGATGC GGCCTGGAAG GCGATCCAGG TGCTGACCAG CCCGGAGGCG CTGCAGTATC TCGCCGAACA GGGACGTGCG CTGCCGGCGC GCACGGCGTC GCAATCCTCC TGGTACAAAG TCGCAGCCAA GGACATCACC AATGGCGGCG AGGCTATCGA CTATTCTCTG GCGCATTCCG TGCCCTACGT GATCACCAAC AACTGGGCGG CGGTCGAAAA CCTGTTCAAC CAGTATTTCC CGCCGGCCTT CGGCGGCAGC GCCGACGCCA AGCAGACGAT GGAGTCGATC CAGAGCCTCG CGCAGCAATA A
|
Protein sequence | MKVLKKALLL ATIGGSLFAT AASAEQVNLT WQMWTGSDAD TKGWQHLADM VTAKYPDIKV ALTTTGWVDY WTRLPVLAAS GQLADIVSMQ SLRMPNFYSL LEPLNDRIAA DKFDVGAFTP SIIGGMSVDK QLYGLPYDVG PWVIYYNQDA LEAAGVPLPK PGWTLAEFTD AAKKLTKDGK YGFGITPQNY SVLAAAWGDK YVNDAGELDL TNPSAIAAAE RVIGFAAKDK VAPLVPSSAD AGTVIQGRFY SGNVAMYVDG PWSIIGMKDK VKFKIGSTSL PRGDSELTAV TAGSGFGIAT TSKNKDAAWK AIQVLTSPEA LQYLAEQGRA LPARTASQSS WYKVAAKDIT NGGEAIDYSL AHSVPYVITN NWAAVENLFN QYFPPAFGGS ADAKQTMESI QSLAQQ
|
| |