Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5744 |
Symbol | |
ID | 6977134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | + |
Start bp | 146275 |
End bp | 147534 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643393200 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002278018 |
Protein GI | 209546128 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.593109 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAAT TGCTTGGCGC CAGCGCGCTC GCGCTCGTTT GCATTGCGGC CTCCGCTAAG GCCGAGACCA TCAATATCCT CGTGGAAGGT GGCGGCGAAA TGCTGCAGAA GGCAGCCGCG GAGAAGTTCA CCGCCGAGAC CGGCATCAAG GTGAACTTCA CCGTCGTTCC CTATCAGGGC GTCTTTGACA AGTTTTCGGC CGAAATCGCC TCCGGTTCAT CGGCGTTCGA TGTGGTAACG ATCGACGTCG TGTGGAACGC GAAATTCGCA AGCCACGTCG AGGACCTCGC GCCGCTTTTC ACCGATACGG TTCGTAAGGA CCTGCCGCCT GCCTTGCTTG CCGACGCGAA GATCGGTGAC AAACTGATTG GCATGCCTGC TTGGGCCAAT GCCGAAATTG TTTTCTATCG CAAGGACCTC TTCGACAAAC CGGAGGAACA GAAAGCGTTC CAGGCAAAAT ATGGTTATCC CCTCGCCCCG CCGCAGACTT GGCAGCAATG GCGCGACATC GCAAAATTCT TCACCCGAGA CACAGACGGT GACGGCAAGG CCGATTTTTG GGGCACCGAC ACCATCGGCA GCTTCTCCGA AGAGTGGATG GCGCATGTGC TTCAGGCGGG CTCGCCAGGT GTGATCCTCG ACAAGGATGG GAAGGTGATC ATCGACAACG AGGCACATAA GAAGGCGCTC GAATTCTACA TCGCGCCCCA CTGCATCGAT CATTCCGTGC CCGAAAACGT CAACGAGATC GGCTGGGGCG AAGCTCAGAA CCTGTTCTAT CAAGGCAAGA CGGCGATGAT GAAATTCTGG GCGCACGCCT ACAAGATGAC GCCCACCGAT TCCAAGGTCA GCGGCAAGGT CGGCGTCGTC CCGATGCTAG CCGGCGATGC AGGGATCGCG GCCATTCCGG GTCCTTGGTA CAATGTCATT CCGTCGACCT CCCAGCACAA GGACGCGGCC AAGAAGTTCA TAGCCTTCGC GATCGCCAAC AATGCCCTGG GTATCGAAGC GCCGCTTGGA TTGGCTGCGA CGAACTCGGC CTATAACAGC TATGCCAGCA AGCCCGGTTA TGAGCATTTC ACGCCGTTGC TTGAGACGTT GCATGCTCCT GCCACGCAAG GCCGTCCGAT GAACGAAAAA TATCAGGAAA TTGTCGACGA AGCGGTTCTT CCCGCGGTGC AACAGGCCCT GACGTGCAAA GCGGATGTCG GCAAGGTCCT GACGGACGCC AAGGAAACGA TCGAAGACAT CCTCGACTAA
|
Protein sequence | MKKLLGASAL ALVCIAASAK AETINILVEG GGEMLQKAAA EKFTAETGIK VNFTVVPYQG VFDKFSAEIA SGSSAFDVVT IDVVWNAKFA SHVEDLAPLF TDTVRKDLPP ALLADAKIGD KLIGMPAWAN AEIVFYRKDL FDKPEEQKAF QAKYGYPLAP PQTWQQWRDI AKFFTRDTDG DGKADFWGTD TIGSFSEEWM AHVLQAGSPG VILDKDGKVI IDNEAHKKAL EFYIAPHCID HSVPENVNEI GWGEAQNLFY QGKTAMMKFW AHAYKMTPTD SKVSGKVGVV PMLAGDAGIA AIPGPWYNVI PSTSQHKDAA KKFIAFAIAN NALGIEAPLG LAATNSAYNS YASKPGYEHF TPLLETLHAP ATQGRPMNEK YQEIVDEAVL PAVQQALTCK ADVGKVLTDA KETIEDILD
|
| |