Gene Information |
Locus tag | Rleg2_1218 |
Symbol | |
ID | 6979939 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1229401 |
End bp | 1230735 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643395932 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002280738 |
Protein GI | 209548821 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
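
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length (the gene sequence in this record ends in TAA). The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1229401, 1230735)
print(length)                     # 1335
print(protein_length_aa(length))  # 444
```

Both results agree with the Gene Length and Protein Length fields, which supports the inclusive-coordinate assumption for this record.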
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACAGAA CCGTGAAATC GATCAGCATA CTGCCGGCAT ACCGATTGAA AGCAGCTGTC GCGCTTGCCG GCGCGGCGCT TTTGAGTTTC GGCCTTTCCG CCGCCCGGGC CGACGATCTG GCCGTGTGGG ATGACCAGAC CTTCGAAGGC CAGAGCGCGG TCATCGAGCA ACTGAATAAG GATTTCGAAG CCGCGCATCC CGGTGTCACG ATCAAACGCA CCGCGCGCAC TTTCGATGAC ATGAAGCTGA CGCTGAAGCT TGCAGTTTCG GCAGGTGATG GCCCCGTCAT CACCAAGGTC AACCAGGGCG CCGGCGACAT GGGCGCGATG GTCAAGGAAG GCTTGCTCCT GCCGGTCGAC GAATACATCA AGAAATATGG TTGGGATAAG CGGCAGTCGG ATTCCGTGCT GGCCAGAGAC CGCTGGGAGG GCGCGAAATT CGGGGTCGGC AAGACCTACG GCATATCGGG TCTCGGCGAG ATCGTCGGCC TCTACTACAA TAAGAAGATC CTCGACGACG CGGGCGTGGC GCTGCCGCAG ACCTTCGAGG AACTGTTGGC CGATCTCGAC AAGCTGAAGG AAAAAGGCGT TGCGCCCTTC ATGATGGGCT CCGCCAAGCA GCATCTTGCC CTGCATATGA TCGGCGCTAT CGATCAAGCG CATATCGACG CGGCCAATCG CGCCGAGCTT GACGACCTGA TCTACGGCAA GGGCGGTTCC TGGAACACCA AAGGCAACAT CGAATCAGCC AAACTCGTGC AGAAATGGGC ACAGGGCGGC TATTTCTACC CCGGTTTCGA GGGCATCTCG GGTGACGACG CCGTCCAGCT GTTCATATCA GGGCAGGGCG CATTTCTGAT CTCCGGAACC TGGTACTTTG GCGACATGCA AAACAATCCG GATATCGGCT TCATGGCCAT TCCCGCGCCG AAGGGTGTCG CCAAACCCAT GAGCGTCGGC GGTGTGGATC TTGCCTGGGC GATAACCAGC CTTGCCAAAG ACAAGGCGAA GCAGGATCTG GCCGGCGAGT ACATCGACTA TATGGTGTCC GAAAAGGCCG CTGAAAGCTG GGCCGCTGCA GGCTATCTTC CTGCAACGTC GCTCCCGGCG GATGCAAAGC CCAAGCTGAC GCCGCTCCTG ACTTCCGGCA TCGAGATGTG GAAGACACTC AACGCCAACG ATGCGCTCGG CCATTACCCC GATTGGTCGA GCCCGACGAT GCTGAAGACA ATCGACGACA ACACGCCACT TCTCCTGTCC GGCAAGATCA CGCCCGAAGC CTTTGTCGAT GCCATGGACA AGGATTATCA GGCCTATTTG AAGGATCAGA AATAA
|
Protein sequence | MNRTVKSISI LPAYRLKAAV ALAGAALLSF GLSAARADDL AVWDDQTFEG QSAVIEQLNK DFEAAHPGVT IKRTARTFDD MKLTLKLAVS AGDGPVITKV NQGAGDMGAM VKEGLLLPVD EYIKKYGWDK RQSDSVLARD RWEGAKFGVG KTYGISGLGE IVGLYYNKKI LDDAGVALPQ TFEELLADLD KLKEKGVAPF MMGSAKQHLA LHMIGAIDQA HIDAANRAEL DDLIYGKGGS WNTKGNIESA KLVQKWAQGG YFYPGFEGIS GDDAVQLFIS GQGAFLISGT WYFGDMQNNP DIGFMAIPAP KGVAKPMSVG GVDLAWAITS LAKDKAKQDL AGEYIDYMVS EKAAESWAAA GYLPATSLPA DAKPKLTPLL TSGIEMWKTL NANDALGHYP DWSSPTMLKT IDDNTPLLLS GKITPEAFVD AMDKDYQAYL KDQK
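
The sequences above are printed in space-separated groups of ten characters, so the spacing has to be removed before any length or composition check. The following is a minimal sketch of that step. The helper names are hypothetical, and the demonstration uses only the first two groups of the gene sequence, so its GC value is not expected to match the 58% reported for the whole gene.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the spacing used for display and normalise to upper case."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two groups of the gene sequence, used here only as a small demo.
first_groups = "ATGAACAGAA CCGTGAAATC"
seq = clean_sequence(first_groups)
print(len(seq))          # 20
print(gc_fraction(seq))  # 0.4
```

Passing the full Gene sequence or Protein sequence field through `clean_sequence` and taking `len` of the result gives a value that can be compared with the Gene Length and Protein Length fields.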
|
| |