Gene Information |
Locus tag | Rleg2_2646 |
Symbol | |
ID | 6981389 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2692874 |
End bp | 2694133 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643397358 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002282143 |
Protein GI | 209550226 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| |
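The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption that matches the reported 1260 bp) and that the last codon of the CDS is a stop codon. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(2692874, 2694133)
print(length, protein_length(length))  # 1260 419
```

Both values agree with the Gene Length and Protein Length rows of the record.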
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0628324 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACA CTGCTCTGAA AAGCTTCCTG CTTGCCTCCA GCTTGCTGAC ATCGGCAGGT CTCGTCCATG CCGCCGACGT CACGCTGACT GTGGAAAGCT GGCGTAACGA CGACCTGCAG ATCTGGCAGG AGAAGATCAT CCCGGCTTTC GAAGCCAAGA ACCCGGGCAT CAAGATCGTC TTCTCGCCGA CCGCGCCGAC CGAATACAAC GCGTCGCTGA ACGCCAAGCT GGATGCCGGT TCCGCAGGCG ATATCATCAC CTGCCGTCCG TTCGACGCCT CGCTCGAACT CTTCAACAAG AAGCAACTCG TCGACATCAC CAGCCTGCCC GGTATGGAGA ACTTCTCGCC GGTCGCCAAG GCCGCCTGGT CGACCGACGA CGGCAAGTCG ACCTTCTGCG TGCCGATGGC TTCGGTCATC CACGGTTTCA TCTACAACAA GGATGCCTTC GACAAGCTCG GCATCTCCGT GCCGAAGACG CAGGACGAAT TCTACGCGGC GCTCGACAAG ATCAAGGCCG ACGGCACCTA TATCCCGCTC GCCATGGGCA CGAAGGACCT CTGGGAAGCC GCAACCATGG GCTACCAGAA CATCGGCCCG AATTACTGGA AGGGCGAGGA CGGCCGCGAC GCCCTGATCG CCGGCAAGCA GAAGCTGACC GATGCCGACT GGGTCAAGCC CTATGAAGAG CTTGCCAAGT GGAAGCCCTA TCTCGGCGAC GGTTTCGAAG CCCAGACCTA TTCGGACAGT CAAAACCTCT TCACCCTCGG ACGCGCCGCC ATCTATCCGG CCGGTTCCTG GGAAATCGCG CTTTTCAACA CGCAGGCGCA GTTCAAGATG GGCGCCTTCC CGCCGCCGGT TCCGAAGGCC GGCGACCAGG GCTACATCTC CGACCATCCG GATATCGGCG TCGCCCTGAA TGCCAAGAGC AAGCATGCCG AGGAAGCCAA GAAATTCCTC AGCTGGGTCG CTTCGCCCGA GTTCGCCGAC ATTTACGCCA ACTCCCTGCC GGGCTTCTTC AGCCTGAACT CCAACCCCGT CAAGATGTCC GATCCGCTTG CTCAGGAATT CGTTTCCTGG CGCGGCCCGT ACAAGTCGAC CGTGCGCTCG ACCTACCAGA TCCTGTCGCG CGGCACGCCG AACCTCGAAA ACGAGACCTG GGTCGAATCG GCCAACGTGA TCAACGGCAC GGATACGCCG AAGGTCGCTG CCGAGAAGCT GCAGAAGGGC CTCGACAGCT GGTACAAGCC GGCCAAGTGA
Protein sequence | MKNTALKSFL LASSLLTSAG LVHAADVTLT VESWRNDDLQ IWQEKIIPAF EAKNPGIKIV FSPTAPTEYN ASLNAKLDAG SAGDIITCRP FDASLELFNK KQLVDITSLP GMENFSPVAK AAWSTDDGKS TFCVPMASVI HGFIYNKDAF DKLGISVPKT QDEFYAALDK IKADGTYIPL AMGTKDLWEA ATMGYQNIGP NYWKGEDGRD ALIAGKQKLT DADWVKPYEE LAKWKPYLGD GFEAQTYSDS QNLFTLGRAA IYPAGSWEIA LFNTQAQFKM GAFPPPVPKA GDQGYISDHP DIGVALNAKS KHAEEAKKFL SWVASPEFAD IYANSLPGFF SLNSNPVKMS DPLAQEFVSW RGPYKSTVRS TYQILSRGTP NLENETWVES ANVINGTDTP KVAAEKLQKG LDSWYKPAK
| |
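The relationship between the two sequences above can be illustrated with a short translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TGA, even though the gene lies on the minus strand) and uses the standard codon assignments, which translation table 11 shares for elongation; alternative start codons are not handled because this gene starts with ATG. The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino-acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-orientation DNA sequence, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
print(translate("ATGAAAAACA CTGCTCTGAA AAGCTTCCTG"))  # MKNTALKSFL
```

The ten residues printed match the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the Protein sequence and GC content rows to be checked in the same way.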