Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_2098 |
Symbol | |
ID | 6980837 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2159437 |
End bp | 2160672 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 643396820 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002281608 |
Protein GI | 209549691 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.072788 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTC GTATTATGGC GGCCCTGCTT GCCGCTTCGG TCGCGCTTCC GTTCGGCGCC GCCAATGCTA CCGATCTCGA AGTCACGCAT TGGTGGACTT CAGGCGGCGA ATCGGCTGCT GTCGCCGAGT TGGCAAAGGC ATTCGACGCC ACCGGCAACC ACTGGGTCGA CGGCGCGATC GCCGGTTCCG GCGGCACCGC TCGTCCGATC ATGATCAGCC GCATCACCGG CGGCGACCCG ATGGGCGCCA CCCAGTTCAA TCACGGCCGC CAGGCTGAGG AGCTCGTTCA GGCTGGCCTG ATGCGCGACC TGACCGACGT GGCGACTGCC GAGCACTGGA AGGACATCAT CCGCCCGGCG AGCCTGCTCG ATTCCTGCAC GATCGACGGC AAGATCTATT GCGCTCCAGT CAACATCCAC TCCTGGCAGT GGCTGTGGCT GTCGAATGCC GCCTTCAAGA AGGCCGGCGT CGAGGTTCCG AAGAACTGGG ACGAGTTCGT CGCTGCCGCT CCGGCTCTCG AAAAGGCCGG CATCATTCCG CTCGCCGTCG GCGGTCAGCC GTGGCAGGCG ACAGGCGCCT TCGACGTGCT GATGGTCGCG GTTGCCGGCA AGGATACCTT CAACAAGGTT TTCAAGGACA AGGATGCGGA AGTTGCCGCC GGTCCTGAAA TCGCCAAGGT GTTCAAGGCC GCGGACGATG CGCGGCGCAT GGCCAAAGGC AGCAACGTAC AGGACTGGAA CCAGGCCACC AACCTCGTCA TCACAGGCAA GGCCGGCGGT CAGATCATGG GCGACTGGGC GCAGGGTGAA TTCGCGCTCG CCGGTCAGAA GGCCGGCACC GACTACACCT GCCTGCCGGG CCTCGGCGTG AATGAGATCA TCTCGACCGG CGGCGATGCC TTCTACTTCC CGCTGCTGAA GGACGAGGAA AAATCCAAGG CGCAGGCCGT GCTTGCCAAG ACCCTGCTCG ATCCCAAGAC CCAGGTTGCC TTCAACCTGA AAAAGGGCTC GCTGCCGGTT CGCGGCGATG TCGATCTCGC CGCCGCCAAC GACTGCATGA AGAAGGGTCT CGACATCCTC GCCAAGGGCA ACGTGATCCA GGGTACCGAC CAGTTGCTTT CGGCCGACAG CCAGAAGCAA AAGGAAGACC TCTTCTCCGA ATTTTTCGCC AACCCGTCGA TGACGCCGGA GGACGCTCAG AAGCGTTTCG CCAAAATCAT CGCTTCGGCT GACTGA
|
Protein sequence | MKIRIMAALL AASVALPFGA ANATDLEVTH WWTSGGESAA VAELAKAFDA TGNHWVDGAI AGSGGTARPI MISRITGGDP MGATQFNHGR QAEELVQAGL MRDLTDVATA EHWKDIIRPA SLLDSCTIDG KIYCAPVNIH SWQWLWLSNA AFKKAGVEVP KNWDEFVAAA PALEKAGIIP LAVGGQPWQA TGAFDVLMVA VAGKDTFNKV FKDKDAEVAA GPEIAKVFKA ADDARRMAKG SNVQDWNQAT NLVITGKAGG QIMGDWAQGE FALAGQKAGT DYTCLPGLGV NEIISTGGDA FYFPLLKDEE KSKAQAVLAK TLLDPKTQVA FNLKKGSLPV RGDVDLAAAN DCMKKGLDIL AKGNVIQGTD QLLSADSQKQ KEDLFSEFFA NPSMTPEDAQ KRFAKIIASA D
|
| |