Gene Information |
Locus tag | Rleg_2337 |
Symbol | |
ID | 8013330 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | - |
Start bp | 2341145 |
End bp | 2342380 |
Gene Length | 1236 bp |
Protein Length | 411 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644824920 |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_002976150 |
Protein GI | 241205054 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
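The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2341145
end_bp = 2342380
protein_length_aa = 411

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not
# counted in the reported protein length.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp, expected_protein_length)  # prints: 1236 411
```

Under these assumptions the computed values agree with the listed Gene Length (1236 bp) and Protein Length (411 aa).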
Plasmid Coverage Information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.836128 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 31 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATCC GTATATTGGC GGCCGTACTT GCCGCTTCGG TCGCGCTTCC GTTCGGCGCC GCAAATGCCA CCGATCTCGA AGTCACGCAT TGGTGGACTT CGGGCGGCGA ATCAGCTGCC GTCGCCGAAC TGGCAAAGGC ATTCGACGCT ACCGGCAACC ACTGGGTCGA CGGTGCTATC GCCGGTTCCG GCGGCACTGC CCGTCCGATC ATGATCAGCC GCATCACCGG CGGCGACCCA ATGGGCGCCA CCCAGTTCAA CCATGGCCGC CAGGCCGAGG AACTCGTTCA GGCCGGGCTG ATGCGCGATC TGACGGATGT GGCGACTGCC GAGCACTGGA AGGACATCGT TCGTCCGTCG AGCCTGCTCG ATTCCTGCAC CATCGACGGC AAGATCTATT GCGCTCCCGT CAACATCCAC TCCTGGCAGT GGTTGTGGCT GTCGAACGCC GCCTTCAAGA AGGCCGGCGT CGAAGTTCCG AAGAACTGGG ACGAGTTCGT CGCCGCCGCT CCGGCGCTCG AAAAGGCCGG CATCATTCCG CTCGCCGTCG GCGGTCAGCC GTGGCAGGCG ACTGGCGCCT TCGACGTGCT GATGGTTGCC GTCGCCGGCA AGGATACCTT CAACAAGGTC TTCAAGGACA AGGATGCGGA AGTTGCCGCC GGTCCCGAAA TCGCCAAGGT GTTCAAGGCG GCCGACGACG CTCGGCGCAT GGCCAAAGGC AGCAACGTCC AGGATTGGAA CCAAGCCACC AACCTTGTCA TCACAGGCAA GGCCGGCGGT CAGATCATGG GCGACTGGGC GCAGGGTGAG TTCGCGCTCG CCGGTCAGAA GGCCGGTACC GACTATACCT GCCTGCCGGG CCTCGGCGTG AACGAGATCA TCTCGACTGG CGGCGACGCC TTCTACTTCC CGCTGCTGAA GGACGAGGAA AAGTCCAAGG CGCAGGCCGT GCTTGCCAAG ACCCTGCTCG ATCCCAAGAC CCAGGTTGCC TTCAACCTGA AGAAGGGTTC TCTGCCGGTT CGCGGCGACG TCGATCTCGC CGCCGCCAAC GATTGCATGA AGAAGGGTCT CGAAATCCTG GCCAAGGGCA ACGTGATCCA AGGTACCGAC CAGCTGCTTT CGGCCGACAG CCAGAAGCAG AAGGAAGACC TCTTCTCCGA ATTCTTCGCC AACCCATCGA TGACGCCGGA AGACGCTCAG AAGCGTTTCG CCGGGATCAT CGCTTCTGCT GACTGA
|
Protein sequence | MKIRILAAVL AASVALPFGA ANATDLEVTH WWTSGGESAA VAELAKAFDA TGNHWVDGAI AGSGGTARPI MISRITGGDP MGATQFNHGR QAEELVQAGL MRDLTDVATA EHWKDIVRPS SLLDSCTIDG KIYCAPVNIH SWQWLWLSNA AFKKAGVEVP KNWDEFVAAA PALEKAGIIP LAVGGQPWQA TGAFDVLMVA VAGKDTFNKV FKDKDAEVAA GPEIAKVFKA ADDARRMAKG SNVQDWNQAT NLVITGKAGG QIMGDWAQGE FALAGQKAGT DYTCLPGLGV NEIISTGGDA FYFPLLKDEE KSKAQAVLAK TLLDPKTQVA FNLKKGSLPV RGDVDLAAAN DCMKKGLEIL AKGNVIQGTD QLLSADSQKQ KEDLFSEFFA NPSMTPEDAQ KRFAGIIASA D
|
| |
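As a rough way to relate the gene sequence to the protein sequence and the GC content field, the following sketch translates a coding sequence and computes its GC percentage using only the Python standard library. The helper names `translate_cds` and `gc_percent` are hypothetical and not part of any database tool. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in the permitted start codons), so the standard table is used here. The sequence is assumed to be given 5' to 3' on the coding strand, as it is displayed in this record.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G).
# Table 11 shares these codon assignments with the standard code.
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
_CODONS = ["".join(c) for c in product("TCAG", repeat=3)]
CODON_TABLE = dict(zip(_CODONS, _AMINO_ACIDS))


def _clean(seq: str) -> str:
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = _clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in the sequence."""
    dna = _clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed in this record.
print(translate_cds("ATGAAAATCC GTATATTGGC GGCCGTACTT"))  # prints: MKIRILAAVL
```

Passing the full gene sequence to `translate_cds` and `gc_percent` should allow a comparison with the listed protein sequence and the reported GC content.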