Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_7124 |
Symbol | |
ID | 8022491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012858 |
Strand | + |
Start bp | 535610 |
End bp | 537517 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644833960 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002985094 |
Protein GI | 241667010 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.104115 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.2013 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGACGC GTCGCGTCTT TCTCGGCGGC CTCGTCGGTG CTGCGATCGC GCCCGCGGTG CTTCGCGCCG GACAGGCCAG CGAGCCGGAA TTCCTCAAGG AGCGGCTGAC ATCAGGCAGC CTGCCGCCGA TGGCCGAGCG CATTCCCGCC CGCCCGCGTA TCGTCAATCT CAAGGAGATG GGGCTCGAAC CCGGTGCCTA CGGCGGCACG GTGCGCACCA TCATCGGCAG CCAGCGCGAC ATCCGCTTCA TGACGATCTA CGGCTATGCC CGCCTGGTCG GCTACAACAA GCACCTGCAG TTCCAGCCGG ACATCCTGGC TTCCTTCCAG TCCGAGGACG ACACGGTCTT CACCTTCACG CTGCGCGAGG GCCATAAATG GTCCGACGGC CAGCCGTTCA CGGCCGACGA TTTCCGCTAC TGGTGGGAAG ACGTCATCCT GAACGACAAG CTGACGCCCG GCGGCGGCGC GCTGGAGCTT CGTCCGCACG GCAGCCTGCC GCGCTTCGAA ATGCTCGATC CGCTGACCGT GCGCTACACC TGGGAAAAAC CCAACCCGAT GTTCCTGCCG ACGCTGGCCG GCCCGCAGCC GCTCGTCATC TTCGGCCCCG GTCATTATCT CAAGCAGTTC CACAAGAAAT TCCAGCCCGA CCAGGCGAAG ATGGACGAGA TGATGAAGAC CTACCGCGTC AAGAAGTGGC AGGATCTGCA CATCAAGATG GCGCGTTCCT ACCGTCCGGA AAATCCGAAC CTGCCGACGC TCGATCCCTG GCGCAATACG ACGCCGCTGC CGTCCGAGCA GTTCGTCTTC GAGCGTAACC CGTTCTTCCA CCGCGTCGAC GAGACCGGCA GGCAGCTTCC CTATCTCGAC CGTTTCATCC TCAACGTCTC CTCCTCGTCG ATCATCGCCG CCAAGGCCGG TGCGGGCGAA GCCGACCTGC AGGTAACCGG CATCGATTTC AACGACTATA CCTTCCTGAA GGAGGCCGAG AAGCGCTTCC CGGTGAAGGT CAATCTCTGG AAGCTCGCGC GCGGCTCGCG CATCACGCTG CTGCCGAACC TCAACTGCGC CGACGAGGTA TGGCGCGGCC TCTTCCGCGA CGTGCGCCTG CGCCGCGCTC TGTCGCTGGC GATCGACCGG CACGAGGTCA ACATGGTCGC CTTTTACGGC CTCGGCACGC CAAGCGCCGA TACCGTCCTG CCCGACAGCC CGCTCTTCAA GCAGGAATAT GCCGACGCCT TCGTGAAGTT CGATCCCGAC GAGGCCAACC GTCTGCTCGA CGAGCTCGGC TTGACCAAAC GCGGCGACGA CGGCATGCGG CTGCTGCCCG ACGGGCGGCG CGCCGAGATC ACCGTCGAGA CCGCCGGCGA GAGCAATCTC GACACCGACG TGCTGGAGCT GGTGCACGAC CACTGGGCCA ATATCGGCCT GGCGCTCTAT ACCCGGACCT CGCAGCGCGA CGTCTTCCGC AACCGCGCCA TGAGCGGTTC GATCATGATG TCGATCTGGT ACGGCCTCGA CAATGGCGTG CCGACGGCCG ACATGTCGCC GGCAGGCCTT GCGCCGACGC TCGACGATCA GCTGCAATGG CCGCTCTGGG GCATGCATTA CCTTTCCGCC GGCCAGGAGG GCGTCGCACC GGATCTGCCG GAGGCAGCCG AACTCGTCGA CCTGCTCAGC CAATGGGGCT CGACGGCGAA ATTCGAGGAG CGCCAGCTGA TCTGGCACAA GATGCTGGCG CTCTATACGC AGCAGGTGTT CTCGATCGGC CTCATCAACA GCACGCTGCA GCCCGTTCTT TGCGCCGCCA AACTGCAGAA CCTGCCGGAG AAAGCGCTCT ACGGCTTCGA TCCCACCTCC TATCTCGGCG TCTACATGCC GGATGCATTC TGGTACAAGG AGGCCTGA
|
Protein sequence | MVTRRVFLGG LVGAAIAPAV LRAGQASEPE FLKERLTSGS LPPMAERIPA RPRIVNLKEM GLEPGAYGGT VRTIIGSQRD IRFMTIYGYA RLVGYNKHLQ FQPDILASFQ SEDDTVFTFT LREGHKWSDG QPFTADDFRY WWEDVILNDK LTPGGGALEL RPHGSLPRFE MLDPLTVRYT WEKPNPMFLP TLAGPQPLVI FGPGHYLKQF HKKFQPDQAK MDEMMKTYRV KKWQDLHIKM ARSYRPENPN LPTLDPWRNT TPLPSEQFVF ERNPFFHRVD ETGRQLPYLD RFILNVSSSS IIAAKAGAGE ADLQVTGIDF NDYTFLKEAE KRFPVKVNLW KLARGSRITL LPNLNCADEV WRGLFRDVRL RRALSLAIDR HEVNMVAFYG LGTPSADTVL PDSPLFKQEY ADAFVKFDPD EANRLLDELG LTKRGDDGMR LLPDGRRAEI TVETAGESNL DTDVLELVHD HWANIGLALY TRTSQRDVFR NRAMSGSIMM SIWYGLDNGV PTADMSPAGL APTLDDQLQW PLWGMHYLSA GQEGVAPDLP EAAELVDLLS QWGSTAKFEE RQLIWHKMLA LYTQQVFSIG LINSTLQPVL CAAKLQNLPE KALYGFDPTS YLGVYMPDAF WYKEA
|
| |