Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5971 |
Symbol | |
ID | 6977357 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | + |
Start bp | 387086 |
End bp | 388993 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643393423 |
Product | extracellular solute-binding protein family 5 |
Protein accession | YP_002278241 |
Protein GI | 209546351 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.989436 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.170626 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTGACGC GTCGCACCTT TCTGGGCGGC CTTGTCGGCG CGGCAATCGC CCCCGCAGTG CTTCGGGCTG AGCCGGCCGG CGAGCCTGAG TTTCTCAAGG AGCGGCTGGC ATCGGGCAGC CTGCCCCCGA TGGCCGAGCG CATTCCCGCC CGCCCGCGCA TCGTCAACCT GAAGGAGATG GGGCTTGCAC CCGGCAGCTA CGGCGGCACG GTGCGCACCA TCATCGGCAG CCAGCGCGAC ATCCGCTTCA TGACGATCTA CGGCTATGCC CGGCTGATCG GCTACAACAA GCACCTGCAG TTCCAGCCGG ATATCCTCGC CGATTTCCAA TCTGAAGACG ATACGATCTT CACCTTCACG CTGCGCGAGG GCCATAAGTG GTCGGACGGA GAGCCGTTCA CGGCCGACGA CTTCCGCTAC TGGTGGGAAG ACGTCATCCT GAACGACAAG CTGACGCCAG GCGGCGGCGC GCTGGAGCTT CGCCCGCACG GCAGCCTGCC GCGCTTCGAG GTGCTCAATC CGCTGACGGT GCGCTACACC TGGGAAAAAC CCAACCCGAT GTTCCTGCCG ACGCTGGCAG GGCCGATCCC GCTCGTCATC GTCGGGCCGG CGCATTATCT CAAGCAGTTC CATAAGAAGT TCCAGCCCGA CCAGGCGAAG ATGGAACAGA TGATGCAGAC CAACCGCGTC AAGAAATGGC AGGACCTGCA CATCAAGATG GCCCGCTCCT ACCGGCCGGA GAATCCCAAC TTGCCGACGC TCGATCCCTG GCGCAACACG ACGGCGCTGC CGGCCGAGCA GTTCGTCTTC GAGCGCAATC CGTTCTTCCA CCGCGTCGAC GAGACCGGCA GGCAGCTTCC CTATCTCGAC CGGTTCATTC TCAACGTCTC CTCCTCGTCG ATCATCGCCG CCAAGGCGGG TGCCGGCGAG GCCGACCTGC AGGCGACCGG CATCGACTTC AACGACTACA CCTTTCTGAA AGAAGCTGAG AAGCGCTTTC CGGTGAAGGT CAATCTCTGG AAGGTGGCGC GCGGCTCGCG CATCACGCTG CTGCCGAACC TCAACTGCGC CGACGAGGTA TGGCGCGGCC TTTTCCGCGA CGTGCGTCTG CGCCGCGCCC TGTCGCTGGC AATCGACCGG CACGAGATCA ACATGGTCGC CTTCTACGGC TTGGGCACGC CGAGCGCCGA TACCGTCCTG CCCGACAGCC CGCTGTTCAA GCAGGAATAT GCCGATGCCT TCGTGAAGTT CGATGCCGAC GAGGCCAATC GGCTGCTCGA CGAGATCGGC CTGACCAAGC GCGGCGATGA CGGCATAAGG CTGCTGCCGG ACGGGCGACG CGCCGAGATC ACCGTCGAAA CCGCCGGCGA GAGCAATCTC GATACCGACG TGCTGGAACT GGTGCACGAT CACTGGGCCA ATATCGGTCT TGCGCTTTAT ACCCGCACCT CGCAGCGCGA CGTCTTCCGC AACCGCGCCA TGAGCGGTTC GATCATGATG TCGATCTGGT ACGGCCTCGA CAATGGTGTG CCTACGGCCG ACATGTCGCC ATCGGGGCTG GCGCCGACGC TCGACGATCA GCTGCAATGG CCGCTCTGGG GCATGCATTA CCTCTCCGCC GGCCAGGAGG GCGCAGCCCC CGACCTGCCA GAGGCAGCCG AACTGGTCGA CCTGCTCGGC CAGTGGGGCT CAACGGCGAA ATTCGAGGAG CGCCAGGTGA TCTGGCACAA GATGCTGTCG CTCTATACGC AGCAGGTGTT CTCGATCGGG CTGATCAACA GCACATTGCA GCCGGTCCTT TGCGCCGCCA AGCTGCAGAA CCTGCCGGAG AAAGCCCTCT ACGGCTTCGA TCCCACCTCC TATCTCGGCA TCTACATGCC GGATGCATTC TGGTACAAGG AGGCCTGA
|
Protein sequence | MVTRRTFLGG LVGAAIAPAV LRAEPAGEPE FLKERLASGS LPPMAERIPA RPRIVNLKEM GLAPGSYGGT VRTIIGSQRD IRFMTIYGYA RLIGYNKHLQ FQPDILADFQ SEDDTIFTFT LREGHKWSDG EPFTADDFRY WWEDVILNDK LTPGGGALEL RPHGSLPRFE VLNPLTVRYT WEKPNPMFLP TLAGPIPLVI VGPAHYLKQF HKKFQPDQAK MEQMMQTNRV KKWQDLHIKM ARSYRPENPN LPTLDPWRNT TALPAEQFVF ERNPFFHRVD ETGRQLPYLD RFILNVSSSS IIAAKAGAGE ADLQATGIDF NDYTFLKEAE KRFPVKVNLW KVARGSRITL LPNLNCADEV WRGLFRDVRL RRALSLAIDR HEINMVAFYG LGTPSADTVL PDSPLFKQEY ADAFVKFDAD EANRLLDEIG LTKRGDDGIR LLPDGRRAEI TVETAGESNL DTDVLELVHD HWANIGLALY TRTSQRDVFR NRAMSGSIMM SIWYGLDNGV PTADMSPSGL APTLDDQLQW PLWGMHYLSA GQEGAAPDLP EAAELVDLLG QWGSTAKFEE RQVIWHKMLS LYTQQVFSIG LINSTLQPVL CAAKLQNLPE KALYGFDPTS YLGIYMPDAF WYKEA
|
| |