Gene Rleg_7124 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_7124 
Symbol 
ID8022491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012858 
Strand
Start bp535610 
End bp537517 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content64% 
IMG OID644833960 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002985094 
Protein GI241667010 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.104115 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.2013 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGACGC GTCGCGTCTT TCTCGGCGGC CTCGTCGGTG CTGCGATCGC GCCCGCGGTG 
CTTCGCGCCG GACAGGCCAG CGAGCCGGAA TTCCTCAAGG AGCGGCTGAC ATCAGGCAGC
CTGCCGCCGA TGGCCGAGCG CATTCCCGCC CGCCCGCGTA TCGTCAATCT CAAGGAGATG
GGGCTCGAAC CCGGTGCCTA CGGCGGCACG GTGCGCACCA TCATCGGCAG CCAGCGCGAC
ATCCGCTTCA TGACGATCTA CGGCTATGCC CGCCTGGTCG GCTACAACAA GCACCTGCAG
TTCCAGCCGG ACATCCTGGC TTCCTTCCAG TCCGAGGACG ACACGGTCTT CACCTTCACG
CTGCGCGAGG GCCATAAATG GTCCGACGGC CAGCCGTTCA CGGCCGACGA TTTCCGCTAC
TGGTGGGAAG ACGTCATCCT GAACGACAAG CTGACGCCCG GCGGCGGCGC GCTGGAGCTT
CGTCCGCACG GCAGCCTGCC GCGCTTCGAA ATGCTCGATC CGCTGACCGT GCGCTACACC
TGGGAAAAAC CCAACCCGAT GTTCCTGCCG ACGCTGGCCG GCCCGCAGCC GCTCGTCATC
TTCGGCCCCG GTCATTATCT CAAGCAGTTC CACAAGAAAT TCCAGCCCGA CCAGGCGAAG
ATGGACGAGA TGATGAAGAC CTACCGCGTC AAGAAGTGGC AGGATCTGCA CATCAAGATG
GCGCGTTCCT ACCGTCCGGA AAATCCGAAC CTGCCGACGC TCGATCCCTG GCGCAATACG
ACGCCGCTGC CGTCCGAGCA GTTCGTCTTC GAGCGTAACC CGTTCTTCCA CCGCGTCGAC
GAGACCGGCA GGCAGCTTCC CTATCTCGAC CGTTTCATCC TCAACGTCTC CTCCTCGTCG
ATCATCGCCG CCAAGGCCGG TGCGGGCGAA GCCGACCTGC AGGTAACCGG CATCGATTTC
AACGACTATA CCTTCCTGAA GGAGGCCGAG AAGCGCTTCC CGGTGAAGGT CAATCTCTGG
AAGCTCGCGC GCGGCTCGCG CATCACGCTG CTGCCGAACC TCAACTGCGC CGACGAGGTA
TGGCGCGGCC TCTTCCGCGA CGTGCGCCTG CGCCGCGCTC TGTCGCTGGC GATCGACCGG
CACGAGGTCA ACATGGTCGC CTTTTACGGC CTCGGCACGC CAAGCGCCGA TACCGTCCTG
CCCGACAGCC CGCTCTTCAA GCAGGAATAT GCCGACGCCT TCGTGAAGTT CGATCCCGAC
GAGGCCAACC GTCTGCTCGA CGAGCTCGGC TTGACCAAAC GCGGCGACGA CGGCATGCGG
CTGCTGCCCG ACGGGCGGCG CGCCGAGATC ACCGTCGAGA CCGCCGGCGA GAGCAATCTC
GACACCGACG TGCTGGAGCT GGTGCACGAC CACTGGGCCA ATATCGGCCT GGCGCTCTAT
ACCCGGACCT CGCAGCGCGA CGTCTTCCGC AACCGCGCCA TGAGCGGTTC GATCATGATG
TCGATCTGGT ACGGCCTCGA CAATGGCGTG CCGACGGCCG ACATGTCGCC GGCAGGCCTT
GCGCCGACGC TCGACGATCA GCTGCAATGG CCGCTCTGGG GCATGCATTA CCTTTCCGCC
GGCCAGGAGG GCGTCGCACC GGATCTGCCG GAGGCAGCCG AACTCGTCGA CCTGCTCAGC
CAATGGGGCT CGACGGCGAA ATTCGAGGAG CGCCAGCTGA TCTGGCACAA GATGCTGGCG
CTCTATACGC AGCAGGTGTT CTCGATCGGC CTCATCAACA GCACGCTGCA GCCCGTTCTT
TGCGCCGCCA AACTGCAGAA CCTGCCGGAG AAAGCGCTCT ACGGCTTCGA TCCCACCTCC
TATCTCGGCG TCTACATGCC GGATGCATTC TGGTACAAGG AGGCCTGA
 
Protein sequence
MVTRRVFLGG LVGAAIAPAV LRAGQASEPE FLKERLTSGS LPPMAERIPA RPRIVNLKEM 
GLEPGAYGGT VRTIIGSQRD IRFMTIYGYA RLVGYNKHLQ FQPDILASFQ SEDDTVFTFT
LREGHKWSDG QPFTADDFRY WWEDVILNDK LTPGGGALEL RPHGSLPRFE MLDPLTVRYT
WEKPNPMFLP TLAGPQPLVI FGPGHYLKQF HKKFQPDQAK MDEMMKTYRV KKWQDLHIKM
ARSYRPENPN LPTLDPWRNT TPLPSEQFVF ERNPFFHRVD ETGRQLPYLD RFILNVSSSS
IIAAKAGAGE ADLQVTGIDF NDYTFLKEAE KRFPVKVNLW KLARGSRITL LPNLNCADEV
WRGLFRDVRL RRALSLAIDR HEVNMVAFYG LGTPSADTVL PDSPLFKQEY ADAFVKFDPD
EANRLLDELG LTKRGDDGMR LLPDGRRAEI TVETAGESNL DTDVLELVHD HWANIGLALY
TRTSQRDVFR NRAMSGSIMM SIWYGLDNGV PTADMSPAGL APTLDDQLQW PLWGMHYLSA
GQEGVAPDLP EAAELVDLLS QWGSTAKFEE RQLIWHKMLA LYTQQVFSIG LINSTLQPVL
CAAKLQNLPE KALYGFDPTS YLGVYMPDAF WYKEA