Gene Rleg2_5971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5971 
Symbol 
ID6977357 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011366 
Strand
Start bp387086 
End bp388993 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content64% 
IMG OID643393423 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002278241 
Protein GI209546351 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.989436 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.170626 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGACGC GTCGCACCTT TCTGGGCGGC CTTGTCGGCG CGGCAATCGC CCCCGCAGTG 
CTTCGGGCTG AGCCGGCCGG CGAGCCTGAG TTTCTCAAGG AGCGGCTGGC ATCGGGCAGC
CTGCCCCCGA TGGCCGAGCG CATTCCCGCC CGCCCGCGCA TCGTCAACCT GAAGGAGATG
GGGCTTGCAC CCGGCAGCTA CGGCGGCACG GTGCGCACCA TCATCGGCAG CCAGCGCGAC
ATCCGCTTCA TGACGATCTA CGGCTATGCC CGGCTGATCG GCTACAACAA GCACCTGCAG
TTCCAGCCGG ATATCCTCGC CGATTTCCAA TCTGAAGACG ATACGATCTT CACCTTCACG
CTGCGCGAGG GCCATAAGTG GTCGGACGGA GAGCCGTTCA CGGCCGACGA CTTCCGCTAC
TGGTGGGAAG ACGTCATCCT GAACGACAAG CTGACGCCAG GCGGCGGCGC GCTGGAGCTT
CGCCCGCACG GCAGCCTGCC GCGCTTCGAG GTGCTCAATC CGCTGACGGT GCGCTACACC
TGGGAAAAAC CCAACCCGAT GTTCCTGCCG ACGCTGGCAG GGCCGATCCC GCTCGTCATC
GTCGGGCCGG CGCATTATCT CAAGCAGTTC CATAAGAAGT TCCAGCCCGA CCAGGCGAAG
ATGGAACAGA TGATGCAGAC CAACCGCGTC AAGAAATGGC AGGACCTGCA CATCAAGATG
GCCCGCTCCT ACCGGCCGGA GAATCCCAAC TTGCCGACGC TCGATCCCTG GCGCAACACG
ACGGCGCTGC CGGCCGAGCA GTTCGTCTTC GAGCGCAATC CGTTCTTCCA CCGCGTCGAC
GAGACCGGCA GGCAGCTTCC CTATCTCGAC CGGTTCATTC TCAACGTCTC CTCCTCGTCG
ATCATCGCCG CCAAGGCGGG TGCCGGCGAG GCCGACCTGC AGGCGACCGG CATCGACTTC
AACGACTACA CCTTTCTGAA AGAAGCTGAG AAGCGCTTTC CGGTGAAGGT CAATCTCTGG
AAGGTGGCGC GCGGCTCGCG CATCACGCTG CTGCCGAACC TCAACTGCGC CGACGAGGTA
TGGCGCGGCC TTTTCCGCGA CGTGCGTCTG CGCCGCGCCC TGTCGCTGGC AATCGACCGG
CACGAGATCA ACATGGTCGC CTTCTACGGC TTGGGCACGC CGAGCGCCGA TACCGTCCTG
CCCGACAGCC CGCTGTTCAA GCAGGAATAT GCCGATGCCT TCGTGAAGTT CGATGCCGAC
GAGGCCAATC GGCTGCTCGA CGAGATCGGC CTGACCAAGC GCGGCGATGA CGGCATAAGG
CTGCTGCCGG ACGGGCGACG CGCCGAGATC ACCGTCGAAA CCGCCGGCGA GAGCAATCTC
GATACCGACG TGCTGGAACT GGTGCACGAT CACTGGGCCA ATATCGGTCT TGCGCTTTAT
ACCCGCACCT CGCAGCGCGA CGTCTTCCGC AACCGCGCCA TGAGCGGTTC GATCATGATG
TCGATCTGGT ACGGCCTCGA CAATGGTGTG CCTACGGCCG ACATGTCGCC ATCGGGGCTG
GCGCCGACGC TCGACGATCA GCTGCAATGG CCGCTCTGGG GCATGCATTA CCTCTCCGCC
GGCCAGGAGG GCGCAGCCCC CGACCTGCCA GAGGCAGCCG AACTGGTCGA CCTGCTCGGC
CAGTGGGGCT CAACGGCGAA ATTCGAGGAG CGCCAGGTGA TCTGGCACAA GATGCTGTCG
CTCTATACGC AGCAGGTGTT CTCGATCGGG CTGATCAACA GCACATTGCA GCCGGTCCTT
TGCGCCGCCA AGCTGCAGAA CCTGCCGGAG AAAGCCCTCT ACGGCTTCGA TCCCACCTCC
TATCTCGGCA TCTACATGCC GGATGCATTC TGGTACAAGG AGGCCTGA
 
Protein sequence
MVTRRTFLGG LVGAAIAPAV LRAEPAGEPE FLKERLASGS LPPMAERIPA RPRIVNLKEM 
GLAPGSYGGT VRTIIGSQRD IRFMTIYGYA RLIGYNKHLQ FQPDILADFQ SEDDTIFTFT
LREGHKWSDG EPFTADDFRY WWEDVILNDK LTPGGGALEL RPHGSLPRFE VLNPLTVRYT
WEKPNPMFLP TLAGPIPLVI VGPAHYLKQF HKKFQPDQAK MEQMMQTNRV KKWQDLHIKM
ARSYRPENPN LPTLDPWRNT TALPAEQFVF ERNPFFHRVD ETGRQLPYLD RFILNVSSSS
IIAAKAGAGE ADLQATGIDF NDYTFLKEAE KRFPVKVNLW KVARGSRITL LPNLNCADEV
WRGLFRDVRL RRALSLAIDR HEINMVAFYG LGTPSADTVL PDSPLFKQEY ADAFVKFDAD
EANRLLDEIG LTKRGDDGIR LLPDGRRAEI TVETAGESNL DTDVLELVHD HWANIGLALY
TRTSQRDVFR NRAMSGSIMM SIWYGLDNGV PTADMSPSGL APTLDDQLQW PLWGMHYLSA
GQEGAAPDLP EAAELVDLLG QWGSTAKFEE RQVIWHKMLS LYTQQVFSIG LINSTLQPVL
CAAKLQNLPE KALYGFDPTS YLGIYMPDAF WYKEA