Gene Rleg2_5780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5780 
Symbol 
ID6977169 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011366 
Strand
Start bp190798 
End bp192444 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content59% 
IMG OID643393235 
Productextracellular solute-binding protein family 5 
Protein accessionYP_002278053 
Protein GI209546163 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0222518 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0795178 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT ATCTTCTTGC CGCCGCCGCA CTGACGCTGC TTTCGGGCTC CGCCATGGCG 
CAGACGGTCC TGACGGCGAA TATCGAGCCG GCGACGACCT GGGTTCGCAA CTTCAATCCG
TTCAACCAAA CCTCGTCGCG CCAGTCGACG CTCGACTTCA TCTACGAGCC GCTGGTTATT
TTCAACCGCT TCGACAGCAA CAAGCCGGTC TATCGCCTGG CCGAAAGCTT CACGCTCTCC
GATGATCTGA AGAGCATCGA TTTCAAGCTG CGCCCGAACC TGAAATGGTC TGATGGTAAG
CCGCTGACAT CAGCCGACGT CAAGTTCACC TATGATTATC TGAAGAAATT CCCGGCGCTC
GACTTCGTCA GCATCTGGAG CTTCATCACC GATGTCAAAG CCGTCGACGG CCAGACGGTG
CGCTTCACGC TCGCCAATCC GAGTTCGCTG GCTGCCGAGC AGATCTCGCA GCTGCCGATC
GTTCCGGAAC ATGTCTGGAA GGACGTCGCC GATCCGGTCA CTTTCGCCAA CGAGAACCCG
GTCGGCAGCG GCCCGCTGAC CGAGGTTCCG CGCTTCACCG GCCAGACCTA CGACCAGTGC
CGCAACCCGA ACTATTGGGA CAATGCGCAT CTGAAGGTCG ATTGCATGCG CTTCCCGCAG
CTTGCCGACA ACAACCAGAT GCTGACGGCA ACAGCCGACG GCACGCTCGA CTGGGGCGTC
TCCTTCATTC CCGATATCGA CAATGTCTAT GTGTCCAAGG ATCCGGCGCA TTTCCACTAT
TGGTATTCGC CGAGCAGCAT GGTCGCCTTC CTGTTCAACC TGGAAACGGC GAACGAGAAC
AATAAGAAGG CCTTCATCGA CCTGAAATTC CGCCGTGCCG TCTCCATGGC GCTCGACCGC
AAGACGATGA TCGATGTCGC CGGCTACGGC TATCCGACGC TGAACGAAGA CCCCGGCCTG
ATGGGCGAGC TTTACAAGAG CTGGGCAGAC CCCTCCGTCA AATCAGACTT CGGCAAGTTC
GCGACCTATG ACGCCGACGC TGCCAAGGCC CTGCTCGACG AGGCGGGTTA CAAGGACAAG
GACGGCGACG GCTTCCGCGA CAACCCCGAC GGCAGCAAGA TCTCTTTCTC GATCATCGTC
CCGAGCGCCT GGACCGACTG GATCGACACC GTCAATCTCG CCGTCGAAGG CATGCAGGCG
GTCGGCATCG ACGCCAAGAT CGAAACGCCT GAAGAAGCCG TCTGGACCGG CAACCTCATC
AACGGCACCT TCGATGCGGC GATCAACAGC CTGCCGGCAT CGGCTTCGCC CTATTATCCC
TACAAGCGCG CCTTCAGCGC TTCGGACAAG GGCAAGACCC GCTTCACCGC GCAGCGCTGG
TTCAATCCCG AGGTCGAGAA GCTCGTCACC GAGTTCACCC AGACGGCGGA TCTTGCCAAG
CAGAAGGACG CGATGAACAA GGCGCAACGC ATCGTCGCCG AAAACATGCC GATGATTCCG
GTGTTCAACA ATCCGAACTG GTATCAGTAC AACACCAAGC GCTTCACCGG CTGGTCGACC
AAGGAAAACC CCTTCGTCAA TCCGTCGATC TCGCGGACCA ACCCGGCACG CCTCTTGAAC
CTGCTCGCGC TCGAGCCGGT CAAGTAA
 
Protein sequence
MKKYLLAAAA LTLLSGSAMA QTVLTANIEP ATTWVRNFNP FNQTSSRQST LDFIYEPLVI 
FNRFDSNKPV YRLAESFTLS DDLKSIDFKL RPNLKWSDGK PLTSADVKFT YDYLKKFPAL
DFVSIWSFIT DVKAVDGQTV RFTLANPSSL AAEQISQLPI VPEHVWKDVA DPVTFANENP
VGSGPLTEVP RFTGQTYDQC RNPNYWDNAH LKVDCMRFPQ LADNNQMLTA TADGTLDWGV
SFIPDIDNVY VSKDPAHFHY WYSPSSMVAF LFNLETANEN NKKAFIDLKF RRAVSMALDR
KTMIDVAGYG YPTLNEDPGL MGELYKSWAD PSVKSDFGKF ATYDADAAKA LLDEAGYKDK
DGDGFRDNPD GSKISFSIIV PSAWTDWIDT VNLAVEGMQA VGIDAKIETP EEAVWTGNLI
NGTFDAAINS LPASASPYYP YKRAFSASDK GKTRFTAQRW FNPEVEKLVT EFTQTADLAK
QKDAMNKAQR IVAENMPMIP VFNNPNWYQY NTKRFTGWST KENPFVNPSI SRTNPARLLN
LLALEPVK