Gene Rleg2_5304 details

Gene Information

Locus tag: Rleg2_5304
Symbol:
ID: 6978398
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011368
Strand:
Start bp: 933377
End bp: 934987
Gene Length: 1611 bp
Protein Length: 536 aa
Translation table: 11
GC content: 59%
IMG OID: 643394408
Product: extracellular solute-binding protein family 5
Protein accession: YP_002279226
Protein GI: 209547308
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
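
The Start bp, End bp, Gene Length and Protein Length fields above agree with one another if the coordinates are read as 1-based and inclusive and the CDS ends in a single stop codon. Both readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic follows; the function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(933377, 934987)
print(length)                     # 1611
print(protein_length_aa(length))  # 536
```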


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.85757
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAGCA AGATTACCAA CTGGACCCGC TCTGACGACG CCATGATCGA AACCGCCATC 
CGTCGTGGCG CGACCCGCCG CGAGTTGCTG CATATGATGC TGGCGGGCGG CGTGGCCATG
TCCGCCGGCG GGCTCGTGCT CGGCCGCGCC GGCAAGGCGC TTGCGGCGAC GCCGGTTTCC
AGCGGTTCGC TCAAGGCAGC CGGCTGGTCG TCCTCGACGG CCGATACGCT CGACCCCGCC
AAGGCGTCGC TCTCCACCGA TTATGTCCGG TGCTGCTCCT TCTATAACCG CCTCACCATC
CTCGACCAGA GCGGCAAGCC GCAGATGGAG CTTGCCGACG CGATCGAGTC CAAGGATGCG
AAGACCTGGA CGGTCAAGCT GAAGAGCGGC GTCACATTTC ACGACGGCAA ACCGCTGACA
TCAGACGACG TGGTCTTCTC ACTGAAGCGC CATCTCGACC CCGCGGTCGG CTCGAAGGTC
GCCAAGATTG CCGCCCAGAT GACTGGCTTC AAGGCAGTCG ATAAGCAGAC CGTCGAGATT
ACCCTTGCCG ATGCAAACGC GGACTTGCCG ACCATCCTGT CGTTACACCA CTTCATGATT
GTCGCAGATG GCACCACCGA CTTTTCGAAG GCGAACGGCA CCGGTGCTTT CGTCAAGGAG
GTCTTCGAGC CAGGTGTGCG CTCGGTCGGC ATCAAGAACA AGAACTACTG GAAATCGGGC
CCGAACGTCG ATTCCTTTGA ATATTTCGCG ATCAGCGACG ACAATGCCCG CGTCAACGCA
CTGCTGGCCG GCGATATCCA CCTCGCCGCC TCGATCAATC CGCGCTCGAT GCGCCTCATC
GAGGCCCAGG GCGATGGGTT TACCTTGTCG AAGACGACCT CCGGCAACTA TACCAACCTC
AATATGCGGA TGGATATGGA ACCCGGCAAT AAGCAAGATT TCATCGAAGG CATGAAGTCC
CTTGTCAACC GCGAACAGAT CGTCAAGTCG GCGCTGCGCG GTCTCGGCGA GGTTGGCAAC
GACCAACCCA TTTCTCCGGC GAACTTCTAT CATGATGCGG ACTTGAAAGC GCGGGGCTTC
GATCCCGAAA AGGCGAAGTT CCACTTCGAA AAAGCAGGCG TCCTCGGCCA ATCGATTCCG
ATTATCGCTT CGGATGCGGC GAACTCGTCG ATCGACATGG CCATGATCAT CCAGGCGGCC
GGCGCCGAGA TAGGGATGAA GCTCGATGTC CAGCGCGTCC CCGCCGACGG CTACTGGGAC
AATTATTGGC TTAAGGCGCC TATTCACTTC GGCAATGTCA ATCCTCGTCC AACACCGGAC
ATTTTGTTCT CGTTGTTCTA CACCTCTCAA GCGCCCTGGA ATGAAAGCCG CTACAAGTCT
GAAAAATTCG ACAAGATGCT GATCGAGGCG CGCGGTTCGC TCGACCAGGA GAAGCGCAAG
ACGATCTACA ACGAGATGCA GGTTATGGTC GCTCAGGAAG CCGGCACCAT TATTCCAGCC
TATCTATCGA ATGTCGATGC CACCACTGCC AAGCTCAAGG GCTTGCTACC CAGCCCCCTT
GGCGGCCAGA TGGGATACGC GTTTGCCGAA TATGTCTGGC TCGAAGCCTG A
 
Protein sequence
MNSKITNWTR SDDAMIETAI RRGATRRELL HMMLAGGVAM SAGGLVLGRA GKALAATPVS 
SGSLKAAGWS SSTADTLDPA KASLSTDYVR CCSFYNRLTI LDQSGKPQME LADAIESKDA
KTWTVKLKSG VTFHDGKPLT SDDVVFSLKR HLDPAVGSKV AKIAAQMTGF KAVDKQTVEI
TLADANADLP TILSLHHFMI VADGTTDFSK ANGTGAFVKE VFEPGVRSVG IKNKNYWKSG
PNVDSFEYFA ISDDNARVNA LLAGDIHLAA SINPRSMRLI EAQGDGFTLS KTTSGNYTNL
NMRMDMEPGN KQDFIEGMKS LVNREQIVKS ALRGLGEVGN DQPISPANFY HDADLKARGF
DPEKAKFHFE KAGVLGQSIP IIASDAANSS IDMAMIIQAA GAEIGMKLDV QRVPADGYWD
NYWLKAPIHF GNVNPRPTPD ILFSLFYTSQ APWNESRYKS EKFDKMLIEA RGSLDQEKRK
TIYNEMQVMV AQEAGTIIPA YLSNVDATTA KLKGLLPSPL GGQMGYAFAE YVWLEA
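
The sequences above are printed in space-separated groups of 10 characters, so the whitespace has to be removed before lengths or base composition can be checked against the Gene Information fields. The Python sketch below shows one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as input to keep the example short.

```python
def clean_sequence(block: str) -> str:
    """Join a grouped, multi-line sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string (0.0 for empty input)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as printed (grouped in tens).
first_line = "ATGAACAGCA AGATTACCAA CTGGACCCGC TCTGACGACG CCATGATCGA AACCGCCATC"
seq = clean_sequence(first_line)
print(len(seq))                      # 60
print(f"{gc_fraction(seq):.1%}")     # GC share of this 60-base fragment only
```

Passing the complete gene sequence block through the same two functions should give a length and GC percentage that can be compared with the Gene Length and GC content fields.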