Gene Rleg2_5566 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5566 
Symbol 
ID6978660 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp1215093 
End bp1216028 
Gene Length936 bp 
Protein Length311 aa 
Translation table11 
GC content64% 
IMG OID643394664 
Productextracellular solute-binding protein family 3 
Protein accessionYP_002279482 
Protein GI209547564 
COG category[E] Amino acid transport and metabolism
[T] Signal transduction mechanisms 
COG ID[COG0834] ABC-type amino acid transport/signal transduction systems, periplasmic component/domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.306318 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGCCT TTCCGGCATT GCGGACCCTG CTGATCGCCG CCATGGCGTC CGGCCTTTCC 
TTCGGCGCAA CTCGTGCGGC CGACGATTTC GACCTGAGCC CGCAGCAGCC GGGCAGGCTG
CATGCCGCCA GGAACGAGGC GGCAATCGCT GCGATCCCCA AGGAGTTCAA GTTCGTCACG
CCAGGCAAGT TCACCATCGC CGTCAGTCCG GGCGGTCCGC CGCTTGCGAC CTATGCCACC
GACGCCAAGA CCGTCGTCGG GGCGGATCCC GATTATGCCT ATGCCATCGC CGACAGCCTC
GGCCTGACGC TGGAGATTGT GCCCGTGGCC TGGATCGACT GGCCGCTCGG CCTCGCCTCC
GGCAAGTATG ATGCCGTCAT TTCCAATGTC GGGGTCACCG AACAGCGCAA GGAGAAGTTC
GATTTCTCCA CCTATCGTCA GGGCCTGCAT GGCTTCTTCG TGAAATCCGA CAGCCCCATC
ACCTCGATCA AGCAGCCGAA GGATGCGGCG GGTTTGAGGA TCATCGTCGG GGCCGGAACC
AACCAGGAGC GCATCCTGGT GAAGTGGAGC GACGAGGATG TCGCCGCCGG CCTGAAGCCG
ATCGAGCTGC AATATTACGA CGACGAGGCG GCAAGCCTCC TCGCGCTCCG TTCCGGTCGG
GCCGATGTCA TCGTGCAGCC GCATGCGCAG CTCGTCTTCA TCGCGGCGCG CGACAAGAAC
ATCAAGCGTG TGGGCACGCT GAGCGCCGGC TGGCCCGATC GTTCCGACGT TGCGATCACC
ACCCGCAAGG GAAGCGGGCT CGCCGATGCG CTGACCGTCG CCACCAACGG CCTGATCAAG
GACGGCAGCT ACGCGAAGAT CCTCGATCAC TGGCATCTGT CCGAGGAAGC CTTGCCGGCA
TCCGAGACCA ATCCGCCCGG CCTGCCGAAA TACTGA
 
Protein sequence
MIAFPALRTL LIAAMASGLS FGATRAADDF DLSPQQPGRL HAARNEAAIA AIPKEFKFVT 
PGKFTIAVSP GGPPLATYAT DAKTVVGADP DYAYAIADSL GLTLEIVPVA WIDWPLGLAS
GKYDAVISNV GVTEQRKEKF DFSTYRQGLH GFFVKSDSPI TSIKQPKDAA GLRIIVGAGT
NQERILVKWS DEDVAAGLKP IELQYYDDEA ASLLALRSGR ADVIVQPHAQ LVFIAARDKN
IKRVGTLSAG WPDRSDVAIT TRKGSGLADA LTVATNGLIK DGSYAKILDH WHLSEEALPA
SETNPPGLPK Y