Gene Rleg2_2637 details


Gene Information

Locus tag: Rleg2_2637
Symbol:
ID: 6981380
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 2683507
End bp: 2684610
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 61%
IMG OID: 643397349
Product: extracellular solute-binding protein family 1
Protein accession: YP_002282134
Protein GI: 209550217
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1840] ABC-type Fe3+ transport system, periplasmic component
TIGRFAM ID:
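
The start, end, gene length, and protein length fields are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the protein length excludes the stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 2683507
END_BP = 2684610


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, "bp")                  # 1104 bp
print(protein_length(length_bp), "aa")  # 367 aa
```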


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.142963
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCTCTA ACATTTCTCG ACTTCTGTCG CTCTCTACTG CGATGATCGT GGCTTCGACC 
GCGATTGCCG CCGCCGAGCC GAGCGCTGAA CTTATCGCCG CCGCCAAGAA GGAAGGCACC
CTGACCACGA TCGCTCTTCC GCACGACTGG TGCGGCTACG GCGACGTCAT TGCCGGCTTC
AAGGCCAAGT ATGGCCTCGA AGTCAACGAA CTGAACCCGG ACGCCGGTTC GGGCGACGAA
GTCGAAGCCA TCAAGGCCAA CAAGGGCAAC ACCGGGCCGC AGGCTCCTGA CGTCATCGAC
GTCGGCCTCT CCTTCGGTCC GTCCGCCAAG AAGGACGGCC TGATCCAGCC TTACAAGGTT
TCCACCTGGG ATTCGATCCC GGACACGGCC AAGGATGCCG AAGGCTTCTG GTACGGCGAC
TATTACGGCG TTCTCTCGTT CCTCGTGAAC AAGGACCTCG TCAAGGAATC GCCGGCCGAC
TGGACCGACC TTAAGAAGAG CGACTACGCA AACACCGTCG CGCTTGCAGG CGATCCGCGC
AGCGCCAACC AGGCCGTCCA AGGCGTCTAT GCCGCTGGTC TTTCCGCATC CGGCGGTGAC
GCGGCCAAGG CAGGCGAAGA AGGCCTGAAG TTCTTTGCCG AACTCAACAA GGCTGGCAAC
TTCGTGCCCG TCGTCGGCAA GGCTGCTCCC TTCGCGCAGG GCTCGACGCC GATCATCGTC
GCCTGGGACT ACAATGCCCT GTCCTGGGGC CAGAGCCTCA AGGGCAATCC TCCGTTCGAG
GTTGTCGTTC CGAAGACGGG CGTCGTTGCC GGTGTCTACG TCCAGGCGAT TTCCGCCTTC
GCTCCGCACC CGAACGCTGC CAAGCTCTGG ATGGAATACC TCTATTCCGA CGAAGGTCAG
CTCGGCTGGC TGAAGGGCTA TTGCCACCCG ATCCGCTTCA ACGATCTTGC CAAGAACAAC
AAGATCCCGA AGGACCTGCT CGACAAGCTG CCGCCGGCAG CAGCCTATGA AAAGGCTGTT
TTCCCGACGC TCGAAGAGCA GGCCGCCGGC AAGGAAACCA TCACCAAGAA CTGGGATTCC
GTGGTTGGCG CCAGCGTCAA GTAA
 
Protein sequence
MISNISRLLS LSTAMIVAST AIAAAEPSAE LIAAAKKEGT LTTIALPHDW CGYGDVIAGF 
KAKYGLEVNE LNPDAGSGDE VEAIKANKGN TGPQAPDVID VGLSFGPSAK KDGLIQPYKV
STWDSIPDTA KDAEGFWYGD YYGVLSFLVN KDLVKESPAD WTDLKKSDYA NTVALAGDPR
SANQAVQGVY AAGLSASGGD AAKAGEEGLK FFAELNKAGN FVPVVGKAAP FAQGSTPIIV
AWDYNALSWG QSLKGNPPFE VVVPKTGVVA GVYVQAISAF APHPNAAKLW MEYLYSDEGQ
LGWLKGYCHP IRFNDLAKNN KIPKDLLDKL PPAAAYEKAV FPTLEEQAAG KETITKNWDS
VVGASVK
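
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG can serve as an initiation codon and is then read as methionine. The sketch below translates the first 60 bases of the gene to illustrate this. It is a minimal illustration, not the database's own pipeline: the codon table is built from the standard amino-acid ordering for bases T, C, A, G, and `ALT_START_CODONS` is a deliberately simplified subset of the table 11 start codons.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: simplified subset of alternative start codons, enough for this record.
ALT_START_CODONS = {"GTG", "TTG"}


def translate(dna):
    """Translate a CDS, reading an alternative start codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in ALT_START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record above.
GENE_PREFIX = "GTGATCTCTA ACATTTCTCG ACTTCTGTCG CTCTCTACTG CGATGATCGT GGCTTCGACC"
print(translate(GENE_PREFIX))  # MISNISRLLSLSTAMIVAST
```

The output matches the first 20 residues of the protein sequence listed above.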